Back to Glossary
AI Basics
What is Large Language Model (LLM)?
An AI system trained on massive text data to understand and generate human language.
A Large Language Model (LLM) is an AI system trained on enormous amounts of text data to understand, generate, and manipulate human language.
Examples of LLMs
- GPT-4 (OpenAI) - Powers ChatGPT
- Claude (Anthropic) - Known for safety
- Gemini (Google) - Multimodal capabilities
- LLaMA (Meta) - Open source
- Mistral - Efficient open source model
Why "Large"?
These models have billions of parameters (settings the model learns):
- GPT-3: 175 billion parameters
- GPT-4: Estimated 1+ trillion parameters
- Requires massive computing power to train
What LLMs Can Do
- Answer questions and explain concepts
- Write articles, emails, and code
- Translate between languages
- Summarize long documents
- Have conversations
- Analyze and extract information
Limitations
- No real understanding (pattern matching, not reasoning)
- Can produce confident but wrong answers
- Training data biases
- Can't learn from conversations
- Knowledge cutoffs