Core AI

Large Language Model (LLM)

Last reviewed: April 2026

A type of AI trained on vast amounts of text to understand and generate human language. ChatGPT, Claude, and Gemini are all LLMs.

A large language model is an AI system trained on enormous quantities of text — books, articles, websites, code, conversations — to understand and generate human language. When you use ChatGPT, Claude, or Gemini, you are interacting with an LLM.

What makes them "large"

The "large" in large language model refers to two things: the size of the training data (often trillions of words) and the number of parameters in the model. Parameters are the internal settings the model adjusts during training to improve its predictions. Modern LLMs have hundreds of billions of parameters. More parameters generally means more nuanced understanding, but also more computational cost to run.

How LLMs generate text

LLMs work by predicting the next word (technically, the next token) in a sequence. When you type a prompt, the model calculates the probability of every possible next word, picks one, then repeats the process for the word after that, and so on. This happens incredibly fast — often generating hundreds of words per second.

This prediction mechanism is why LLMs can write essays, answer questions, summarise documents, translate languages, and write code. They have seen so many examples of each task during training that their predictions are remarkably useful.

What LLMs can and cannot do

LLMs excel at language tasks: writing, editing, summarising, translating, explaining, brainstorming, and analysing text. They can also handle structured reasoning, coding, and data analysis when given clear instructions.

However, LLMs have important limitations:

They do not have access to real-time information unless connected to the internet or external tools. Their knowledge has a training cutoff date.
They can hallucinate — generate plausible-sounding but incorrect information — because they are optimising for statistically likely text, not verified truth.
They do not remember previous conversations unless the conversation is within the same context window.
They struggle with precise mathematical computation, though this is improving rapidly.

The major LLMs you should know

Claude (Anthropic) — Known for nuanced reasoning, long context windows, and careful handling of sensitive topics.
GPT-5.4/ChatGPT (OpenAI) — The most widely known LLM, strong across general tasks.
Gemini (Google) — Integrated with Google's ecosystem, strong at search and multimodal tasks.
Llama (Meta) — An open-source model that can be self-hosted, giving organisations more control over their data.

Each model has different strengths, pricing, and privacy characteristics. The right choice depends on your specific use case, budget, and data sensitivity requirements.

Want to go deeper?

This topic is covered in our Foundations level. Access all 100+ lessons free.

Why This Matters

LLMs are the technology behind every AI assistant your team is using or considering. Understanding how they work helps you set realistic expectations, choose the right model for each task, and avoid common pitfalls like treating AI output as fact without verification. When your CEO asks "should we use ChatGPT or Claude?", knowing what an LLM actually does lets you give an informed answer instead of guessing.

Related Terms

Artificial Intelligence (AI)

Software that can perform tasks that normally require human intelligence, such as understanding language, recognising patterns, and making decisions.

Token

The smallest unit of text an AI model processes. Roughly 3-4 characters or three-quarters of a word. AI pricing is typically measured in tokens.

Context Window

The maximum amount of text an AI can process at once. Think of it as the AI's working memory — everything it can see and consider when generating a response.

Transformer

The neural network architecture behind modern AI assistants like ChatGPT and Claude. Introduced in 2017, it processes all words simultaneously using an attention mechanism.

Hallucination

When AI generates confident but incorrect information. The AI is not lying — it is producing statistically plausible text that happens to be wrong.

Inference

The process of an AI model generating output from your input. Every time you send a prompt and get a response, that is inference.

Related Comparisons

ChatGPT vs Claude

An honest, detailed comparison of ChatGPT and Claude across writing quality, accuracy, context window, coding, pricing, and more. Updated for 2026.

ChatGPT vs Gemini

A detailed comparison of ChatGPT and Google Gemini across writing, research, Google Workspace integration, pricing, and real-world performance.

Cursor vs GitHub Copilot

Cursor and GitHub Copilot are the two leading AI coding assistants. This comparison covers features, model support, pricing, and which is better for different developers.

Learn More

Continue learning in Foundations

This topic is covered in our lesson: How Large Language Models Actually Work

← Back to Glossary