AI & LLM Glossary

Plain-English definitions of the AI terms you'll see thrown around, from attention to zero-shot, with links to the models and tools where each concept matters.

Core AI terms

LLM (Large Language Model): An AI model trained on huge amounts of text that can read and generate natural language. ChatGPT, Claude and Gemini are all LLMs.
Foundation model: A large general-purpose AI model (like GPT-4 or Claude Sonnet) that's been trained on broad data and can be adapted for many tasks.
Transformer: The neural network architecture introduced in 2017 that underpins virtually every modern LLM, including the models behind ChatGPT, Claude and Gemini.
Token: The unit AI models read and write in. For English text, one token is roughly 4 characters or 0.75 words. Pricing and context windows are measured in tokens.
Context window: How much text an AI can process at once, measured in tokens. A bigger context window lets the model handle longer documents, longer chats, and more complex tasks.
Multimodal: A model that can handle multiple input types (text, images, audio, video), not just text.
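
To make the token and context window definitions concrete, here is a minimal Python sketch of the "roughly 4 characters per token" rule of thumb. The function names and the tiny context window of 8 tokens are made up for illustration; real tokenizers split text differently, so treat the result as a rough estimate rather than an exact count.

```python
import math

# Rule of thumb from the Token definition; real tokenizers vary by model and language.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Rough token estimate: about one token per 4 characters, rounded up."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def fits_context(text: str, context_window: int) -> bool:
    """True if the estimated token count fits inside the context window."""
    return estimate_tokens(text) <= context_window


sample = "Tokens are the unit AI models read and write in."
print(estimate_tokens(sample))                 # estimated tokens for the sample
print(fits_context(sample, context_window=8))  # 8 is a hypothetical, tiny window
```

For billing or hard limits, use the tokenizer that belongs to the specific model rather than an estimate like this.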

B

BERT

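An older Google model (2018) that was a major step towards today's LLMs. It is built to understand text rather than generate it, so it is mostly used for tasks like search ranking (including inside Google Search), not for chat.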

E

Embedding

A list of numbers that represents a piece of text in a way that lets computers measure 'similarity' mathematically. The foundation of semantic search and RAG.
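
One common way to measure that similarity is cosine similarity, which compares the direction of two lists of numbers. The sketch below uses made-up three-number embeddings purely for illustration; real embeddings come from an embedding model and usually have hundreds or thousands of numbers.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Close to 1.0 means very similar direction; close to 0.0 means unrelated."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical embeddings, hand-written for this example (not from a real model).
embeddings = {
    "cat": [0.9, 0.8, 0.1],
    "kitten": [0.85, 0.9, 0.15],
    "invoice": [0.1, 0.2, 0.95],
}


def most_similar(query: str, table: dict[str, list[float]]) -> str:
    """Return the other entry whose embedding is closest to the query's."""
    others = (word for word in table if word != query)
    return max(others, key=lambda w: cosine_similarity(table[query], table[w]))


print(most_similar("cat", embeddings))
```

Semantic search works the same way at scale: embed the query, then return the stored items whose embeddings score highest against it.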

H

Hallucination

When an AI confidently states something false. It is one of the biggest reliability issues with LLMs, and understanding hallucinations helps you use AI more safely.

I

Inference

Running a trained model to get an answer. Distinct from training, which is teaching the model in the first place.
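
The split between training and inference can be shown with a toy model that has a single adjustable number. This is only an illustrative sketch: the `train` and `infer` names, the data, and the learning rate are all made up, and a real LLM has billions of weights rather than one.

```python
def train(examples, steps=200, learning_rate=0.01):
    """Training: repeatedly nudge the weight w so that w * x gets closer to y."""
    w = 0.0
    for _ in range(steps):
        for x, y in examples:
            error = w * x - y
            # Move w a small step in the direction that shrinks the error.
            w -= learning_rate * error * x
    return w


def infer(w, x):
    """Inference: use the already-trained weight to answer; nothing is updated."""
    return w * x


# Toy data following y = 2 * x (hypothetical, for illustration only).
examples = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
w = train(examples)               # slow, done once
print(round(infer(w, 10.0), 2))   # fast, done every time someone asks
```

Training is the expensive one-off step; inference is what happens each time you send a prompt.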

J

Jailbreak

A prompt that tricks an AI into ignoring its safety training and doing something it normally refuses.

K

Knowledge cutoff

The date after which the AI doesn't know about world events, because its training data ends there. ChatGPT, Claude and Gemini all have one. For current events, use a tool that searches the web, such as Perplexity.

L

LLM (Large Language Model)

An AI model trained on huge amounts of text that can read and generate natural language. ChatGPT, Claude and Gemini are all LLMs.

N

Neural network

The mathematical structure that LLMs are built from: billions of simple equations connected together to learn patterns.
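
As a rough picture of one of those "simple equations", the sketch below builds a tiny network by hand. The weights are hand-picked for illustration (a real network learns them during training), and the sigmoid squashing function is just one common choice, not necessarily what any particular LLM uses.

```python
import math


def neuron(inputs, weights, bias):
    """One simple equation: a weighted sum of the inputs, then a squashing step."""
    total = sum(i * w for i, w in zip(inputs, weights)) + bias
    return 1 / (1 + math.exp(-total))  # sigmoid: maps any number into (0, 1)


def layer(inputs, weight_rows, biases):
    """A layer is several neurons that all look at the same inputs."""
    return [neuron(inputs, w, b) for w, b in zip(weight_rows, biases)]


# Hypothetical weights: two inputs -> two hidden neurons -> one output neuron.
hidden = layer([1.0, 0.0], [[0.5, -0.5], [-1.0, 1.0]], [0.0, 0.0])
output = neuron(hidden, [1.0, -1.0], 0.0)
print(hidden, output)
```

Connecting neurons so that one layer's outputs feed the next layer's inputs is what turns single equations into a network.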

O

OpenAI

The AI lab behind ChatGPT and the GPT family of models. Founded in 2015, now valued in the hundreds of billions of dollars.

V

Vision model

An AI model that can understand images, not just text. Most modern flagship LLMs are now vision models.

Z

Zero-shot

Asking the AI to do a task without giving it any worked examples in the prompt. The opposite of few-shot prompting, where you include a handful of examples first.
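
The difference is easiest to see in the prompt text itself. Here is a minimal sketch that builds both kinds of prompt as plain strings; the function names and the "Text:" / "Answer:" layout are assumptions for illustration, not a format any particular model requires.

```python
def zero_shot_prompt(task: str, text: str) -> str:
    """Zero-shot: the instruction and the input, with no worked examples."""
    return f"{task}\n\nText: {text}\nAnswer:"


def few_shot_prompt(task: str, examples: list[tuple[str, str]], text: str) -> str:
    """Few-shot: the same instruction, preceded by a few solved examples."""
    shots = "\n\n".join(f"Text: {t}\nAnswer: {a}" for t, a in examples)
    return f"{task}\n\n{shots}\n\nText: {text}\nAnswer:"


task = "Classify the sentiment of the review as positive or negative."
examples = [
    ("Loved it, would buy again.", "positive"),
    ("Broke after two days.", "negative"),
]

zero = zero_shot_prompt(task, "Arrived late but works fine.")
few = few_shot_prompt(task, examples, "Arrived late but works fine.")
print(zero)
print("---")
print(few)
```

Either string would then be sent to a model. Few-shot prompts use more tokens but often help when the task or the expected output format is unusual.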