Training
(how they learn)
Training is the process that turns a blank network into a useful one. It is a loop. The model makes a guess. The guess gets scored. The score tells the model how wrong it was. The model adjusts every weight slightly to be less wrong next time. Then it does that again. A trillion times. That is training.
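The loop above can be sketched in a few lines. This is a minimal illustration, not how real systems are built: one weight, a squared-error score, and a made-up dataset where the correct relationship is y = 3x.

```python
# A minimal sketch of the training loop: guess, score, adjust, repeat.
# One weight, squared-error loss, toy data following y = 3x.
data = [(1.0, 3.0), (2.0, 6.0), (3.0, 9.0)]  # (input, correct answer)

w = 0.5              # an arbitrary starting value
learning_rate = 0.01

for step in range(1000):               # the loop, repeated many times
    for x, y_true in data:
        guess = w * x                  # the model makes a guess
        error = guess - y_true         # the guess gets scored
        gradient = 2 * error * x       # how wrong, and in which direction
        w -= learning_rate * gradient  # adjust the weight to be less wrong

print(round(w, 3))  # → 3.0: the weight has learned the relationship
```

A real model does exactly this, just with billions of weights instead of one.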
The process
Every weight in a neural network starts as a random number. The forward pass runs an input through the network and produces a prediction. The loss function measures how far off that prediction was from the correct answer. The further off, the higher the loss.
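The forward pass and the loss can be shown with a single made-up example. The "network" here is a lone neuron with one randomly initialised weight; the target value is an assumption for illustration.

```python
import random

# One forward pass and its loss, for a one-neuron "network".
random.seed(0)
w = random.uniform(-1, 1)   # every weight starts as a random number

x, y_true = 2.0, 6.0        # one labelled example (the answer is 6.0)

prediction = w * x                 # forward pass: input runs through the network
loss = (prediction - y_true) ** 2  # squared error: further off, higher loss

print(loss > 0)  # → True: the random weight's guess is wrong
```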
Backpropagation works backwards through the network. It calculates how much each weight contributed to the error. Weights that caused more error get nudged more. Weights that were fine get nudged less. This nudge is the learning step.
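The "nudge in proportion to blame" idea can be demonstrated with two weights, where one input drives the output far more than the other. The numbers here are invented for illustration.

```python
# Two weights, one prediction: prediction = w1*x1 + w2*x2.
w1, w2 = 0.5, 0.5
x1, x2 = 4.0, 0.1          # x1 influences the output far more than x2
y_true = 1.0
lr = 0.01

prediction = w1 * x1 + w2 * x2
error = prediction - y_true

# Backpropagation assigns blame: each weight's gradient is the error
# scaled by how much that weight influenced the output.
grad_w1 = 2 * error * x1
grad_w2 = 2 * error * x2

w1 -= lr * grad_w1         # the learning step
w2 -= lr * grad_w2

# The weight that contributed more to the error moved more.
print(abs(grad_w1) > abs(grad_w2))  # → True
```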
The size of each nudge is called the learning rate. Too large and the model overshoots and never settles. Too small and training takes forever. Getting this right is half the work.
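Both failure modes are easy to reproduce. The sketch below minimises the loss (w − 3)², whose gradient is 2(w − 3), with three assumed learning rates.

```python
# Gradient descent on loss = (w - 3)^2, starting from w = 0.
def train(lr, steps=50):
    w = 0.0
    for _ in range(steps):
        gradient = 2 * (w - 3.0)   # derivative of (w - 3)^2
        w -= lr * gradient
    return w

too_large  = train(lr=1.5)    # overshoots further every step, diverges
too_small  = train(lr=0.001)  # creeps toward 3, nowhere near after 50 steps
just_right = train(lr=0.1)    # settles at 3
```

With lr=1.5 each step flips the sign of the error and doubles it, so the weight explodes; with lr=0.001 it barely moves; lr=0.1 lands on the answer.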
Two ways to train
Classical training
You show the model labelled examples. It makes a guess. The loss measures how wrong it was. The weight update nudges every dial to make the next guess slightly better. Repeat until the loss is small.
A decision tree trains in seconds. A spam filter trains in minutes.
During training, a large language model sees each token in its training data and must predict the next one. Recent models do this across roughly 15 trillion tokens, orders of magnitude more text than a person could read in a lifetime.
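The task itself fits in a few lines. Real LLMs learn it by gradient descent over trillions of tokens; the toy model below just counts which token follows which in a made-up corpus, but the objective — given this token, predict the next — is the same.

```python
from collections import Counter, defaultdict

# A toy next-token predictor: a bigram table built by counting.
corpus = "the cat sat on the mat the cat ran".split()

next_counts = defaultdict(Counter)
for token, next_token in zip(corpus, corpus[1:]):
    next_counts[token][next_token] += 1   # every adjacent pair is a lesson

def predict(token):
    # Guess the most frequently observed next token.
    return next_counts[token].most_common(1)[0][0]

print(predict("the"))  # → "cat" (follows "the" twice; "mat" only once)
```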
A familiar example
Think of a student studying for an exam with a practice paper. They answer a question. They check the mark scheme. They see which parts they got wrong. They go back and study those parts harder. Then they take another practice paper. Then another. After enough practice papers, the exam is easy. Training is that loop, but instead of a student and a practice paper, it is a model and 15 trillion tokens.
The surprising part
Nobody programs the model to learn facts about chemistry, history, or code. It picks those up as side effects of getting better at predicting the next word. A model trained to complete sentences about medicine learns medical facts, not because anyone told it to, but because those facts make its predictions more accurate. What emerges from the training loop is not what anyone designed.
Your takeaway
The model you are talking to was shaped entirely by what it was wrong about, billions of times, on text it will never see again. Every capability it has, and every gap, traces back to what it was trained to predict.