Reasoning vs. Instruct Models
o-series, Claude extended thinking, Gemini reasoning.
Around 2024, a new class of model emerged: “reasoning” models that think for longer before answering. OpenAI’s o-series, Anthropic’s extended-thinking Claude, and Google’s Gemini reasoning models are all versions of the same idea.
The key insight — test-time compute
For years, AI capability was treated mainly as a function of training compute: the more compute spent during training, the smarter the model. The reasoning models showed that a model can also get smarter by spending more compute at inference time. The model generates a long internal chain-of-thought, evaluates multiple paths, and only commits to a final answer after extensive private reasoning.
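To get a feel for why extra inference-time compute can help, here is a toy calculation. It is not how any of these models work internally. It swaps in a much simpler technique, majority voting over repeated attempts, and assumes (purely for illustration) that each attempt is independent, that the answer is either right or wrong, and that a single attempt is correct with probability p. The function name `majority_accuracy` is made up for this sketch.

```python
from math import comb


def majority_accuracy(p: float, n: int) -> float:
    """Probability that a strict majority of n independent attempts is correct.

    Toy assumptions: each attempt is correct with probability p,
    attempts are independent, and n is odd so ties cannot occur.
    """
    if n % 2 == 0:
        raise ValueError("use an odd n so that ties cannot occur")
    needed = n // 2 + 1  # smallest number of correct attempts that wins the vote
    return sum(
        comb(n, k) * p**k * (1 - p) ** (n - k)
        for k in range(needed, n + 1)
    )


# More attempts means more inference-time compute spent on one question.
for attempts in (1, 3, 9, 27):
    print(attempts, round(majority_accuracy(0.6, attempts), 3))
```

With p = 0.6, a single attempt is right 60% of the time, while a three-way vote is right about 64.8% of the time, and the figure keeps rising as attempts are added. The same intuition, more work per question for a better answer, is what reasoning models exploit with a long chain-of-thought instead of a vote.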
When to use reasoning models
- Hard math and logic. Reasoning models dramatically outperform instruct models on benchmarks like AIME and FrontierMath.
- Multi-step code refactors. They catch their own bugs during the thinking phase.
- Strategy and planning. Whenever the answer requires considering multiple alternatives and comparing trade-offs.
When NOT to use them
- Fast conversational turns. They’re slow and often expensive per query.
- Creative writing. The long deliberation can flatten a model’s voice.
- Simple retrieval. An instruct model is fine.
The hybrid future
Modern APIs increasingly let you set a reasoning_effort or thinking budget parameter per request, so a single model can toggle between fast instruct behavior and deep reasoning based on the task. As a developer, that’s the primary trade-off you’re tuning: latency and cost vs. accuracy.
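A minimal sketch of that per-request tuning might look like the following. Everything here is an assumption for illustration: the task categories are lifted from the two lists in this lesson, the effort values "low", "medium", and "high" are placeholders, and the request is a plain dictionary rather than any particular provider's payload. Check your provider's documentation for the real parameter name and accepted values.

```python
# Hypothetical task categories, based on the lists in this lesson.
DEEP_TASKS = {"math", "logic", "refactor", "planning"}
FAST_TASKS = {"chat", "creative_writing", "retrieval"}


def choose_effort(task_type: str) -> str:
    """Pick an illustrative effort level for a task category."""
    if task_type in DEEP_TASKS:
        return "high"    # accept more latency and cost for accuracy
    if task_type in FAST_TASKS:
        return "low"     # keep the turn fast and cheap
    return "medium"      # unknown task: split the difference


def build_request(prompt: str, task_type: str) -> dict:
    """Build a provider-agnostic request dict (not a real API payload)."""
    return {
        "prompt": prompt,
        "reasoning_effort": choose_effort(task_type),
    }


print(build_request("Prove that the sum of two odd numbers is even.", "math"))
print(build_request("What's the capital of France?", "retrieval"))
```

Keeping the routing rule in one small function makes the trade-off explicit and easy to adjust once you have measured latency, cost, and accuracy on your own tasks.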
