Reasoning vs. Instruct Models

o-series, Claude extended thinking, Gemini reasoning.

CreatorsCreators~27 min readInteractiveBI2 · Representation & ReasoningPrint / PDF

Around 2024, a new class of model emerged: “reasoning” models that think for longer before answering. OpenAI’s o-series, Anthropic’s extended-thinking Claude, and Google’s Gemini reasoning models are all versions of the same idea.

The key insight — test-time compute

For years, AI capability was a function of training compute: more compute during training = smarter model. The reasoning models showed that you can also get smarter by spending more compute at inference time. The model generates a long internal chain-of-thought, evaluates multiple paths, and only commits to a final answer after extensive private reasoning.

When to use reasoning models

Hard math and logic. Reasoning models dramatically outperform instruct models on benchmarks like AIME and FrontierMath.
Multi-step code refactors. They catch their own bugs during the thinking phase.
Strategy and planning. Whenever the answer requires considering multiple alternatives and comparing trade-offs.

When NOT to use them

Fast conversational turns. They’re slow and often expensive per query.
Creative writing. The long deliberation can flatten a model’s voice.
Simple retrieval. An instruct model is fine.

The hybrid future

Modern APIs increasingly let you set a reasoning_effort or thinking budgetparameter per request — so a single model can toggle between fast instruct behavior and deep reasoning based on the task. As a developer, that’s the primary trade-off you’re tuning: latency and cost vs. accuracy.

Tutor

Curious about “Reasoning vs. Instruct Models”?

Ask anything about this lesson. I’ll answer using just what you’re reading — short, friendly, grounded.

Progress saved locally in this browser. Sign in to sync across devices.

Related lessons

Reasoning vs. Instruct Models

The key insight — test-time compute

When to use reasoning models

When NOT to use them

The hybrid future

Curious about “Reasoning vs. Instruct Models”?

Keep going

Reasoning vs. Instruct Models

The key insight — test-time compute

When to use reasoning models

When NOT to use them

The hybrid future

Curious about “Reasoning vs. Instruct Models”?

Keep going