Reasoning Models: OpenAI o1 and After
In 2024, a new class of models traded fast answers for slow, deliberate thinking, and benchmarks jumped.
Lesson map
What this lesson covers
Learning path
The main moves in order
- 1. Thinking Longer, On Purpose
- 2. Reasoning models
- 3. o1
- 4. Inference-time compute
Section 1
Thinking Longer, On Purpose
In September 2024, OpenAI previewed o1, a model that spent extra compute before answering, generating long internal chains of reasoning. On hard math, coding, and science benchmarks, o1 leapt past GPT-4o, sometimes by double-digit points on tests where progress had been crawling.
The core idea was not prompt-level chain of thought. It was training the model, often through reinforcement learning, to use its own thinking tokens effectively, and then letting it spend as many of those tokens as needed at inference time.
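OpenAI has not published o1's training recipe, but the underlying idea of spending more inference-time compute to get better answers can be illustrated with a simpler, openly known technique: self-consistency sampling, where the model generates several independent reasoning chains and the final answers are majority-voted. The sketch below uses a hypothetical noisy stub solver in place of a real model, so it stays runnable:

```python
import random
from collections import Counter

def sample_answer(problem: str, rng: random.Random) -> str:
    # Stand-in for one sampled reasoning chain from a model.
    # Hypothetical stub: a noisy solver for "23 * 17" that is
    # right about 70% of the time and otherwise slightly off.
    if rng.random() < 0.7:
        return "391"
    return str(391 + rng.choice([-2, -1, 1, 2]))

def answer_with_more_compute(problem: str, n_samples: int, seed: int = 0) -> str:
    # Inference-time scaling via self-consistency: draw n independent
    # samples, then majority-vote on the final answers. More samples
    # means more compute spent at inference time, and a more reliable
    # answer -- without retraining the model.
    rng = random.Random(seed)
    votes = Counter(sample_answer(problem, rng) for _ in range(n_samples))
    return votes.most_common(1)[0][0]

print(answer_with_more_compute("23 * 17", n_samples=25))  # voting drowns out the noise
```

The key design point: accuracy here improves by changing only `n_samples` at inference time, not the model itself. Reasoning models like o1 go further by *training* the model (via reinforcement learning) to use its thinking tokens well, but the compute-for-quality trade is the same.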
What reasoning models do well
- Multi-step math and proofs where intermediate errors compound
- Competitive programming problems requiring search
- Scientific reasoning on benchmarks like GPQA Diamond
- Agentic tasks that benefit from planning and reflection
Competitors followed quickly. Google's Gemini 2.0 Flash Thinking, DeepSeek's R1 in early 2025, and Anthropic's extended thinking mode all adopted variants of the paradigm. Some published training recipes openly; others kept them secret.
“We've developed a new series of AI models designed to spend more time thinking before they respond.” — OpenAI, announcing o1 (September 2024)
The big idea: reasoning models reopened the scaling frontier by moving compute from training time into inference time. A model that can think longer is a different kind of model.
