Asking a model to 'think step by step' often makes it better at hard problems. Here is why, and when it fails.
Chain-of-thought (CoT) prompting builds on the finding that LLMs often perform much better on multi-step problems when asked to reason aloud rather than answer in one shot. Wei et al. (2022) documented this effect across arithmetic, commonsense, and symbolic reasoning tasks.
Without CoT:
Q: If I have 5 apples and buy 3 more, then eat 2, how many do I have?
A: 6

With CoT:
Q: (same)
A: Let me think step by step. Start with 5 apples. Buy 3 more → 5 + 3 = 8. Eat 2 → 8 - 2 = 6. The answer is 6.

CoT looks verbose but consistently improves correctness on multi-step problems.

Since 2024, models like OpenAI's o1, o3, Claude with extended thinking, and DeepSeek-R1 have been explicitly trained to reason for a long time before answering. Test-time compute is now a core axis of capability, distinct from parameter scaling.
| Standard CoT (prompted) | Trained reasoning model |
|---|---|
| Works at inference only | Baked into training |
| Quality depends on the prompt | Robust to prompt wording |
| Limited by context length | Long internal deliberation |
| Can be coaxed to answer faster | Explicitly uses more compute per query |
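To make the prompted column concrete, here is a minimal sketch of how the apple example could be set up in code. It only builds the prompt text and parses a completion; it does not call any model. The names `COT_CUE`, `build_prompt`, and `extract_answer` are hypothetical helpers introduced for illustration, and the "The answer is N" convention is an assumption taken from the worked example in this lesson, not a format every model will follow.

```python
import re
from typing import Optional

# Assumption: the cue phrase used in this lesson's worked example.
COT_CUE = "Let me think step by step."


def build_prompt(question: str, use_cot: bool) -> str:
    """Return a direct prompt, or one that cues step-by-step reasoning."""
    prompt = f"Q: {question}\nA:"
    if use_cot:
        # Seeding the answer with the cue nudges the model to write out steps.
        prompt += f" {COT_CUE}"
    return prompt


def extract_answer(completion: str) -> Optional[str]:
    """Return the number in the last 'The answer is N' sentence, or None."""
    matches = re.findall(r"The answer is\s+(-?\d+(?:\.\d+)?)", completion)
    return matches[-1] if matches else None


question = "If I have 5 apples and buy 3 more, then eat 2, how many do I have?"
print(build_prompt(question, use_cot=True))

# A completion copied from the lesson's example, not real model output.
sample = (
    "Start with 5 apples. Buy 3 more → 5 + 3 = 8. "
    "Eat 2 → 8 - 2 = 6. The answer is 6."
)
print(extract_answer(sample))  # prints 6
```

Because the quality of prompted CoT depends on the prompt, the cue phrase and the answer pattern are the two parts most likely to need adjusting for a given model.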
Chain-of-thought prompting significantly improves the ability of large language models to perform complex reasoning.
— Wei et al., Chain-of-Thought Prompting (2022)
The big idea: making thinking visible usually makes thinking better. But visible is not the same as faithful — and that distinction matters.
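One practical consequence of visible reasoning is that the written steps can be checked. The sketch below re-computes every simple "a op b = c" equation found in a reasoning trace. The pattern `STEP` and the function `check_steps` are hypothetical names for illustration, and the sketch assumes integer steps written with `+`, `-`, or `*`. Note what it does not show: a trace whose arithmetic checks out is internally consistent, which is a weaker property than being a faithful account of how the model reached its answer.

```python
import re
from typing import List, Tuple

# Assumption: steps appear as "a op b = c" with integer operands.
STEP = re.compile(r"(-?\d+)\s*([+\-*])\s*(-?\d+)\s*=\s*(-?\d+)")

OPS = {
    "+": lambda a, b: a + b,
    "-": lambda a, b: a - b,
    "*": lambda a, b: a * b,
}


def check_steps(trace: str) -> List[Tuple[str, bool]]:
    """Return each equation found in the trace with whether it holds."""
    results = []
    for match in STEP.finditer(trace):
        a, op, b, claimed = match.groups()
        holds = OPS[op](int(a), int(b)) == int(claimed)
        results.append((match.group(0), holds))
    return results


good = "Start with 5 apples. Buy 3 more → 5 + 3 = 8. Eat 2 → 8 - 2 = 6."
bad = "Start with 5 apples. Buy 3 more → 5 + 3 = 9. Eat 2 → 9 - 2 = 7."

print(check_steps(good))  # both steps hold
print(check_steps(bad))   # the first step fails, the second holds
```

A check like this catches arithmetic slips in the visible steps, which is one way to verify an answer before trusting it.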
8 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-creators-chain-of-thought
What is the main idea of "Chain-of-Thought Mechanics"?
Which concept is most central to "Chain-of-Thought Mechanics"?
Which use of AI fits this topic best?
What should a careful learner remember about "Faithfulness is an open question"?
You want to use AI after this lesson. What is the safest next step?
How should AI output about chain of thought be treated?
Name one way to verify an AI answer about chain of thought.
Which action would help you apply "Chain-of-Thought Mechanics" responsibly?