Claude Opus 4.7 vs. Sonnet 4.6 — which Claude to pick
Opus is the flagship, Sonnet is the workhorse. Here is the five-minute decision tree for when to pay the Opus premium (about 1.7x per token at the list prices below) and when Sonnet handles it.
Lesson map
1. Two Claudes, one question
2. What you are actually paying for
3. The API call looks the same
Section 1
Two Claudes, one question
Open the Anthropic model picker and you will see Opus 4.7 at the top and Sonnet 4.6 right under it. Both have a 1 million token context window, both handle vision, both run extended thinking. So why does Opus cost about 1.7 times as much per token ($25 vs. $15 per million output tokens)? The honest answer: Opus earns that premium on a small slice of tasks. For the remaining 80% or so of your work, Sonnet is the right call.
Section 2
What you are actually paying for
Compare the options
| Dimension | Claude Opus 4.7 | Claude Sonnet 4.6 |
|---|---|---|
| API price | $5 in / $25 out per M tokens | $3 in / $15 out per M tokens |
| Context window | 1,000,000 tokens | 1,000,000 tokens |
| Extended thinking | Yes, deeper budget | Yes |
| Release date | April 16, 2026 | 2025 |
| Best at | multi-file refactors, legal briefs, tough coding agents | everyday chat, code review, document QA |
| Speed | Slower per token | Faster per token |
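To see what the price row means for a single request, here is a minimal sketch that turns the list prices from the table into a per-request cost. The `PRICES` dictionary and `request_cost` helper are illustrative names, not part of any SDK, and the token counts in the usage lines are made-up examples.

```python
# Illustrative only: prices copied from the table above, in USD per million tokens.
PRICES = {
    "opus": {"input": 5.00, "output": 25.00},
    "sonnet": {"input": 3.00, "output": 15.00},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the list prices above."""
    price = PRICES[model]
    return (input_tokens * price["input"] + output_tokens * price["output"]) / 1_000_000


# Hypothetical request: 20,000 tokens in, 2,000 tokens out.
opus_cost = request_cost("opus", 20_000, 2_000)
sonnet_cost = request_cost("sonnet", 20_000, 2_000)
print(f"Opus: ${opus_cost:.2f}  Sonnet: ${sonnet_cost:.2f}  ratio: {opus_cost / sonnet_cost:.2f}x")
```

Because both the input and output prices differ by the same factor (5/3), the ratio stays near 1.67x whatever the mix of input and output tokens.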
Reach for Opus when
- You need to refactor code that spans 5+ files and depends on subtle invariants
- You are drafting a legal brief, policy memo, or regulatory filing where mistakes cost real money
- You are pushing a long-horizon agent (Claude Code on a complex branch, auto-PR generation)
- You are analyzing a 500-page report and the answer matters for a real decision
Sonnet is enough for
- Customer-support chatbot answering product questions
- Daily email drafts, meeting notes, summaries
- Code review on single files or small PRs
- Routine document Q&A where citations matter more than depth
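The two checklists above can be sketched as a small routing function. This is one possible encoding, not an official rule: the `Task` fields and the `pick_model` helper are hypothetical, the 5-file and 500-page thresholds are lifted loosely from the bullets, and the model IDs are the ones used in this lesson's API example.

```python
from dataclasses import dataclass


@dataclass
class Task:
    """Hypothetical description of a job; the fields mirror the checklists above."""
    files_touched: int = 1             # how many files a code change spans
    high_stakes: bool = False          # legal, policy, or regulatory output
    long_horizon_agent: bool = False   # multi-step agent run on a complex branch
    pages: int = 0                     # length of the document being analyzed


def pick_model(task: Task) -> str:
    """Return Opus for the hard cases listed above, Sonnet otherwise."""
    needs_opus = (
        task.files_touched >= 5
        or task.high_stakes
        or task.long_horizon_agent
        or task.pages >= 500
    )
    return "claude-opus-4-7" if needs_opus else "claude-sonnet-4-6"


print(pick_model(Task(files_touched=7)))  # multi-file refactor
print(pick_model(Task(pages=12)))         # routine document Q&A
```

Defaulting to Sonnet and escalating only on explicit signals keeps the cheaper model as the baseline, which matches the lesson's advice.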
Section 3
The API call looks the same
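Both calls share the same request shape, so switching models means changing the model string. The Opus example also turns on extended thinking, which requires max_tokens to be larger than the thinking budget.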
```python
from anthropic import Anthropic

# Reads ANTHROPIC_API_KEY from the environment
client = Anthropic()

# Opus for a hard coding job
opus_reply = client.messages.create(
    model="claude-opus-4-7",
    # max_tokens must exceed the thinking budget, which counts toward it
    max_tokens=12000,
    thinking={"type": "enabled", "budget_tokens": 8000},
    messages=[{"role": "user", "content": "Refactor this multi-file auth module..."}],
)

# Sonnet for everyday chat: same shape, cheaper
sonnet_reply = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Summarize this meeting transcript."}],
)
```
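If you switch models often, it can help to build the request in one place. The sketch below is a hypothetical helper (`build_request` is not part of the Anthropic SDK) that assembles the keyword arguments for `client.messages.create` and rejects a thinking budget that does not fit under `max_tokens`. The shape of the `thinking` payload is copied from the lesson's example.

```python
from typing import Any, Optional


def build_request(model: str, prompt: str, max_tokens: int = 1024,
                  thinking_budget: Optional[int] = None) -> dict[str, Any]:
    """Assemble keyword arguments for client.messages.create (hypothetical helper)."""
    request: dict[str, Any] = {
        "model": model,
        "max_tokens": max_tokens,
        "messages": [{"role": "user", "content": prompt}],
    }
    if thinking_budget is not None:
        # The thinking budget counts toward max_tokens, so leave room for the answer.
        if thinking_budget >= max_tokens:
            raise ValueError("thinking_budget must be smaller than max_tokens")
        request["thinking"] = {"type": "enabled", "budget_tokens": thinking_budget}
    return request


opus_request = build_request("claude-opus-4-7", "Refactor this multi-file auth module...",
                             max_tokens=12000, thinking_budget=8000)
sonnet_request = build_request("claude-sonnet-4-6", "Summarize this meeting transcript.")

# Usage, assuming `client = Anthropic()` as in the lesson's snippet:
# reply = client.messages.create(**opus_request)
```

Keeping the helper free of network calls means the model choice and token limits can be checked before any request is sent.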
