Lesson 657 of 1596
When to Fine-Tune vs When to Just Prompt: A Decision Framework
Fine-tuning is expensive and slow to iterate on. Prompting is fast and free. Knowing when fine-tuning actually pays off saves teams from premature optimization.
Creators · Model Families · ~24 min read
The premise
Fine-tuning is rarely the right first move; most teams should exhaust prompting + RAG before considering fine-tuning.
What AI does well here
- Try prompt engineering first — well-engineered prompts often match fine-tuning performance at zero cost
- Try RAG second when knowledge or domain context is the gap
- Consider fine-tuning when you have: stable use case, large labeled dataset, latency or cost issues prompt engineering can't solve
- Use LoRA / parameter-efficient methods rather than full fine-tuning when possible
What AI cannot do
- Make a bad use case good with fine-tuning
- Substitute for high-quality training data — fine-tuning amplifies data quality, good or bad
- Eliminate the iteration cost — fine-tuning slows your iteration speed dramatically
Key terms in this lesson
End-of-lesson quiz
Check what stuck
10 questions · Score saves to your progress.
Tutor
Curious about “When to Fine-Tune vs When to Just Prompt: A Decision Framework”?
Ask anything about this lesson. I’ll answer using just what you’re reading — short, friendly, grounded.
Progress saved locally in this browser. Sign in to sync across devices.
Related lessons
Keep going
Builders · 8 min
When Fine-Tuning Actually Beats Just Writing a Better Prompt
Fine-tune for style and format consistency at high volume; for everything else, prompt better first.
Creators · 40 min
Context Window Strategy: When You Have Millions of Tokens
Frontier models offer massive context windows. Using them effectively requires understanding what context helps vs costs.
Creators · 11 min
How prompt portability differs between Claude, GPT, and Gemini
A prompt that hits 95% on Claude can hit 70% on GPT — design for portability or pick one.
