Tendril

Lesson 73 of 1570

Claude Haiku 4.5 vs. GPT-5.4 mini — the cheap-and-fast class

When you need sub-second responses at pennies per thousand calls, you are choosing from the mini tier. Here is the honest Haiku vs. mini comparison.

BuildersModel Families~13 min readIntermediateBI2 · Representation & ReasoningBI4 · Natural InteractionPrint / PDF

Lesson map

What this lesson covers

22 min17 blocks3 concepts

Learning path

The main moves in order

1The class you actually ship on
2Head to head
3Cost of a million user turns

Concept cluster

Terms to connect while reading

Claude Haiku 4.5GPT-5.4 miniGemini 2.5 Flash-Lite

Sections6

Lists3

Notes4

Code1

Compare1

Section 1

The class you actually ship on

Frontier models are fun to demo. They lose money in production. The models that ship inside real customer products are the mini tier — Haiku, GPT-5.4 mini, Gemini Flash-Lite — because they are fast, cheap, and good enough for 80% of user turns.

Section 2

Head to head

Compare the options

Feature	Claude Haiku 4.5	GPT-5.4 mini	Gemini 2.5 Flash-Lite
Input price per M	$1.00	$0.75	$0.10
Output price per M	$5.00	$4.50	$0.40
Context window	200k	400k	1M
Vision	Yes	Yes	No
Reasoning toggle	No	Yes	No
Best at	Tone and nuance in support chat	Reasoning-light tasks with vision	Sheer throughput, cheapest floor

Check-in 1. Got it so far?

Pick Haiku if

You run a customer-support chat where tone and de-escalation matter
You need vision (image moderation, receipt OCR with commentary)
You already use Anthropic tooling in your stack

Pick GPT-5.4 mini if

You want reasoning capability at mini prices (flip the reasoning flag on for harder turns)
You need 400k context for longer conversations
You are already on OpenAI for other products

Pick Gemini Flash-Lite if

Your cost per call is the whole game (data labeling at scale, embeddings pre-processing)
You do not need vision or nuanced tone
You are doing classification on a 1M-row dataset

Check-in 2. Got it so far?

Section 3

Cost of a million user turns

At 1M turns, Flash-Lite costs 9% of Haiku. GPT-5.4 mini costs closer to Haiku, but buys you OpenAI tool support and reasoning controls.

python

# Assume 500 input tokens, 200 output tokens per turn
# 1,000,000 turns

haiku = (500 * 1_000_000 / 1_000_000) * 1.00 + (200 * 1_000_000 / 1_000_000) * 5.00
# = $500 + $1000 = $1,500

gpt_mini = (500 * 1_000_000 / 1_000_000) * 0.75 + (200 * 1_000_000 / 1_000_000) * 4.50
# = $375 + $900 = $1,275

flash_lite = (500 * 1_000_000 / 1_000_000) * 0.10 + (200 * 1_000_000 / 1_000_000) * 0.40
# = $50 + $80 = $130

print(haiku, gpt_mini, flash_lite)

Key terms in this lesson

Check-in 3. Got it so far?

End-of-lesson quiz

Check what stuck

15 questions · Score saves to your progress.

Tutor

Curious about “Claude Haiku 4.5 vs. GPT-5.4 mini — the cheap-and-fast class”?

Ask anything about this lesson. I’ll answer using just what you’re reading — short, friendly, grounded.

Progress saved locally in this browser. Sign in to sync across devices.

Related lessons

Claude Haiku 4.5 vs. GPT-5.4 mini — the cheap-and-fast class

The class you actually ship on

Head to head

Pick Haiku if

Pick GPT-5.4 mini if

Pick Gemini Flash-Lite if

Cost of a million user turns

Curious about “Claude Haiku 4.5 vs. GPT-5.4 mini — the cheap-and-fast class”?

Keep going

Claude Haiku 4.5 vs. GPT-5.4 mini — the cheap-and-fast class

The class you actually ship on

Head to head

Pick Haiku if

Pick GPT-5.4 mini if

Pick Gemini Flash-Lite if

Cost of a million user turns

Curious about “Claude Haiku 4.5 vs. GPT-5.4 mini — the cheap-and-fast class”?

Keep going