ChatGPT Plus costs you $20 a month. The math behind that price, and the reason prices keep dropping, explains a lot about the AI industry.
7 min · Reviewed 2026
The big idea
Running a frontier AI model for millions of users keeps large fleets of NVIDIA H100 GPUs busy, and the bill for that hardware can reach thousands of dollars per minute. The cost per query has dropped ~99% from 2022 to 2026 because of better hardware, smaller specialized models, and engineering tricks like quantization (storing a model's numbers with fewer bits). That price drop is why AI features keep getting added to free tiers.
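Percent drops and "times cheaper" factors describe the same thing in two ways, and it is easy to mix them up. Here is a minimal Python sketch of the conversion. The function names are made up for this lesson, and the only inputs are the ~99% figure above and the 200x figure from the examples.

```python
def drop_to_factor(percent_drop: float) -> float:
    """Convert a percentage price drop into a 'times cheaper' factor."""
    remaining = 1 - percent_drop / 100  # fraction of the old price you still pay
    return 1 / remaining


def factor_to_drop(factor: float) -> float:
    """Convert a 'times cheaper' factor into a percentage price drop."""
    return (1 - 1 / factor) * 100


# A ~99% drop means paying roughly 1/100th of the old price.
print(round(drop_to_factor(99)))
# A 200x drop is a 99.5% drop.
print(round(factor_to_drop(200), 1))
```

So "down 99%" and "about 100x cheaper" are the same claim.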
Some examples
An H100 GPU rents for $2-$4/hour on cloud services; training GPT-4 reportedly cost $100M+ in compute.
GPT-4 cost $30 per million input tokens at launch (March 2023). GPT-4o-mini, a smaller and newer model, costs $0.15 per million input tokens in 2026. That is a 200x drop.
DeepSeek's V3 model, released in late 2024, was reportedly trained for under $6M in compute, partly by using cleverer methods on cheaper hardware.
A single inference (running the model after training) costs far less than training, but inference happens billions of times per day across all users.
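To see how these numbers fit together, here is a rough Python sketch using only the figures from the examples above. The fleet size of 30,000 GPUs is a made-up number chosen for illustration, not a figure for any real company.

```python
# Figures quoted in the examples above.
H100_RENT_PER_HOUR = (2.0, 4.0)  # USD per GPU-hour, low and high end
GPT4_LAUNCH_PRICE = 30.0         # USD per million input tokens
GPT4O_MINI_PRICE = 0.15          # USD per million input tokens


def fleet_cost_per_minute(num_gpus: int, rent_per_hour: float) -> float:
    """Rental cost in USD of running num_gpus GPUs for one minute."""
    return num_gpus * rent_per_hour / 60


# HYPOTHETICAL fleet size, for illustration only.
num_gpus = 30_000
low, high = (fleet_cost_per_minute(num_gpus, rate) for rate in H100_RENT_PER_HOUR)
print(f"{num_gpus:,} GPUs: ${low:,.0f} to ${high:,.0f} per minute")

# How many times cheaper input tokens became between the two models.
price_ratio = GPT4_LAUNCH_PRICE / GPT4O_MINI_PRICE
print(f"Input tokens got about {price_ratio:.0f}x cheaper")
```

One GPU at $2 to $4 an hour is only a few cents per minute. The cost gets large because so many GPUs run at once.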
Try it!
Visit openai.com/api/pricing or anthropic.com/pricing. Compare what a model costs today with what it cost a year ago. The drop is real and steady. That's the trend that makes 'AI features in everything' possible.
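If you want to turn the prices you find into a per-request cost, a small calculator like this Python sketch can help. The prices filled in are the two from this lesson, and the 1,000-token prompt size is an assumption. Swap in whatever numbers you see on the pricing pages.

```python
def request_cost(tokens: int, price_per_million: float) -> float:
    """USD cost of processing `tokens` input tokens at a per-million-token price."""
    return tokens / 1_000_000 * price_per_million


# Replace these with the prices you find on the pricing pages.
old_price = 30.0  # USD per million input tokens (GPT-4 launch price from this lesson)
new_price = 0.15  # USD per million input tokens (GPT-4o-mini price from this lesson)

# ASSUMPTION: a prompt of 1,000 tokens, picked only as an example size.
prompt_tokens = 1_000

old = request_cost(prompt_tokens, old_price)
new = request_cost(prompt_tokens, new_price)
print(f"old: ${old:.5f}  new: ${new:.5f}  ratio: {old / new:.0f}x")
```

This only counts input tokens. Pricing pages usually list a separate, higher price for output tokens, so a full estimate would add that in.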
End-of-lesson check
8 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-builders-foundations-ai-cost-of-running-models-r9a10-teen
What is the main idea of "What It Actually Costs to Run a Big AI Model"?
ChatGPT Plus costs you $20 a month. The math behind that price, and the reason prices keep dropping, explains a lot about the AI industry.
Use AI as the final authority for the whole decision
Avoid checking the answer once it sounds polished
Focus only on speed instead of judgment
Which concept is most central to "What It Actually Costs to Run a Big AI Model"?
GPU
compute
inference cost
scaling
Which use of AI fits this topic best?
Let the AI decide what matters without your review
Use the answer before checking whether it fits the situation
An H100 GPU rents for $2-$4/hour on cloud services; training GPT-4 reportedly cost $100M+ in compute.
Use the first answer without checking it
What should a careful learner remember about "The rule"?
Use AI to draft or organize ideas about compute, then verify before acting.
Skip the context so the tool can guess faster
Treat the output as private even after sharing it online
Use the answer without checking the source
You want to use AI after this lesson. What is the safest next step?
Act immediately because the AI answer is written clearly
Use the AI answer as a draft, then check it against a reliable source.
Hide uncertainty so the final answer looks cleaner
Use private or sensitive details before checking permission
How should AI output about compute be treated?
As proof that no other source is needed
As a replacement for context, consent, or expert review
As a draft or helper output that still needs human judgment and verification
As something that becomes correct when it sounds confident
Name one way to verify an AI answer about compute.
Which action would help you apply "What It Actually Costs to Run a Big AI Model" responsibly?
Use the tool to avoid thinking through the tradeoff
Keep going even if the output conflicts with a trusted source
Use the first answer without checking it
GPT-4 cost $30 per million input tokens at launch (March 2023). GPT-4o-mini, a smaller and newer model, costs $0.15 per million input tokens in 2026. That is a 200x drop.