Loading lesson…
An AI that paints starts with pure noise and removes it, one step at a time, until a picture appears. Here's the surprisingly beautiful math behind it.
Almost every modern image AI — DALL-E 3, Midjourney, Stable Diffusion, Flux, Imagen — is a diffusion model. The core idea is strange and brilliant: instead of 'drawing,' the AI subtracts. It starts with a canvas of pure random noise (like TV static) and removes noise step by step until a picture emerges. Your prompt steers which picture emerges.
Do that with billions of images and their captions. The network learns, deeply, what noise looks like AT EVERY LEVEL and how to peel it back toward a coherent image — guided by the caption.
Doing diffusion on full 1024x1024 pixel images is slow. Stable Diffusion's 2022 breakthrough was to work in latent space — a compressed representation (roughly 64x64 with many channels) learned by a separate autoencoder. Diffusion happens in latent space, which is 50x smaller, then the decoder turns the final latent into a full image. Flux and Stable Diffusion 3.5 use the same approach.
| Diffusion (SD, Flux, Midjourney) | Autoregressive (GPT-4o image, some experimental) |
|---|---|
| Generate whole image at once, refine. | Generate pixel or patch, then next, like text tokens. |
| Fast, parallel, high quality. | Slower, but natural fit with LLMs. |
| Dominant approach in 2026. | Growing as multimodal LLMs improve. |
| ControlNet, LoRA, IP-Adapter work here. | Different adapter ecosystem. |
8 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-creative-diffusion-explained-builders
What is the main idea of "How Diffusion Models Actually Work"?
Which concept is most central to "How Diffusion Models Actually Work"?
Which use of AI fits this topic best?
What should a careful learner remember about "The magic detail: classifier-free guidance"?
You want to use AI after this lesson. What is the safest next step?
How should AI output about diffusion be treated?
Name one way to verify an AI answer about diffusion.
Which action would help you apply "How Diffusion Models Actually Work" responsibly?