Loading lesson…
Sometimes a network memorizes, then — long after you would have stopped training — suddenly generalizes. That is grokking, a real and weird phenomenon. Why it matters beyond the toy Grokking suggests that 'more training' can sometimes qualitatively change a model's behavior — not just improve a score but switch to a different algorithm internally.
In 2022, Power, Burda, Edwards, and colleagues at OpenAI reported something strange: a small transformer trained on modular arithmetic would reach ~100% training accuracy while test accuracy stayed near zero for thousands of epochs. Then, suddenly, test accuracy would snap to ~100%. They called the phenomenon grokking, after the Heinlein word for 'to understand fully.'
Accuracy 1.0 | Train ______________________ | / | / Test 0.5 | / ________ | / / 0.0 |____/__________/______________ Time memorize generalize (early) (much later)Training accuracy saturates. Test accuracy stays low — then snaps up far later.Grokking suggests that 'more training' can sometimes qualitatively change a model's behavior — not just improve a score but switch to a different algorithm internally. That has implications for how we evaluate safety during training.
We show that neural networks can 'grok' algorithmic tasks, generalizing well after overfitting the training set.
— Power et al., Grokking paper (2022)
The big idea: learning is not monotonic. Grokking proves that long after training looks 'done,' the internal algorithm can still be changing.
8 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-creators-grokking
What is the main idea of "Grokking: Learning That Snaps Into Place"?
Which concept is most central to "Grokking: Learning That Snaps Into Place"?
Which use of AI fits this topic best?
What should a careful learner remember about "Mechanistic interpretability found the circuit"?
You want to use AI after this lesson. What is the safest next step?
How should AI output about grokking be treated?
Name one way to verify an AI answer about grokking.
Which action would help you apply "Grokking: Learning That Snaps Into Place" responsibly?