Loading lesson…
Some capabilities grow smoothly with scale. Others seem to appear out of nowhere. Telling them apart is a whole research program. The Big Question Is AI capability a smooth climb or a staircase?
Is AI capability a smooth climb or a staircase? The answer is probably 'both, depending on how you measure.' Understanding the argument is central to forecasting what the next generation of models will and will not do.
Wei et al. (2022) catalogued capabilities that appeared to 'emerge' at particular scales — arithmetic, instruction following, in-context learning. Below a threshold, performance was near random; above it, performance jumped sharply.
Schaeffer, Miranda, and Koyejo (2023) argued that many emergent abilities are a function of the metric, not the model. Switch from strict exact-match to partial-credit scoring, and the cliff becomes a gentle hill. Emergence might be about how we look, not what is there.
| View | Claim | Implication |
|---|---|---|
| Strong emergence | Capabilities really do appear at thresholds | Forecasting is hard; surprises are inevitable |
| Mirage view | Smoothness is hidden by harsh metrics | Forecasting is possible with better metrics |
| Middle ground | Some emergence is real, some is measurement | Depends on task — check both framings |
Our findings suggest that existing claims of emergent abilities are creations of the researcher's choice of metrics.
— Schaeffer et al., Are Emergent Abilities of Large Language Models a Mirage? (2023)
The big idea: whether AI capabilities emerge suddenly or grow smoothly depends partly on how you look. Either way, the surprises are real enough to plan for.
15 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-creators-emergence-vs-scaling
What is the core idea behind "Emergence vs. Scaling"?
Which term best describes a foundational idea in "Emergence vs. Scaling"?
A learner studying Emergence vs. Scaling would need to understand which concept?
Which of these is directly relevant to Emergence vs. Scaling?
Which of the following is a key point about Emergence vs. Scaling?
Which of these does NOT belong in a discussion of Emergence vs. Scaling?
What is the key insight about "Why this is not settled" in the context of Emergence vs. Scaling?
What is the key insight about "The policy stakes" in the context of Emergence vs. Scaling?
What is the recommended tip about "Ground your practice in fundamentals" in the context of Emergence vs. Scaling?
Which statement accurately describes an aspect of Emergence vs. Scaling?
What does working with Emergence vs. Scaling typically involve?
Which of the following is true about Emergence vs. Scaling?
Which best describes the scope of "Emergence vs. Scaling"?
Which section heading best belongs in a lesson about Emergence vs. Scaling?
Which section heading best belongs in a lesson about Emergence vs. Scaling?