Loading lesson…
A game thought to be a decade away for AI fell in Seoul, and move 37 rewrote what humans knew about Go.
Go had resisted computers for decades. The board has 19 by 19 intersections and a branching factor roughly ten times chess. Experts had predicted a decade more before a machine could beat a top human.
DeepMind's AlphaGo, led by David Silver and team, defeated Lee Sedol 4 to 1 in a televised five-game match in Seoul in March 2016. Over 200 million people watched online. Lee won game four with a move now called the divine move, a rare reminder that humans still had tricks.
The following year, AlphaGo Zero started from random play, used only self-play and the rules of Go, and surpassed the Lee Sedol version in a matter of days. It proved that a system could reach superhuman play without imitating human games at all.
It's not a human move. I've never seen a human play this move.
— Fan Hui, after move 37
The big idea: reinforcement learning plus deep networks plus self-play produced superhuman play in domains humans had studied for millennia. The technique generalizes far beyond games.
8 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-history-alphago-2016-builders
What is the main idea of "AlphaGo Beats Lee Sedol, 2016"?
Which concept is most central to "AlphaGo Beats Lee Sedol, 2016"?
Which use of AI fits this topic best?
What should a careful learner remember about "Move 37"?
You want to use AI after this lesson. What is the safest next step?
How should AI output about AlphaGo be treated?
Name one way to verify an AI answer about AlphaGo.
Which action would help you apply "AlphaGo Beats Lee Sedol, 2016" responsibly?