A game that experts thought was a decade beyond AI's reach fell in Seoul, and move 37 rewrote what humans knew about Go.
Go had resisted computers for decades. The board has 19 by 19 intersections, and a typical position offers roughly 250 legal moves, a branching factor several times that of chess (roughly 35). Many experts had predicted it would take another decade before a machine could beat a top human.
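A rough back-of-the-envelope estimate gives a feel for that scale. The sketch below assumes commonly cited approximations that this lesson does not state: about 35 legal moves per turn over about 80 turns for chess, and about 250 legal moves per turn over about 150 turns for Go. The function name is illustrative only.

```python
import math


def log10_game_tree_size(branching_factor: float, game_length: int) -> float:
    """Return log10 of branching_factor ** game_length.

    Working in logarithms keeps the numbers readable; the raw values
    have hundreds of digits.
    """
    return game_length * math.log10(branching_factor)


# Assumed rough figures (not from the lesson): moves per turn, turns per game.
chess_exponent = log10_game_tree_size(35, 80)
go_exponent = log10_game_tree_size(250, 150)

print(f"chess: about 10^{chess_exponent:.1f} move sequences")
print(f"go:    about 10^{go_exponent:.1f} move sequences")
```

Under these assumptions the exponents come out near 123.5 for chess and 359.7 for Go, which is why exhaustive search was never a realistic option for Go.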
DeepMind's AlphaGo, built by a team led by David Silver, defeated Lee Sedol 4 to 1 in a televised five-game match in Seoul in March 2016. Over 200 million people watched online. Lee won game four with move 78, now called the divine move, a rare reminder that humans still had tricks.
The following year, AlphaGo Zero started from random play, used only self-play and the rules of Go, and surpassed the Lee Sedol version after about three days of training. It showed that a system could reach superhuman play without imitating human games at all.
It's not a human move. I've never seen a human play this move.
— Fan Hui, after move 37
The big idea: reinforcement learning, deep neural networks, and self-play together produced superhuman play in a game humans had studied for millennia. The same combination has since been applied well beyond games.
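The self-play idea can be illustrated on a far smaller game. The sketch below is not AlphaGo's method: it swaps the deep networks and tree search for a plain lookup table and a Q-learning-style update, and it plays a hypothetical toy game (one pile of stones, each player removes 1 to 3, whoever takes the last stone wins). All names and parameter values are assumptions chosen for illustration. Like AlphaGo Zero, it is given only the rules and improves by playing against itself.

```python
import random

MAX_TAKE = 3  # toy rule: a player removes 1, 2, or 3 stones per turn


def legal_moves(stones):
    """Moves available to the player facing `stones` stones."""
    return range(1, min(MAX_TAKE, stones) + 1)


def train_by_self_play(start_stones=10, games=5000, epsilon=0.3, step=0.5, seed=0):
    """Learn move values from self-play, starting with no knowledge of strategy."""
    rng = random.Random(seed)
    # value[(stones, take)] estimates the chance that the player to move
    # eventually wins after removing `take` stones. 0.5 means "no idea yet".
    value = {(s, a): 0.5
             for s in range(1, start_stones + 1)
             for a in legal_moves(s)}

    for _ in range(games):
        stones = start_stones
        while stones > 0:
            moves = list(legal_moves(stones))
            if rng.random() < epsilon:
                take = rng.choice(moves)  # explore a random move
            else:
                take = max(moves, key=lambda a: value[(stones, a)])  # exploit

            remaining = stones - take
            if remaining == 0:
                target = 1.0  # taking the last stone wins
            else:
                # The same table plays the opponent; its best outcome is our loss.
                best_reply = max(value[(remaining, a)] for a in legal_moves(remaining))
                target = 1.0 - best_reply

            # Nudge the estimate toward the observed target.
            value[(stones, take)] += step * (target - value[(stones, take)])
            stones = remaining  # the other side moves next
    return value


def best_move(value, stones):
    """Greedy move according to the learned table."""
    return max(legal_moves(stones), key=lambda a: value[(stones, a)])


value = train_by_self_play()
learned = {s: best_move(value, s) for s in range(1, 11) if s % 4 != 0}
print(learned)
```

For this toy game the known winning strategy is to leave the opponent a multiple of four stones, so the learned table can be checked against it. No human games or strategy hints are supplied at any point; only the rules and the win condition.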
15 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-history-alphago-2016-builders
In what city was the AlphaGo versus Lee Sedol match held in March 2016?
What was the final score of the five-game match between AlphaGo and Lee Sedol?
How many people watched the match online?
Which game did Lee Sedol win to avoid a shutout?
What was special about move 37 in game two?
What did commentators initially think about move 37?
What does the policy network in AlphaGo do?
What does the value network in AlphaGo do?
What is Monte Carlo tree search used for in AlphaGo?
What is self-play reinforcement learning in AlphaGo used for?
How did AlphaGo Zero differ from the original AlphaGo?
How long did it take AlphaGo Zero to surpass the Lee Sedol version?
What did AlphaGo Zero prove about AI systems?
What is the branching factor of Go compared to chess?
What did experts predict before AlphaGo's victory?