AI did not start in 2022. It has decades of wrong turns and breakthroughs behind it. Knowing the history helps you tell hype from real progress.
AI research began in the 1950s. The field has gone through booms and winters — periods of huge funding followed by collapse. Understanding this rhythm helps you calibrate today's excitement.
Early researchers believed intelligence was logic. They wrote programs that manipulated symbols according to formal rules. Expert systems, which boomed in the 1980s, encoded human expert knowledge as if-then rules; MYCIN, a 1970s medical-diagnosis system, was an early example. They worked for narrow problems but crumbled outside them.
Through the 1990s and 2000s, researchers pivoted to data-driven methods. Support vector machines, decision trees, and shallow neural networks dominated. IBM's Deep Blue beat Kasparov at chess in 1997, but it was hand-tuned search, not general intelligence.
In 2012, a neural network called AlexNet won the ImageNet competition by a huge margin, kicking off the deep learning revolution. GPUs, big datasets, and backpropagation combined to finally make deep networks trainable. By 2016, AlphaGo had beaten the world champion at Go, a feat many experts had expected to be years away.
The 2017 paper *Attention Is All You Need* introduced the transformer architecture. It replaced the recurrent networks used for language with a simpler, more parallel structure. Every modern LLM (GPT, Claude, Gemini, Llama) is a transformer at heart.
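If you want to see the core operation that paper introduced, here is a minimal NumPy sketch of scaled dot-product attention. The function name, shapes, and random numbers are ours, purely for illustration: the point is that every position attends to every other position in one matrix multiply, which is what makes the architecture so parallel.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Toy sketch of the attention operation at the heart of the transformer."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                        # how much each query matches each key
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)          # softmax over positions
    return weights @ V                                      # weighted mix of the value vectors

# Toy example: 4 token positions, 8-dimensional vectors (made-up numbers)
rng = np.random.default_rng(0)
Q = rng.normal(size=(4, 8))
K = rng.normal(size=(4, 8))
V = rng.normal(size=(4, 8))
print(scaled_dot_product_attention(Q, K, V).shape)  # (4, 8)
```

Real transformers stack many of these operations with multiple heads, learned projections, and position information, but this single step is the reason the whole sequence can be processed in parallel instead of token by token.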
| Year | Milestone |
|---|---|
| 2017 | Transformer architecture published |
| 2018 | BERT and GPT-1 released |
| 2020 | GPT-3 shows few-shot learning |
| 2022 | ChatGPT makes AI mainstream |
| 2023-2024 | GPT-4, Claude 3, Gemini, open-source Llama |
| 2025-2026 | Reasoning models, multimodality, agentic systems |
> AI winters end not with a new theory, but with enough compute.
>
> — A long-time researcher
The big idea: today's AI is the fourth major wave, built on GPUs, internet-scale data, and the transformer. Knowing the cycle helps you see that hype is not new, but neither is real progress.
15 questions · take the quiz online for instant feedback at tendril.neural-forge.io/learn/quiz/end-builders-history-of-ai
1. What core assumption drove the symbolic AI of the 1950s through the 1980s?
2. Which term describes rule-based programs like MYCIN that encoded expert knowledge as if-then rules?
3. Why did expert systems fall out of favor?
4. Which methods dominated the data-driven era before deep learning?
5. Why is Deep Blue's 1997 win over Kasparov not considered general intelligence?
6. What happened in 2012 that kicked off the deep learning revolution?
7. Which three ingredients finally made deep networks trainable?
8. What milestone did AlphaGo reach in 2016?
9. What did the paper "Attention Is All You Need" introduce?
10. What did the transformer replace in language modeling?
11. Which architecture sits at the heart of GPT, Claude, Gemini, and Llama?
12. What is an AI winter, and what has historically ended one?
13. What capability did GPT-3 demonstrate in 2020?
14. Which 2022 release made AI mainstream?
15. According to the lesson's big idea, what is today's wave of AI built on?