Lesson 1056 of 1234
Humans Gave AI Thumbs Up to Train It
AI got better because humans clicked thumbs up or thumbs down.
Lesson map
What this lesson covers
Learning path
The main moves in order
- 1The big idea
- 2RLHF
- 3human feedback
- 4AI training
Concept cluster
Terms to connect while reading
Section 1
The big idea
Humans rated AI answers thumbs up or thumbs down. AI learned to give more answers humans liked. This is called RLHF.
Some examples
- RLHF stands for Reinforcement Learning from Human Feedback.
- Workers read AI answers all day and rated them.
- Good answers got copied. Bad answers got avoided.
- That is why modern AI sounds nicer and clearer than older AI.
Try it!
When AI gives a great answer, click thumbs up. That feedback can help train future AI versions.
Key terms in this lesson
End-of-lesson quiz
Check what stuck
15 questions · Score saves to your progress.
Tutor
Curious about “Humans Gave AI Thumbs Up to Train It”?
Ask anything about this lesson. I’ll answer using just what you’re reading — short, friendly, grounded.
Progress saved locally in this browser. Sign in to sync across devices.
Related lessons
Keep going
Explorers · 18 min
World Geography: Exploring Places with AI
Geography used to be memorizing capitals. Now you can take a virtual tour, ask questions, and actually remember where things are and why.
Explorers · 18 min
ChatGPT, November 2022
A research preview posted on a Wednesday became the fastest-growing consumer product in history.
Explorers · 6 min
AI Helpers in Your Favorite Video Games
From the bad guys you fight to the buddies who help you — meet the AI hiding inside games.
