How Computers See & Hear
Cameras and microphones are an AI's eyes and ears.
Your eyes see light. Your ears hear sound. Computers don’t have eyes or ears. But cameras and microphones can turn light and sound into numbers. And computers are very, very good at numbers.
A photo is a grid of tiny colors
Zoom into any photo far enough and you’ll see little squares called pixels. Each pixel in a color photo is just three numbers: how much red, how much green, and how much blue. A computer “sees” a photo as a giant spreadsheet of red-green-blue numbers.
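Here is a small sketch of that idea in Python. It builds a pretend 2-by-2 photo by hand. The names `photo` and `describe_pixel` are made up for this example, and it assumes each color number runs from 0 (none) to 255 (as much as possible), which is a common choice but not the only one.

```python
# A tiny pretend photo: 2 rows and 2 columns of pixels.
# Each pixel is (red, green, blue). Assumption: each number goes from 0 to 255.
photo = [
    [(255, 0, 0), (0, 255, 0)],        # top row: a red pixel, a green pixel
    [(0, 0, 255), (255, 255, 255)],    # bottom row: a blue pixel, a white pixel
]


def describe_pixel(pixel):
    """Turn one pixel's three numbers into a readable sentence."""
    red, green, blue = pixel
    return f"red={red}, green={green}, blue={blue}"


# The computer can look up any pixel by its row and column.
top_left = describe_pixel(photo[0][0])

# Count how many pixels the whole photo has.
pixel_count = sum(len(row) for row in photo)

print(top_left)
print("pixels in photo:", pixel_count)
```

A real photo works the same way, just with millions of pixels instead of four.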
Sound is a wiggly line
A microphone turns sound waves into a wiggly line. The computer saves that wiggle as thousands of numbers per second. Loud sounds make big wiggles, so the numbers swing far from zero. Quiet sounds make small wiggles, so the numbers stay close to zero.
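The sketch below shows what those saved numbers might look like. It does not use a real microphone. Instead it makes a pretend wiggle with math. The names `make_tone` and `biggest_swing`, the 8000 numbers per second, and the 440 wiggles per second are all assumptions picked for this example.

```python
import math

SAMPLES_PER_SECOND = 8000  # assumption: how many numbers we save each second


def make_tone(loudness, pitch_hz=440, count=80):
    """Make a pretend sound as a list of numbers (one smooth wiggle)."""
    return [
        loudness * math.sin(2 * math.pi * pitch_hz * n / SAMPLES_PER_SECOND)
        for n in range(count)
    ]


def biggest_swing(samples):
    """Find how far the wiggle gets from zero, up or down."""
    return max(abs(s) for s in samples)


loud = make_tone(loudness=0.9)
quiet = make_tone(loudness=0.1)

print("numbers saved:", len(loud))
print("loud swings further than quiet:", biggest_swing(loud) > biggest_swing(quiet))
```

Both sounds have the same number of saved numbers. Only the size of the swings changes.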
So what does the AI do?
It looks at those numbers and hunts for patterns. “Big blue numbers near the top of a photo usually mean sky.” “This wiggle shape is usually the sound of the letter M.” It’s not magic. It’s pattern hunting at enormous speed.
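To make pattern hunting less mysterious, here is a very simple sketch of a sky guesser. It uses one hand-written rule: a pixel counts as sky-colored when blue is its biggest number. That rule, the function names, and the two pretend photos are all made up for this example. A real AI is not handed a rule like this. It works out its own patterns from huge piles of example photos.

```python
def looks_like_sky(pixel):
    """Hand-written rule (an assumption): sky pixels have more blue than red or green."""
    red, green, blue = pixel
    return blue > red and blue > green


def top_row_is_probably_sky(photo):
    """Guess 'sky' when more than half of the top row passes the rule."""
    top_row = photo[0]
    sky_votes = sum(1 for pixel in top_row if looks_like_sky(pixel))
    return sky_votes > len(top_row) / 2


# Two tiny pretend photos, each with 2 rows of 3 pixels.
beach = [
    [(90, 160, 255), (100, 170, 250), (95, 165, 245)],     # bluish top row
    [(230, 210, 150), (225, 205, 140), (235, 215, 160)],   # sandy bottom row
]
forest = [
    [(30, 120, 40), (25, 110, 35), (40, 130, 50)],         # greenish top row
    [(60, 40, 20), (55, 35, 15), (65, 45, 25)],            # brownish bottom row
]

print("beach has sky on top:", top_row_is_probably_sky(beach))
print("forest has sky on top:", top_row_is_probably_sky(forest))
```

A rule this simple is easy to fool. A photo of a blue wall would count as sky too, which is one reason real systems learn from many examples.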
Remember
- Cameras and microphones turn the world into numbers.
- The AI looks for patterns in those numbers.
