The field trying to make sure AI stays good for humans — explained for teens.
7 min · Reviewed 2026
The big idea
AI safety is a serious technical field, with researchers at every major lab and dozens of independent organizations. It's not just 'don't say bad words': it's making sure increasingly capable AI systems do what humans actually want, and that we'd notice if they didn't. And there are several paths into this work that teens can start on right now.
Some examples
Alignment research: getting models to faithfully pursue human goals.
Interpretability: understanding what's happening inside the model's neural network.
Evaluations: building tests that catch dangerous capabilities before deployment.
Governance: writing the rules nations and companies follow.
Try it!
Read the homepage of one AI safety org (Anthropic, ARC, Apollo, MATS) and note which roles they hire for.
End-of-lesson check
15 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-ai-safety-research-overview-final2-teen
What is the main goal of AI safety research?
Making AI systems process data faster
Making AI systems generate more creative content
Teaching AI to avoid saying controversial words
Ensuring AI systems do what humans actually want them to do
What does interpretability research aim to do?
Understand what is happening inside a model's neural network
Make AI systems easier for users to understand
Help AI pass standardized tests
Design better computer chips for AI
Which job would NOT typically be needed in AI safety research?
Writer and content creator
Machine learning engineer
Social media influencer
Philosopher
Why are capability evaluations important in AI safety?
They make AI systems run faster
They teach AI to write better code
They improve user interface design
They help catch dangerous capabilities before deployment
What is alignment research focused on?
Getting models to faithfully pursue human goals
Getting models to learn from less data
Getting models to generate more text
Getting models to work on different devices
Which statement best describes AI safety as a field?
It is only about preventing AI from saying offensive things
It is primarily about making AI entertainment systems
It is a serious technical field with researchers at major labs
It only needs computer programmers to succeed
What is governance in the context of AI safety?
Managing a team's daily schedule
Writing the rules nations and companies follow
Planning computer network infrastructure
Controlling how AI generates images
If an AI system finds an unexpected way to achieve its goal that humans didn't intend, what concept describes this problem?
Neural network overfitting
Data leakage
Goal misalignment or reward hacking
Model compression
What is one reason AI safety matters for teenagers today?
It only matters for people over 30
It has nothing to do with technology
It has no real-world applications yet
It is one of the most important careers of their generation
What is the primary focus of capability evaluations?
Counting how many parameters a model has
Finding dangerous capabilities before AI is released
Testing how well AI plays games
Measuring how much energy AI uses
Why might a writer or philosopher be valuable in AI safety research?
They know how to fix computer bugs
They can design better computer keyboards
They help think through ethical implications and human values
They can write fiction to entertain AI researchers
What would happen if AI systems could achieve their goals in ways humans cannot detect?
Humans might not notice if the AI did something unintended
The AI would stop working
The AI would automatically become safer
That would be ideal for all applications
Which organization was mentioned in the lesson as an AI safety org?
Netflix
Anthropic
Google
Meta
What does it mean for AI to be 'aligned' with human values?
It uses the same programming language as humans
It learns at the same speed as humans
It pursues goals in the way humans would want
It says things humans agree with
Why is interpretability important for AI safety?
It helps researchers understand and predict model behavior