Why Trying to Trick AI Into Doing Bad Stuff Is a Bad Idea

Trying to make AI break its safety rules can get you in real trouble.

Explorers · Safety & Governance · ~3 min read

The big idea

Some kids try to be 'sneaky' and trick AI into saying mean stuff or sharing things it shouldn't. This is called jailbreaking, and it can get you in trouble at school or with parents.

Some examples

Pretending the AI is 'in a movie' to make it say bad words.
Trying to get AI to share dangerous instructions.
Schools are starting to track these tricks and punish them.
Even if it 'works,' it makes AI worse for everyone.

Try it!

If a friend wants you to help trick an AI, say 'no thanks' and tell a grown-up. Practice it!

Key terms in this lesson

Practice this safely

Try this with a low-stakes example and a trusted adult nearby. The goal is to notice how AI talks about jailbreaking, not to let it make the decision for you.

1Ask AI to explain jailbreaking in plain language, then underline anything that sounds uncertain or too broad.
2Give it one detail from "Why Trying to Trick AI Into Doing Bad Stuff Is a Bad Idea" and ask for two possible next steps plus one reason each step might be wrong.
3Check rules against a trusted source, teacher, adult, expert, or original document before you use it.

End-of-lesson quiz

Check what stuck

8 questions · Score saves to your progress.

Lesson help

Questions are best handled with a grown-up here.

For this age range, Tendril keeps freeform AI chat paused until parent/guardian consent and child-safe moderation are fully verified. Use the quiz, notes, and related lessons below, or ask a parent, guardian, teacher, or librarian to work through the question with you.

Progress saved locally in this browser. Sign in to sync across devices.

Related lessons

Why Trying to Trick AI Into Doing Bad Stuff Is a Bad Idea

The big idea

Some examples

Try it!

Practice this safely

Questions are best handled with a grown-up here.

Keep going

Why Trying to Trick AI Into Doing Bad Stuff Is a Bad Idea

The big idea

Some examples

Try it!

Practice this safely

Questions are best handled with a grown-up here.

Keep going