Why Trying to Trick AI Into Doing Bad Stuff Is a Bad Idea
Trying to make AI break its safety rules can get you in real trouble.
Explorers · Safety & Governance · ~3 min read
The big idea
Some kids try to be 'sneaky' and trick AI into saying mean stuff or sharing things it shouldn't. This is called jailbreaking, and it can get you in trouble at school or with your parents.
Some examples
- Pretending the AI is 'in a movie' to make it say bad words.
- Trying to get AI to share dangerous instructions.
- Schools are starting to track these tricks and punish students who use them.
- Even if it 'works,' it makes AI worse for everyone.
Try it!
If a friend wants you to help trick an AI, say 'no thanks' and tell a grown-up. Practice it!
Practice this safely
Try this with a low-stakes example and a trusted adult nearby. The goal is to notice how AI talks about jailbreaking, not to let it make the decision for you.
1. Ask AI to explain jailbreaking in plain language, then underline anything that sounds uncertain or too broad.
2. Give it one detail from "Why Trying to Trick AI Into Doing Bad Stuff Is a Bad Idea" and ask for two possible next steps plus one reason each step might be wrong.
3. Check anything the AI says about rules against a trusted source, such as a teacher, another adult, an expert, or the original document, before you use it.
