Loading lesson…
AI agents can already find some software vulnerabilities and write exploits. What happens when those capabilities scale? A clear-eyed walk through the data.
Offensive cyber has been an AI-relevant domain for years. What is new — since roughly 2024 — is agentic capability: models that take multi-step actions, use tools, and pursue goals across hours of operation. This has moved AI from an assistant for human hackers to a plausible operator.
We're in the strange position of hoping the offense-defense balance stays close, because any big asymmetry either way breaks a lot of what holds the internet together.
— Heather Adkins, Google / board member commentary (paraphrased from public talks)
The big idea: AI in cyber is not science fiction. It is a real, scaling capability with measured progress on both sides. The question for the next several years is whether defense keeps up — and what policy levers help it.
8 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-safety2-cyber-risk-ai-creators
What is the main idea of "Cyber Risk and Autonomous AI Attackers"?
Which concept is most central to "Cyber Risk and Autonomous AI Attackers"?
Which use of AI fits this topic best?
What should a careful learner remember about "What they cannot reliably do yet"?
You want to use AI after this lesson. What is the safest next step?
How should AI output about autonomous agent be treated?
Name one way to verify an AI answer about autonomous agent.
Which action would help you apply "Cyber Risk and Autonomous AI Attackers" responsibly?