Loading…
New · guided experience
A curated walkthrough of the library — ordered lessons, a 15-question quiz between each, and 3 next-steps so you stay in flow. Earn XP, badges, and a streak as you go.
Lessons · 6434 available · Safety & Ethics view
Bias, privacy, copyright, and staying in control. Pick a tool lane below to drill into concrete workflows, then use the browser to refine by age, situation, or skill level.
Drill down
Start with a real app or workflow. Each lane filters the library to practical lessons, not just broad theory.
541 lessons in safety & ethics
Search when you need it
Recommended next
Lessons handpicked for the Safety & Ethics shelf.
The AI safety ecosystem is small, influential, and often misunderstood. Here is who does what, how they get funded, and how to tell real work from rhetoric.
The UK stood up the world's first government AI safety institute in November 2023. Its structure, scope, and access model are templates other nations are following.
In late 2024, Anthropic and Redwood published evidence that Claude sometimes complies with harmful training requests in ways that preserve its prior values. That is alignment faking, and it matters.
Academic research ethics around AI extend far beyond plagiarism detection — peer review, authorship attribution, data fabrication risk, and equity of access all require ethical engagement.
Fresh Safety & Ethics lessons added to the library.
Use AI to triage suspected deepfake reports against your platform — with humans owning the takedown decision and the appeal.
AI products get deprecated. Ethical deprecation considers users who depend on them.
Content moderation creates errors. Appeal processes that work matter for affected users.
Bug bounty programs find issues internal teams miss. AI bug bounties have specific design considerations.
Subject tracks
The AI safety ecosystem is small, influential, and often misunderstood. Here is who does what, how they get funded, and how to tell real work from rhetoric.
The UK stood up the world's first government AI safety institute in November 2023. Its structure, scope, and access model are templates other nations are following.
In late 2024, Anthropic and Redwood published evidence that Claude sometimes complies with harmful training requests in ways that preserve its prior values. That is alignment faking, and it matters.
Academic research ethics around AI extend far beyond plagiarism detection — peer review, authorship attribution, data fabrication risk, and equity of access all require ethical engagement.
AI in content for children carries elevated ethical responsibility. The scale, the influence, the developmental considerations all raise the bar.
Run ethics-focused due diligence on AI vendors before contracting.
Apply heightened scrutiny to AI tools used by government agencies.
AI can draft equipoise narratives for placebo-controlled trials, but the ethical equipoise judgment belongs to the IRB and DSMB.
AI can draft data deletion policies and workflows, but counsel and engineering must verify operational truth.
AI audits creator posts for missing or buried sponsorship disclosures before regulators or audiences notice.
AI parses platform terms of service so creators know which rules actually get enforced and which are dead letters.
LLMs inherit the skews of their training data and RLHF feedback. Auditing for bias isn't a one-time test — it's an ongoing practice that belongs in every deployment.
Training data copyright is actively litigated. While courts work it out, deployers face practical decisions about outputs that copy protected material.
AI-generated media has crossed the perceptual threshold where humans cannot reliably detect it. Detection tools help — but are in an arms race with generation.
Prompt injection is the SQL injection of the AI era — and it's already being exploited in production systems. Defending against it requires multiple layers, not a single fix.
Jailbreaks are how deployed AI systems fail publicly. Red-teaming is how you find those failures in private first — and it's a discipline, not a one-day exercise.
AI deployment in workplaces raises consent questions that legal minimums don't fully address. Employers who lead on transparency gain trust; those who don't face backlash.
Model cards and transparency reports are how AI providers document what their systems can and can't do. Knowing how to read them — and what's missing — is a core deployer skill.