Alignment is not a vibes debate. It is a concrete technical problem about getting systems to pursue goals we actually want. Here is what researchers work on when they say they work on alignment.

50 minNot started

Start

Ethics & Society○

Labor and AI: What the Data Actually Says

Most predictions about AI and jobs are either panic or dismissal. Here is what the best evidence through 2025 actually shows — including what is overstated.

45 minNot started

Start

Ethics & Society○

AI Safety Orgs and How They Actually Operate

The AI safety ecosystem is small, influential, and often misunderstood. Here is who does what, how they get funded, and how to tell real work from rhetoric.

40 minNot started

Start

Agentic AI○

MCP Deep Dive: The USB-C for AI Tools

Model Context Protocol is the most important open standard in agents. One protocol, 1,200+ servers, and your agent can plug into almost any system. Here's how it actually works.

55 minNot started

Start

Agentic AI○

Capstone: Build and Ship a Real Agent

Everything comes together. Design, code, test, secure, and ship a production-quality agent with open-source code you can fork today.

75 minNot started

Start

Tools Literacy○

Building a Personal AI Stack for School and Career

Assemble the four or five AI tools that actually belong in your daily life. A tested template for the stack that earns its keep.

38 minNot started

Start

Tools Literacy○

Projects and Spaces — Persistent Context Is the Future

Claude Projects, ChatGPT Projects, Notion AI, Perplexity Spaces. How persistent context changes AI from search box to actual assistant.

40 minNot started

Start

Tools Literacy○

Perplexity Comet — the AI browser

Perplexity Comet is a full web browser that treats AI as a first-class citizen. It reads, summarizes, and acts on pages you visit.

30 minNot started

Start

AI Foundations○

AP Biology: Using AI to Survive the Vocab Tsunami

AP Bio has roughly a thousand terms and four big concepts. NotebookLM and Claude Projects can turn your textbook into a custom tutor that actually knows what you are studying.

32 minNot started

Start

AI Foundations○

Debate Prep: Researching Both Sides Fast

Debate rewards knowing the other side's best argument better than they do. AI is built for exactly this kind of fast, balanced research.

30 minNot started

Start

Careers & Pathways○

Doctor in 2026: What AI Actually Does to Your Day

Ambient scribes, diagnostic copilots, and evidence engines sit in every exam room. Here is what a physician's workday now looks like — and what still rests on your judgment.

42 minNot started

Start

Careers & Pathways○

Medical Researcher in 2026: AlphaFold Changed Biology Forever

Literature review in minutes, protein structures on demand, AI-proposed drug candidates. The discovery cycle has compressed — but the human posing the question still sets the direction.

42 minNot started

Start

Careers & Pathways○

Robotics Engineer in 2026: Foundation Models Walk Around

NVIDIA GR00T, Physical Intelligence π0, and Figure Helix took the vision-language-action paradigm from research paper to factory floor. This is the hottest hardware-software frontier.

42 minNot started

Start

Careers & Pathways○

Lawyer in 2026: Directing the Associate That Never Sleeps

Harvey and CoCounsel research case law, draft briefs, and summarize depositions. The paralegal-and-first-year tier of the profession is genuinely shrinking. The judgment tier is thriving. What AI touches Legal research — Lexis+ AI, Westlaw Precision, Paxton AI, vLex Vincent search and synthesize case law.

44 minNot started

Start

Careers & Pathways○

Paralegal in 2026: Orchestrating the AI Workflow

The role has inverted: paralegals who used to do research and doc prep now direct the AI that does it. The job is not gone — but it is changing faster than any legal role.

36 minNot started

Start

Careers & Pathways○

Financial Analyst in 2026: Parse 10-Ks in Seconds, Judge Them for Hours

AlphaSense, Hebbia, and Bloomberg GPT read every filing before you do. The edge is the question you ask and the thesis you write.

38 minNot started

Start

Careers & Pathways○

Management Consultant in 2026: Decks at the Speed of Thought

McKinsey Lilli, Gamma, and Claude generate first-draft slides and research in minutes. The real consulting work — client relationships and implementation — is more human than ever.

36 minNot started

Start

Careers & Pathways○

Product Manager in 2026: Specs, Mocks, and Prototypes by Lunch

v0, Linear AI, and Dovetail synthesize research, draft PRDs, and ship prototypes in hours. The PM role has leveled up from communicator to quasi-builder.

40 minNot started

Start

Careers & Pathways○

Marine Biologist in 2026: Computer Vision in the Reef

Species identification from underwater footage used to take a season. A model trained on 8 million fish does it in a single afternoon.

30 minNot started

Start

Careers & Pathways○

Fashion Designer in 2026: Moodboards to Samples in a Week

Generative imagery, 3D garment sim, and on-demand pattern-making have collapsed the front end. Taste is still the scarce resource.

28 minNot started

Start

Careers & Pathways○

Brand Strategist in 2026: Signals, Stories, and Synthetic Audiences

AI runs the research and drafts the decks. The strategist still has to decide what a brand means.

26 minNot started

Start

Careers & Pathways○

Park Ranger in 2026: AI at the Trailhead

Wildfire detection, wildlife cameras, and visitor demand modeling changed the job. The ranger still walks the trail at dawn.

26 minNot started

Start

Tools Literacy○

Codex CLI: OpenAI's Answer to Claude Code

Codex CLI is OpenAI's open-source terminal coding agent. Look at how it compares to Claude Code, what it does uniquely, and why it matters to non-Anthropic shops.

35 minNot started

Start

Tools Literacy○

Consensus: The AI Search Engine That Only Knows Science

Consensus searches 200M+ academic papers and gives evidence-based answers. Deep look at how researchers use it, what it does differently from Perplexity, and its limits.

35 minNot started

Start

Tools Literacy○

Elicit: The AI Research Assistant For Systematic Reviews

Elicit automates slow parts of academic research: finding papers, extracting data, building literature matrices. Look at what it saves PhDs 20 hours a week.

38 minNot started

Start

Ethics & Society○

Constitutional AI: A Deep Dive on Anthropic's Approach

What a constitution actually contains, how the training loop works, where the research is now, and the honest trade-offs.

45 minNot started

Start

Ethics & Society○

Mechanistic Interpretability: Reading the Model's Mind

Sparse autoencoders, features, circuits. How researchers try to see what a model actually thinks, and why it may be the most strategically important safety work.

55 minNot started

Start

AI Foundations○

Golden-Dataset Curation

A golden dataset is a curated set of hard, representative examples you trust completely. It is the backbone of every serious eval.

40 minNot started

Start

AI Foundations○

Emergence vs. Scaling

Some capabilities grow smoothly with scale. Others seem to appear out of nowhere. Telling them apart is a whole research program. The Big Question Is AI capability a smooth climb or a staircase?

40 minNot started

Start

AI Foundations○

Running a Literature Review With AI

AI turns weeks of literature review into days — if you know how to use it. Here is a workflow that actually works.

35 minNot started

Start

AI Foundations○

Taking Good Notes With NotebookLM

NotebookLM turns a pile of PDFs into a searchable, askable brain. Here is how to build a research notebook that keeps paying dividends.

32 minNot started

Start

AI Foundations○

Citing AI-Assisted Work Honestly

The norms for disclosing AI use in research are still being written. Here is the emerging consensus and how to stay on the right side of it.

30 minNot started

Start

AI Foundations○

Running Your Own Small Experiment

The best way to truly understand an AI claim is to try it yourself. Here is how to run a small experiment that actually teaches you something.

45 minNot started

Start

AI Foundations○

Synthetic Data: When AI Trains on AI

Real data is expensive, private, or scarce. Synthetic data is generated by models themselves. It is rapidly becoming as important as scraped data.

32 minNot started

Start

AI Foundations○

Labeling at Scale: The Hidden Human Layer

Behind every supervised model is an army of human labelers. Understanding how labeling works is understanding who really builds AI.

35 minNot started

Start

AI Foundations○

Big Data vs. Good Data: The Tradeoff

The old mantra was more data always wins. The new reality is more complicated. Sometimes a small, hand-crafted dataset beats a giant messy one.

30 minNot started

Start

AI Foundations○

Data Cards: The Label on Your Dataset

A data card is like a nutrition label for a dataset: who collected it, how, what is in it, and what it should not be used for.

28 minNot started

Start

AI Foundations○

Representation Bias: Who Is in the Data?

If your training data is 90 percent men, your model will work worse for women. Representation bias is the most pervasive issue in AI.

32 minNot started

Start

AI Foundations○

Measurement Bias: When the Ruler Is Bent

Measurement bias happens when the thing you measure is a flawed stand-in for what you actually care about. It is subtle and surprisingly common.

30 minNot started

Start

AI Foundations○

Historical Bias: The COMPAS Case Study

Even accurate data can encode an unjust history. The COMPAS recidivism tool shows what happens when AI learns from a biased past.

35 minNot started

Start

AI Foundations○

Label Noise: When Your Ground Truth Is Wrong

Every labeled dataset has mistakes. Studies have found error rates of 3 to 6 percent in famous benchmarks like ImageNet. Noisy labels confuse models and mislead evaluations.

30 minNot started

Start

AI Foundations○

Inter-Annotator Agreement: Measuring Reality

If two reasonable humans cannot agree on a label, neither can a model. Inter-annotator agreement tells you if a task is even well-defined.

28 minNot started

Start

AI Foundations○

Underrepresented Groups: Building Inclusive Datasets

Small populations get hurt first when datasets are built carelessly. Fixing this requires intentional collection, not just better algorithms.

30 minNot started

Start

AI Foundations○

Geographic Bias: The West Dominates

AI has a geography problem. Training data over-represents North America and Europe, and it shows in subtle and not-so-subtle ways.

28 minNot started

Start

AI Foundations○

Language Bias: Why English Dominates AI

English is 6 percent of the world's speakers but 50+ percent of the training data. This asymmetry shapes every model we use.

30 minNot started

Start

AI Foundations○

Audit Methodology: How to Check a Dataset

A data audit is a structured process to find bias, errors, and ethical issues before a model goes live. Every creator should know how.

35 minNot started

Start

AI Foundations○

Debiasing: What Actually Works and What Does Not

Everyone wants to debias AI. But the literature is full of methods that look good on paper and fail in the wild. Here is the honest scorecard.

35 minNot started

Start

AI Foundations○

Mean, Median, Mode: Three Kinds of Average

Saying the average is 50,000 dollars can mean three different things. Picking the wrong kind of average is how statistics starts lying to you.

30 minNot started

Start

AI Foundations○

Variance and Standard Deviation: How Spread Out?

Mean tells you the center. Variance and standard deviation tell you the spread. Without both, you are missing half the story.

30 minNot started

Start

AI Foundations○

Distributions: Normal, Power-Law, and Bimodal

Data comes in shapes. The shape determines which tools you can use, and which assumptions will silently betray you.

32 minNot started

Start

AI Foundations○

Log-Scale Thinking: When Linear Lies

Some things grow multiplicatively, not additively. Log scales reveal patterns that linear scales hide, especially for anything related to scale or growth.

28 minNot started

Start

AI Foundations○

Simpson's Paradox: When Aggregated Data Lies

A trend that appears in every subgroup can reverse when you combine the groups. This is Simpson's Paradox, and it hides in plain sight.

30 minNot started

Start

AI Foundations○

Outliers: Keep Them, Remove Them, or Investigate?

A single weird value can distort your entire analysis. But outliers are also where the most interesting stories live. Knowing when to remove them is an art.

30 minNot started

Start

AI Foundations○

Resampling: Making Data Work Harder

Resampling techniques draw new samples from your data to estimate uncertainty, balance classes, or validate models. It is one of the most underused superpowers in statistics.

30 minNot started

Start

AI Foundations○

Bootstrapping: Confidence Without a Formula

Bootstrapping estimates the uncertainty of any statistic, even when you have no clean mathematical formula. It is simple, powerful, and surprisingly deep.

32 minNot started

Start

AI Foundations○

Who Owns the Data in a Dataset?

Ownership of data is not one question but a tangle of rights: copyright, contract, privacy, and control. Untangling them is essential for responsible use.

30 minNot started

Start

AI Foundations○

Copyright vs. Terms of Service: Two Different Fights

Violating a website's Terms of Service and violating copyright are different legal problems. Understanding the distinction is critical for data work. Fair use in training The argument AI companies make is that training is transformative fair use.

28 minNot started

Start

AI Foundations○

GDPR Basics: The Regulation That Changed Data

Europe's General Data Protection Regulation (2018) reshaped how the world handles personal data. Understanding its core concepts is now essential. In 2023, Italy briefly banned ChatGPT over GDPR concerns.

32 minNot started

Start

AI Foundations○

The Data Broker Ecosystem: The Shadow Industry

Thousands of companies you have never heard of trade your personal data every second. Understanding this invisible market is understanding modern privacy. Brokers and AI training Much training data for specialized models (ad targeting, credit scoring, risk assessment) comes from brokers.

30 minNot started

Start

AI Foundations○

Opt-Out Mechanisms: The Real State of Consent

Many AI companies now offer opt-outs from training. But how well do they actually work, and what are the catches?

28 minNot started

Start

AI Foundations○

robots.txt and ai.txt: The Web's Consent Signals

A 30-year-old simple text file, robots.txt, is how the web has tried to regulate crawlers. The new ai.txt proposal aims to refine this for the AI era.

25 minNot started

Start

AI Foundations○

Licensing Your Own Datasets

If you build a dataset, how you license it determines who can use it and how. Picking the right license matters as much as the data itself.

28 minNot started

Start

AI Foundations○

Anonymization and Why It Often Fails

Removing names does not make data anonymous. Combinations of a few seemingly innocent fields can re-identify nearly anyone.

32 minNot started

Start

AI Foundations○

Your First Dataset Project, End to End

A complete walkthrough from question to shareable dataset. The first project is the hardest; this lesson gets you to the other side.

45 minNot started

Start

AI Foundations○

Jupyter Notebook Basics

Jupyter is the data scientist's notebook. Code, output, and narrative in one document. Learning Jupyter well pays dividends for every future project.

30 minNot started

Start

AI Foundations○

Pandas Fundamentals in 40 Minutes

Pandas is the Python library that made data science what it is today. Ten verbs get you through 90 percent of day-to-day data work.

45 minNot started

Start

AI Foundations○

Reading and Writing CSV and JSON in Python

These two formats are the bread and butter of data interchange. Handling them well means handling edge cases well.

30 minNot started

Start

AI Foundations○

Creating Your First Small Labeled Dataset

Creating a dataset from scratch teaches you more than using someone else's. Here is how to build a high-quality small labeled dataset for a real task.

45 minNot started

Start

AI Foundations○

Sharing Datasets on Hugging Face Hub

Hugging Face Hub is the GitHub of AI data and models. Uploading a dataset there makes it instantly accessible to millions of practitioners.

40 minNot started

Start

AI Foundations○

ResNets and the Depth Breakthrough

A 2015 paper from Microsoft Research let neural networks go 150 layers deep by adding a shortcut.

28 minNot started

Start

AI Foundations○

Civics and Government: AI for Understanding the News

A lot of civics class is pretending you read the news. AI makes it possible to actually understand a bill, a court case, or a political ad in under ten minutes.

31 minNot started

Start

Creators · Ages 14–17

What can I build with it?

The full LLM pipeline, agentic AI with OpenClaw + Ollama, subscription-tier literacy, and a real capstone.

Meet your guide: Atlas — a minimal octahedron

← All lessons

Your progress

Loading your progress…

Where should I start?

Pick the door that sounds like you.

Chapters

Pick a chapter to settle into.

Chapter 1

Tools Literacy

Which model when? Claude, GPT, Gemini, Grok — and how to choose.

354 lessons

Browse track →

Chapter 2

AI Foundations

The core ideas — what AI is, how it learns, what it can and can't do.

273 lessons

Browse track →

Chapter 3

Model Families

Every family in the industry. Variants, strengths, limits, pricing.

272 lessons

Browse track →

Chapter 4

AI-Assisted Coding

Claude Code, Codex, Cursor, Windsurf. Real code with real agents.

234 lessons

Browse track →

Chapter 5

Ethics & Society

Bias, safety, labor, copyright — the questions that decide how AI lands.

196 lessons

Browse track →

Chapter 6

Agentic AI

Agents that do things — MCP, tool use, multi-model orchestration.

181 lessons

Browse track →

Chapter 7

Creative AI

Image, video, audio, music — the generative creative stack.

180 lessons

Browse track →

Chapter 8

Research & Analysis

Literature reviews, source checking, synthesis, and evidence-aware workflows.

163 lessons

Browse track →

Chapter 9

Careers & Pathways

80+ jobs mapped to the AI tools that transform them.

110 lessons

Browse track →

Chapter 10

AI for Business

Entrepreneurship, productivity, automation. For creator-tier career prep.

39 lessons

Browse track →

Chapter 11

Prompting

From first prompts to advanced patterns. The most practical skill in AI.

35 lessons

Browse track →

Chapter 12

AI for Educators

Lesson planning, feedback, differentiation, and classroom-safe AI practice.

30 lessons

Browse track →

Chapter 13

AI for Parents

Helping families talk about AI, schoolwork, safety, creativity, and trust.

20 lessons

Browse track →

Chapter 14

Safety & Governance

Practical safety systems, evaluation, provenance, policy, and human oversight.

5 lessons

Browse track →

Chapter 15

AI in Healthcare

Clinical documentation, patient education, operations, and safety boundaries.

4 lessons

Browse track →

Chapter 16

Operations & Automation

SOPs, triage, workflows, and the practical mechanics of AI-enabled teams.

3 lessons

Browse track →

Chapter 17

AI for Finance

Reports, models, controls, analysis, and the judgment calls finance teams face.

3 lessons

Browse track →

Chapter 18

AI for Legal Work

Contract review, research, privilege, confidentiality, and legal workflow support.

2 lessons

Browse track →

Skill lanes

All Creator Career Preview Coder Designer Researcher

Focus

All focus areas Coding Creative Research Career Preview

Modules · 72

Work through in order, or pick what looks good.

AI Foundations○

Open vs. Closed Models: Philosophy and Strategy

Open-source AI is both a technical movement and a political one. Understand the arguments so you can pick a stack and defend it.

45 minNot started

Start

Ethics & Society○

AI Alignment: The Actual Technical Problem

Alignment is not a vibes debate. It is a concrete technical problem about getting systems to pursue goals we actually want. Here is what researchers work on when they say they work on alignment.

50 minNot started

Start

Ethics & Society○

Labor and AI: What the Data Actually Says

Most predictions about AI and jobs are either panic or dismissal. Here is what the best evidence through 2025 actually shows — including what is overstated.

45 minNot started

Start

Ethics & Society○

AI Safety Orgs and How They Actually Operate

The AI safety ecosystem is small, influential, and often misunderstood. Here is who does what, how they get funded, and how to tell real work from rhetoric.

40 minNot started

Start

Agentic AI○

MCP Deep Dive: The USB-C for AI Tools

Model Context Protocol is the most important open standard in agents. One protocol, 1,200+ servers, and your agent can plug into almost any system. Here's how it actually works.

55 minNot started

Start

Agentic AI○

Capstone: Build and Ship a Real Agent

Everything comes together. Design, code, test, secure, and ship a production-quality agent with open-source code you can fork today.

75 minNot started

Start

Tools Literacy○

Building a Personal AI Stack for School and Career

Assemble the four or five AI tools that actually belong in your daily life. A tested template for the stack that earns its keep.

38 minNot started

Start

Tools Literacy○

Projects and Spaces — Persistent Context Is the Future

Claude Projects, ChatGPT Projects, Notion AI, Perplexity Spaces. How persistent context changes AI from search box to actual assistant.

40 minNot started

Start

Tools Literacy○

Perplexity Comet — the AI browser

Perplexity Comet is a full web browser that treats AI as a first-class citizen. It reads, summarizes, and acts on pages you visit.

30 minNot started

Start

AI Foundations○

AP Biology: Using AI to Survive the Vocab Tsunami

AP Bio has roughly a thousand terms and four big concepts. NotebookLM and Claude Projects can turn your textbook into a custom tutor that actually knows what you are studying.

32 minNot started

Start

AI Foundations○

Debate Prep: Researching Both Sides Fast

Debate rewards knowing the other side's best argument better than they do. AI is built for exactly this kind of fast, balanced research.

30 minNot started

Start

Careers & Pathways○

Doctor in 2026: What AI Actually Does to Your Day

Ambient scribes, diagnostic copilots, and evidence engines sit in every exam room. Here is what a physician's workday now looks like — and what still rests on your judgment.

42 minNot started

Start

Careers & Pathways○

Medical Researcher in 2026: AlphaFold Changed Biology Forever

Literature review in minutes, protein structures on demand, AI-proposed drug candidates. The discovery cycle has compressed — but the human posing the question still sets the direction.

42 minNot started

Start

Careers & Pathways○

Robotics Engineer in 2026: Foundation Models Walk Around

NVIDIA GR00T, Physical Intelligence π0, and Figure Helix took the vision-language-action paradigm from research paper to factory floor. This is the hottest hardware-software frontier.

42 minNot started

Start

Careers & Pathways○

Lawyer in 2026: Directing the Associate That Never Sleeps

44 minNot started

Start

Careers & Pathways○

Paralegal in 2026: Orchestrating the AI Workflow

The role has inverted: paralegals who used to do research and doc prep now direct the AI that does it. The job is not gone — but it is changing faster than any legal role.

36 minNot started

Start

Careers & Pathways○

Financial Analyst in 2026: Parse 10-Ks in Seconds, Judge Them for Hours

AlphaSense, Hebbia, and Bloomberg GPT read every filing before you do. The edge is the question you ask and the thesis you write.

38 minNot started

Start

Careers & Pathways○

Management Consultant in 2026: Decks at the Speed of Thought

McKinsey Lilli, Gamma, and Claude generate first-draft slides and research in minutes. The real consulting work — client relationships and implementation — is more human than ever.

36 minNot started

Start

Careers & Pathways○

Product Manager in 2026: Specs, Mocks, and Prototypes by Lunch

v0, Linear AI, and Dovetail synthesize research, draft PRDs, and ship prototypes in hours. The PM role has leveled up from communicator to quasi-builder.

40 minNot started

Start

Careers & Pathways○

Marine Biologist in 2026: Computer Vision in the Reef

Species identification from underwater footage used to take a season. A model trained on 8 million fish does it in a single afternoon.

30 minNot started

Start

Careers & Pathways○

Fashion Designer in 2026: Moodboards to Samples in a Week

Generative imagery, 3D garment sim, and on-demand pattern-making have collapsed the front end. Taste is still the scarce resource.

28 minNot started

Start

Careers & Pathways○

Brand Strategist in 2026: Signals, Stories, and Synthetic Audiences

AI runs the research and drafts the decks. The strategist still has to decide what a brand means.

26 minNot started

Start

Careers & Pathways○

Park Ranger in 2026: AI at the Trailhead

Wildfire detection, wildlife cameras, and visitor demand modeling changed the job. The ranger still walks the trail at dawn.

26 minNot started

Start

Tools Literacy○

Codex CLI: OpenAI's Answer to Claude Code

Codex CLI is OpenAI's open-source terminal coding agent. Look at how it compares to Claude Code, what it does uniquely, and why it matters to non-Anthropic shops.

35 minNot started

Start

Tools Literacy○

Consensus: The AI Search Engine That Only Knows Science

Consensus searches 200M+ academic papers and gives evidence-based answers. Deep look at how researchers use it, what it does differently from Perplexity, and its limits.

35 minNot started

Start

Tools Literacy○

Elicit: The AI Research Assistant For Systematic Reviews

Elicit automates slow parts of academic research: finding papers, extracting data, building literature matrices. Look at what it saves PhDs 20 hours a week.

38 minNot started

Start

Ethics & Society○

Constitutional AI: A Deep Dive on Anthropic's Approach

What a constitution actually contains, how the training loop works, where the research is now, and the honest trade-offs.

45 minNot started

Start

Ethics & Society○

Mechanistic Interpretability: Reading the Model's Mind

Sparse autoencoders, features, circuits. How researchers try to see what a model actually thinks, and why it may be the most strategically important safety work.

55 minNot started

Start

AI Foundations○

Golden-Dataset Curation

A golden dataset is a curated set of hard, representative examples you trust completely. It is the backbone of every serious eval.

40 minNot started

Start

AI Foundations○

Emergence vs. Scaling

Some capabilities grow smoothly with scale. Others seem to appear out of nowhere. Telling them apart is a whole research program. The Big Question Is AI capability a smooth climb or a staircase?

40 minNot started

Start

AI Foundations○

Running a Literature Review With AI

AI turns weeks of literature review into days — if you know how to use it. Here is a workflow that actually works.

35 minNot started

Start

AI Foundations○

Taking Good Notes With NotebookLM

NotebookLM turns a pile of PDFs into a searchable, askable brain. Here is how to build a research notebook that keeps paying dividends.

32 minNot started

Start

AI Foundations○

Citing AI-Assisted Work Honestly

The norms for disclosing AI use in research are still being written. Here is the emerging consensus and how to stay on the right side of it.

30 minNot started

Start

AI Foundations○

Running Your Own Small Experiment

The best way to truly understand an AI claim is to try it yourself. Here is how to run a small experiment that actually teaches you something.

45 minNot started

Start

AI Foundations○

Synthetic Data: When AI Trains on AI

Real data is expensive, private, or scarce. Synthetic data is generated by models themselves. It is rapidly becoming as important as scraped data.

32 minNot started

Start

AI Foundations○

Labeling at Scale: The Hidden Human Layer

Behind every supervised model is an army of human labelers. Understanding how labeling works is understanding who really builds AI.

35 minNot started

Start

AI Foundations○

Big Data vs. Good Data: The Tradeoff

The old mantra was more data always wins. The new reality is more complicated. Sometimes a small, hand-crafted dataset beats a giant messy one.

30 minNot started

Start

AI Foundations○

Data Cards: The Label on Your Dataset

A data card is like a nutrition label for a dataset: who collected it, how, what is in it, and what it should not be used for.

28 minNot started

Start

AI Foundations○

Representation Bias: Who Is in the Data?

If your training data is 90 percent men, your model will work worse for women. Representation bias is the most pervasive issue in AI.

32 minNot started

Start

AI Foundations○

Measurement Bias: When the Ruler Is Bent

Measurement bias happens when the thing you measure is a flawed stand-in for what you actually care about. It is subtle and surprisingly common.

30 minNot started

Start

AI Foundations○

Historical Bias: The COMPAS Case Study

Even accurate data can encode an unjust history. The COMPAS recidivism tool shows what happens when AI learns from a biased past.

35 minNot started

Start

AI Foundations○

Label Noise: When Your Ground Truth Is Wrong

Every labeled dataset has mistakes. Studies have found error rates of 3 to 6 percent in famous benchmarks like ImageNet. Noisy labels confuse models and mislead evaluations.

30 minNot started

Start

AI Foundations○

Inter-Annotator Agreement: Measuring Reality

If two reasonable humans cannot agree on a label, neither can a model. Inter-annotator agreement tells you if a task is even well-defined.

28 minNot started

Start

AI Foundations○

Underrepresented Groups: Building Inclusive Datasets

Small populations get hurt first when datasets are built carelessly. Fixing this requires intentional collection, not just better algorithms.

30 minNot started

Start

AI Foundations○

Geographic Bias: The West Dominates

AI has a geography problem. Training data over-represents North America and Europe, and it shows in subtle and not-so-subtle ways.

28 minNot started

Start

AI Foundations○

Language Bias: Why English Dominates AI

English is 6 percent of the world's speakers but 50+ percent of the training data. This asymmetry shapes every model we use.

30 minNot started

Start

AI Foundations○

Audit Methodology: How to Check a Dataset

A data audit is a structured process to find bias, errors, and ethical issues before a model goes live. Every creator should know how.

35 minNot started

Start

AI Foundations○

Debiasing: What Actually Works and What Does Not

Everyone wants to debias AI. But the literature is full of methods that look good on paper and fail in the wild. Here is the honest scorecard.

35 minNot started

Start

AI Foundations○

Mean, Median, Mode: Three Kinds of Average

Saying the average is 50,000 dollars can mean three different things. Picking the wrong kind of average is how statistics starts lying to you.

30 minNot started

Start

AI Foundations○

Variance and Standard Deviation: How Spread Out?

Mean tells you the center. Variance and standard deviation tell you the spread. Without both, you are missing half the story.

30 minNot started

Start

AI Foundations○

Distributions: Normal, Power-Law, and Bimodal

Data comes in shapes. The shape determines which tools you can use, and which assumptions will silently betray you.

32 minNot started

Start

AI Foundations○

Log-Scale Thinking: When Linear Lies

Some things grow multiplicatively, not additively. Log scales reveal patterns that linear scales hide, especially for anything related to scale or growth.

28 minNot started

Start

AI Foundations○

Simpson's Paradox: When Aggregated Data Lies

A trend that appears in every subgroup can reverse when you combine the groups. This is Simpson's Paradox, and it hides in plain sight.

30 minNot started

Start

AI Foundations○

Outliers: Keep Them, Remove Them, or Investigate?

A single weird value can distort your entire analysis. But outliers are also where the most interesting stories live. Knowing when to remove them is an art.

30 minNot started

Start

AI Foundations○

Resampling: Making Data Work Harder

Resampling techniques draw new samples from your data to estimate uncertainty, balance classes, or validate models. It is one of the most underused superpowers in statistics.

30 minNot started

Start

AI Foundations○

Bootstrapping: Confidence Without a Formula

Bootstrapping estimates the uncertainty of any statistic, even when you have no clean mathematical formula. It is simple, powerful, and surprisingly deep.

32 minNot started

Start

AI Foundations○

Who Owns the Data in a Dataset?

Ownership of data is not one question but a tangle of rights: copyright, contract, privacy, and control. Untangling them is essential for responsible use.

30 minNot started

Start

AI Foundations○

Copyright vs. Terms of Service: Two Different Fights

28 minNot started

Start

AI Foundations○

GDPR Basics: The Regulation That Changed Data

32 minNot started

Start

AI Foundations○

The Data Broker Ecosystem: The Shadow Industry

30 minNot started

Start

AI Foundations○

Opt-Out Mechanisms: The Real State of Consent

Many AI companies now offer opt-outs from training. But how well do they actually work, and what are the catches?

28 minNot started

Start

AI Foundations○

robots.txt and ai.txt: The Web's Consent Signals

A 30-year-old simple text file, robots.txt, is how the web has tried to regulate crawlers. The new ai.txt proposal aims to refine this for the AI era.

25 minNot started

Start

AI Foundations○

Licensing Your Own Datasets

If you build a dataset, how you license it determines who can use it and how. Picking the right license matters as much as the data itself.

28 minNot started

Start

AI Foundations○

Anonymization and Why It Often Fails

Removing names does not make data anonymous. Combinations of a few seemingly innocent fields can re-identify nearly anyone.

32 minNot started

Start

AI Foundations○

Your First Dataset Project, End to End

A complete walkthrough from question to shareable dataset. The first project is the hardest; this lesson gets you to the other side.

45 minNot started

Start

AI Foundations○

Jupyter Notebook Basics

Jupyter is the data scientist's notebook. Code, output, and narrative in one document. Learning Jupyter well pays dividends for every future project.

30 minNot started

Start

AI Foundations○

Pandas Fundamentals in 40 Minutes

Pandas is the Python library that made data science what it is today. Ten verbs get you through 90 percent of day-to-day data work.

45 minNot started

Start

AI Foundations○

Reading and Writing CSV and JSON in Python

These two formats are the bread and butter of data interchange. Handling them well means handling edge cases well.

30 minNot started

Start

AI Foundations○

Creating Your First Small Labeled Dataset

Creating a dataset from scratch teaches you more than using someone else's. Here is how to build a high-quality small labeled dataset for a real task.

45 minNot started

Start

AI Foundations○

Sharing Datasets on Hugging Face Hub

Hugging Face Hub is the GitHub of AI data and models. Uploading a dataset there makes it instantly accessible to millions of practitioners.

40 minNot started

Start

AI Foundations○

ResNets and the Depth Breakthrough

A 2015 paper from Microsoft Research let neural networks go 150 layers deep by adding a shortcut.

28 minNot started

Start

AI Foundations○

Civics and Government: AI for Understanding the News

A lot of civics class is pretending you read the news. AI makes it possible to actually understand a bill, a court case, or a political ad in under ten minutes.

31 minNot started

Start