Loading lesson…
The best way to truly understand an AI claim is to try it yourself. Here is how to run a small experiment that actually teaches you something.
You do not need a GPU cluster to do AI research. The best small experiments are tiny, specific, and fast — a clear question answered in a Jupyter notebook over an afternoon. Real understanding comes from running them.
Wei et al. (2022) showed chain-of-thought helps on hard problems. Does it help on grade-school problems too? Run 30 single-digit addition problems, once with 'think step by step' and once without, on the same model. Compare. If you see no difference, you have replicated a real published finding (CoT doesn't help easy tasks). That is real science.
# Tiny experiment skeleton import anthropic client = anthropic.Anthropic() problems = [ ("2 + 3", "5"), ("7 + 8", "15"), # 28 more ] def run(prompt_prefix): correct = 0 for q, a in problems: resp = client.messages.create( model="claude-opus-4-7", max_tokens=256, messages=[{"role":"user","content":prompt_prefix+q}] ) if a in resp.content[0].text: correct += 1 return correct / len(problems) print("Plain:", run("")) print("CoT:", run("Think step by step. "))An afternoon's worth of real AI research, in 20 linesThe most exciting phrase to hear in science is not 'Eureka!' but 'That's funny'
— Attributed to Isaac Asimov
The big idea: a one-afternoon experiment teaches you more about AI than a month of reading. Pick a question, run the test, write it up. Repeat.
8 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-creators-run-your-own-experiment
What is the main idea of "Running Your Own Small Experiment"?
Which concept is most central to "Running Your Own Small Experiment"?
Which use of AI fits this topic best?
What should a careful learner remember about "Negative results are results"?
You want to use AI after this lesson. What is the safest next step?
How should AI output about experiment be treated?
Name one way to verify an AI answer about experiment.
Which action would help you apply "Running Your Own Small Experiment" responsibly?