Lesson 276 of 2116
Running Your Own Small Experiment
The best way to truly understand an AI claim is to try it yourself. Here is how to run a small experiment that actually teaches you something.
Lesson map
What this lesson covers
Learning path
The main moves in order
1. Be a Scientist, Not Just a Reader
2. Experiment
3. Hypothesis
4. Replication
Section 1
Be a Scientist, Not Just a Reader
You do not need a GPU cluster to do AI research. The best small experiments are tiny, specific, and fast — a clear question answered in a Jupyter notebook over an afternoon. Real understanding comes from running them.
A 7-step experiment recipe
1. Pick one specific hypothesis (e.g., "Claude beats GPT on Finnish translation")
2. Write down what you expect to see before you start
3. Build the smallest possible test set (30-50 items)
4. Pick two or three models to compare
5. Run the test, capture raw outputs, do not over-engineer
6. Grade with a rubric (LLM-as-judge or human)
7. Write up the result in one page, including what surprised you
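Step 6 can be sketched as a tiny rubric-grading helper. This is a hypothetical sketch, not a library API: the rubric text and the `judge` callable are assumptions, and in a real run `judge` would wrap a model API call that returns the judge model's verdict as text.

```python
# Minimal LLM-as-judge grading sketch. RUBRIC and `judge` are
# illustrative assumptions; swap in a real model call for `judge`.
RUBRIC = (
    "Score the answer PASS or FAIL.\n"
    "PASS: the final numeric answer matches the reference exactly.\n"
    "FAIL: anything else, including a correct answer buried in wrong work."
)

def grade(question, reference, answer, judge):
    """Ask the judge to apply the rubric; return True for PASS."""
    prompt = (
        f"{RUBRIC}\n\nQuestion: {question}\n"
        f"Reference: {reference}\nAnswer: {answer}\nVerdict:"
    )
    verdict = judge(prompt)
    return "PASS" in verdict.upper()

# Stand-in judge for demonstration only; a real one calls a model.
fake_judge = lambda p: "PASS" if "Answer: 5" in p else "FAIL"
print(grade("2 + 3", "5", "5", fake_judge))  # True
```

The same `grade` function works unchanged whether `judge` is a human typing verdicts or a second model, which keeps the grading step swappable.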
Example: does CoT actually help on easy math?
Wei et al. (2022) showed chain-of-thought helps on hard problems. Does it help on grade-school problems too? Run 30 single-digit addition problems, once with 'think step by step' and once without, on the same model. Compare. If you see no difference, you have replicated a real published finding (CoT doesn't help easy tasks). That is real science.
An afternoon's worth of real AI research, in 20 lines
# Tiny experiment skeleton
import anthropic

client = anthropic.Anthropic()

problems = [
    ("2 + 3", "5"),
    ("7 + 8", "15"),
    # ... 28 more
]

def run(prompt_prefix):
    correct = 0
    for q, a in problems:
        resp = client.messages.create(
            model="claude-opus-4-7",
            max_tokens=256,
            messages=[{"role": "user", "content": prompt_prefix + q}],
        )
        # Loose substring check: good enough for a sketch, but note
        # that "5" also matches "15" -- tighten this for a real run.
        if a in resp.content[0].text:
            correct += 1
    return correct / len(problems)

print("Plain:", run(""))
print("CoT:", run("Think step by step. "))

What counts as a good experiment
- One variable changes at a time
- Sample size is at least ~30 for any quantitative claim
- Results are recorded with raw outputs, not just summary numbers
- You wrote your prediction BEFORE running — that keeps you honest
- You can describe the limits in one paragraph
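One way to satisfy the last three checks is to write the prediction to disk before any results exist, then append every raw output as you go. A minimal sketch, assuming a JSONL log file; the file name and record fields are illustrative choices, not a standard:

```python
# Experiment log sketch: prediction first, then one record per raw
# output, so nothing gets summarized away.
import json
import datetime

def start_log(path, hypothesis, prediction):
    # Written BEFORE running anything -- this is your honesty check.
    with open(path, "w") as f:
        f.write(json.dumps({
            "hypothesis": hypothesis,
            "prediction": prediction,
            "started": datetime.datetime.now().isoformat(),
        }) + "\n")

def log_result(path, question, raw_output, correct):
    # Keep the full raw output, not just whether it was right.
    with open(path, "a") as f:
        f.write(json.dumps({
            "question": question,
            "raw_output": raw_output,
            "correct": correct,
        }) + "\n")

start_log("cot_experiment.jsonl",
          "CoT does not help single-digit addition",
          "Accuracy difference under 5 points")
log_result("cot_experiment.jsonl", "2 + 3", "The answer is 5.", True)
```

When you write up the result, the first line of the log is your pre-registered prediction and the rest is your evidence, which makes the one-page write-up mostly a matter of reading the file back.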
“The most exciting phrase to hear in science is not 'Eureka!' but 'That's funny...'”
The big idea: a one-afternoon experiment teaches you more about AI than a month of reading. Pick a question, run the test, write it up. Repeat.