Test-Driven AI Development
TDD was already the gold standard. Paired with an agent, it becomes the tightest feedback loop in software. Here's the full workflow and the pitfalls.
Lesson map
The main moves, in order:
1. The Tightest Feedback Loop in Software
2. TDD
3. Failing tests
4. Coverage
Section 1
The Tightest Feedback Loop in Software
Test-driven AI development pairs classical TDD with an agent. You write the test. The agent writes the code. The test runs. Green or red tells you immediately whether the agent delivered. No hand-waving, no vibes-based shipping.
The canonical loop
1. Write a failing test that describes the desired behavior.
2. Run the test and confirm it fails for the reason you expected.
3. Ask the agent to make the test pass, with permission to edit only the implementation file.
4. Run all tests. Green? Commit. Red? Paste the output back to the agent.
5. Refactor with the agent: tests stay green, structure improves.
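The red-to-green rhythm of steps 1 through 4 can be sketched in miniature without a test runner; the stub and names below are illustrative, not part of the lesson's code:

```typescript
type Item = { price: number };

// Steps 1-2: a deliberate stub makes the first run fail for the reason we
// expect (wrong value), not an import or syntax error.
let priceCart = (_items: Item[]): number => NaN;

const testPasses = (): boolean => priceCart([{ price: 10 }]) === 10;
console.log('red:', testPasses());   // false: the expected failure

// Steps 3-4, sketched by hand: the implementation changes, the test does not.
priceCart = (items) => items.reduce((sum, item) => sum + item.price, 0);
console.log('green:', testPasses()); // true: safe to commit
```

The point of the stub is step 2's check: if the test fails for any reason other than missing behavior, fix that first before involving the agent.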
A realistic Vitest example
Four tests describe the whole contract. The agent has zero room to invent requirements.
```typescript
// src/pricing.test.ts
import { describe, it, expect } from 'vitest';
import { priceCart } from './pricing';

describe('priceCart', () => {
  it('returns 0 for empty cart', () => {
    expect(priceCart([])).toBe(0);
  });

  it('sums item prices', () => {
    expect(priceCart([{ price: 10 }, { price: 5 }])).toBe(15);
  });

  it('applies 10% discount when total > 100', () => {
    expect(priceCart([{ price: 120 }])).toBe(108);
  });

  it('rounds to 2 decimals', () => {
    expect(priceCart([{ price: 10.005 }])).toBe(10.01);
  });
});

// Now say to the agent:
// "Implement src/pricing.ts so all tests in pricing.test.ts pass.
//  Only edit pricing.ts — do not modify the tests."
```

Property-based testing: the force multiplier
Property-based tests let you describe invariants instead of examples. The framework generates hundreds of random inputs and checks the property holds. Paired with an agent, you get code that survives inputs neither of you thought to write.
fast-check generates ~100 randomized carts per run. AI-written code that passes examples often fails properties — this catches it.
```typescript
import fc from 'fast-check';

it('is always non-negative', () => {
  fc.assert(
    fc.property(
      fc.array(fc.record({ price: fc.float({ min: 0, max: 1000 }) })),
      (items) => priceCart(items) >= 0
    )
  );
});
```

Mutation testing: testing the tests
Mutation testing deliberately breaks your implementation (flips a greater-than to a less-than, removes a plus-one) and checks whether your tests catch the mutant. If your tests still pass on broken code, your suite has holes. Tools like Stryker and MutPy automate this. Use them quarterly to audit AI-written test suites, which often miss edge cases humans would catch.
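A hand-rolled sketch of what a mutation tool automates, reusing the discount rule from the pricing example; the standalone functions here are for illustration, not part of the lesson's code:

```typescript
// Original rule from the example: 10% discount when total > 100.
const original = (total: number): number => (total > 100 ? total * 0.9 : total);

// A typical generated mutant: the boundary flips from > to >=.
const mutant = (total: number): number => (total >= 100 ? total * 0.9 : total);

// The example suite only exercises total = 120, where both agree,
// so this mutant would survive the tests...
console.log(original(120) === mutant(120)); // true: the suite can't tell them apart

// ...while the untested boundary total = 100 tells them apart.
console.log(original(100), mutant(100)); // 100 90: a missing test case
```

A surviving mutant like this is the cue to add a boundary test (say, a cart totalling exactly 100), not a reason to trust the green suite.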
Coverage is necessary, not sufficient
- Line coverage: did every line execute? (weakest)
- Branch coverage: did every if/else path execute? (better)
- Mutation score: do tests catch changes to the code? (strongest)
- Property coverage: do invariants hold across generated inputs? (complementary)
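The gap between the first two levels is easy to demonstrate; a minimal sketch, using a standalone version of the discount rule assumed for illustration:

```typescript
// A one-line function still has two branches, so a single test can
// execute every line yet leave a whole branch unvisited.
const discount = (total: number): number => (total > 100 ? total * 0.9 : total);

// This one assertion yields 100% line coverage: the entire function
// body is one line, and that line ran.
console.assert(discount(120) === 108);

// But the false branch (total <= 100) never executed, so branch
// coverage sits at 50%. A second case closes the gap:
console.assert(discount(50) === 50);
```

Mutation score then stacks on top: even with both branches covered, the boundary mutant from the previous section survives until a `total === 100` case exists.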
When TDD with AI is the wrong tool
- Exploratory scripts — testing overhead dominates the work
- Rapid UI prototyping — visual feedback is the real test
- Research code meant to be thrown away — tests rot as questions change
“Tests are the only specification that runs. Agents are the only implementation that listens.”
The big idea: the test is the contract, the agent is the contractor, and the suite is the inspector. Done right, TDD-with-AI is the fastest way to ship correct code that has ever existed.
Related lessons
Keep going
Creators · 45 min
Calling the Claude API With Streaming
Anthropic's SDK in 20 lines. Learn messages, streaming tokens, and basic error handling.
Creators · 50 min
Installing and Using Claude Code CLI
Claude Code is Anthropic's terminal-native coding agent. Let's install it, wire it to a project, and use the features most engineers miss on day one.
Creators · 50 min
AI-Assisted Code Review Workflows (for Teams)
Code review is the highest-leverage touchpoint in a team. Automating the noise with AI frees humans to focus on the irreducibly human parts. Let's design the workflow.
