Performance Bugs in AI-Generated Code
AI writes code that works on small inputs and crawls on large ones. Learn the top patterns of AI-introduced performance issues, the profiling tools that surface them, and the prompts that prevent them.
Lesson map
What this lesson covers, in order:
1. Works on My Machine. Crawls in Production.
2. Complexity
3. N+1 queries
4. Profiling
Section 1
Works on My Machine. Crawls in Production.
AI does not feel performance. It writes code that is correct on three test inputs and devastating on three million. The result: a feature that ships green, then takes the database down on Monday morning. The bugs are stereotyped, and so are the fixes.
The top six performance bugs AI generates
Compare the options
| Pattern | Symptom | Fix |
|---|---|---|
| N+1 queries | Loop calls DB once per item | Single query with `IN`, JOIN, or batched fetch |
| Quadratic loops on lists | `for x in a: if x in b:` with b as list | Convert b to a set first |
| Synchronous in async | `requests.get(...)` inside async function | `httpx.AsyncClient`, `await` |
| Loading whole file/table to filter | `df = pd.read_csv(...).query(...)` | Filter at source (SQL WHERE, csv chunks) |
| No pagination | Endpoint returns all 50k records | Cursor or offset pagination |
| Allocating in a hot loop | `new Date()` per iteration | Hoist out of the loop |
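The quadratic-loop row deserves a concrete look, because the fix is one line. A minimal sketch with made-up data; the shape of the code is what matters:

```python
import time

a = list(range(10_000))
b = list(range(5_000, 15_000))

# AI's default: each `in` scans the whole list, O(len(a) * len(b)) overall.
t0 = time.perf_counter()
slow = [x for x in a if x in b]
print(f"list: {time.perf_counter() - t0:.3f}s")

# The fix: one O(len(b)) conversion up front, then O(1) membership checks.
t0 = time.perf_counter()
b_set = set(b)
fast = [x for x in a if x in b_set]
print(f"set:  {time.perf_counter() - t0:.3f}s")

assert slow == fast
```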
The N+1 trap, in detail
The N+1 is the most common AI-introduced perf bug. Every ORM has the same fix; AI rarely reaches for it unprompted.
# AI gives you this — looks fine, ships green:
def get_user_emails():
    users = User.objects.all()  # 1 query
    return [
        {"id": u.id, "email": u.email, "team": u.team.name}
        # u.team.name triggers a query per user. 10k users = 10,001 queries.
        for u in users
    ]

# The fix: prefetch / select_related
def get_user_emails():
    users = User.objects.select_related("team").all()  # 1 join query
    return [
        {"id": u.id, "email": u.email, "team": u.team.name}
        for u in users
    ]
# Same code shape, ~10,000x fewer queries on large data.

Performance prompts that work
Naming the input scale changes the model's defaults completely. "100k rows" produces different code than "a list".
# Prepend to any prompt where data size matters:
"This function will run on 100k+ rows in production.
Constraints:
- Must complete in under 200ms.
- O(N log N) or better.
- No N+1 queries — use joins/IN clauses.
- Stream the result if it doesn't fit in memory.
- Add a comment with the expected complexity."

Profile-then-fix, with AI
Handed real profiler output, AI performs like a capable junior performance engineer. Without it, AI is a guesser.
# 1. Run a profiler on the slow function (cProfile, py-spy, clinic.js, etc.)
# 2. Paste the profiler output into chat:
"Here is py-spy output for a function that takes 8s on 100k rows.
The top 3 hot spots are <paste>. Suggest the smallest possible change
to each that would speed it up. Show before/after for each."
# AI is excellent at reading flame graphs and profiler output.
# This is one of its highest-value uses for performance.
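If you have no profiler wired up yet, cProfile ships with Python. A minimal harness, with a deliberately quadratic stand-in function so a hot spot actually shows up (`slow_report` and its data are hypothetical):

```python
import cProfile
import pstats

def slow_report(rows):
    # Deliberately quadratic stand-in: list membership check per row.
    seen, out = [], []
    for r in rows:
        if r not in seen:       # O(N) scan per row -> O(N^2) overall
            seen.append(r)
            out.append(r)
    return out

with cProfile.Profile() as pr:
    slow_report(list(range(20_000)))

# Top 10 cumulative hot spots: this is the output to paste into chat.
pstats.Stats(pr).sort_stats("cumulative").print_stats(10)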
Memory bugs are quieter and meaner
- Holding references in a long-running list — looks fine until OOM
- Reading a 5GB file into memory instead of streaming (fix sketched below)
- Caching with no eviction — process grows forever
- Closures that capture too much (entire scope) in JS
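The streaming fix for the file case is mostly a matter of not calling `.read()`. A minimal sketch, with a hypothetical log path and filter:

```python
def count_errors(path: str) -> int:
    count = 0
    with open(path) as f:
        for line in f:            # lazy iteration: one line in memory at a time
            if "ERROR" in line:   # hypothetical filter
                count += 1
    return count

# The OOM version reads everything first:
#   lines = open(path).read().splitlines()   # 5GB file means 5GB+ of RAM
```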
Use the AI to generate benchmarks, not just code
Benchmarking is a habit. Add it to every nontrivial function, just like tests.
# After AI writes the function, immediately:
"Write a microbenchmark that runs this function on:
- 100 items (warm-up)
- 10k items
- 1M items
Report time per call and memory peak. Use timeit + tracemalloc."
# 60 seconds of work, surfaces 80% of perf bugs before they ship.
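The harness that prompt should produce looks roughly like this. A minimal sketch, assuming the function under test is named `process` (swap in the real name):

```python
import timeit
import tracemalloc

def bench(fn, data, repeats=5):
    # Best-of-N wall time for a single call.
    best_s = min(timeit.repeat(lambda: fn(data), number=1, repeat=repeats))
    # Peak allocation during one call.
    tracemalloc.start()
    fn(data)
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    print(f"n={len(data):>9,}  best={best_s * 1000:8.2f} ms  peak={peak / 1e6:8.2f} MB")

for n in (100, 10_000, 1_000_000):      # warm-up, medium, production scale
    bench(process, list(range(n)))      # `process` is hypothetical
```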
When perf is the requirement, write the test first

If the function MUST run in under 50ms on 10k inputs, write a test that asserts exactly that — `assert duration_ms < 50`. Now performance is part of the spec. Test-driven prompting works for performance just as it does for correctness.
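A minimal pytest-style sketch; `process` and `make_inputs` are hypothetical stand-ins for your function and its test fixture:

```python
import time

def test_process_meets_latency_budget():
    data = make_inputs(10_000)                  # hypothetical fixture
    start = time.perf_counter()
    process(data)                               # hypothetical function under test
    duration_ms = (time.perf_counter() - start) * 1000
    assert duration_ms < 50, f"took {duration_ms:.1f} ms, budget is 50 ms"
```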
“AI writes code for the inputs it can imagine. Production has the inputs it can't.”
The big idea: performance is invisible to AI without explicit signal. State your scale, write benchmarks, profile the hot path, and let the AI optimize against measured reality. Without that signal, the model defaults to whatever pattern it saw most — usually correct, often slow.
