Loading lesson…
Reviewing AI-written PRs is a different sport from reviewing human ones. Learn the structured review workflow that catches AI-specific bugs, plus the questions that separate confident-looking trash from real engineering.
When AI writes most of your code, reviewing becomes the engineer's primary craft. AI-written PRs look polished — better commit messages, better names, better formatting than humans usually produce. They also fail in stereotyped ways humans rarely do. The review checklist must adapt.
| Dimension | Human PR | AI PR |
|---|---|---|
| Surface polish | Variable | High — formatted, named well, comments present |
| Test coverage | Often skipped | Usually present (but may be tautological) |
| Off-by-ones | Common in custom logic | Common in loops AI generated |
| Hallucinated APIs | Rare | Common — calls to functions that don't exist in your codebase |
| Architectural fit | Usually consistent | May fight existing conventions |
| Risk of overconfidence | Self-aware | AI rarely flags its own uncertainty |
# Pin these to your monitor: 1. "What was here before, and why?" — AI loves to delete code it doesn't understand the purpose of. 2. "What happens to this function on the most surprising input?" — Empty? Null? Unicode? Very large? Concurrent? 3. "Does this change touch any business logic that has documented reasons?" — Check ADRs, comments, related issues. 4. "Where in this PR would a malicious user attack?" — Even if the feature isn't security-critical, ask. 5. "Could I reproduce this PR's intent in 50% fewer lines?" — AI tends toward verbose; cut what doesn't earn its keep.Five questions you ask of every PR. Answers reveal whether the AI understood the task or merely shaped output to look like it did.# Run in a fresh chat with a different model than wrote the PR: "You are a senior reviewer with no context on this PR's history. Do not trust the description. Read only the diff. For each changed file, list: 1. What this change does (1 sentence) 2. The single most likely bug introduced 3. The single most likely regression risk in adjacent code Do not be polite. Be blunt. List concrete line numbers."Cross-model adversarial review. Catches what same-family review misses.| Signal | Action |
|---|---|
| Imports verified, tests cover edges, scope tight | Approve |
| Tests are tautological — match implementation 1:1 | Request: "rewrite tests against the spec, not the code" |
| Diff includes unrelated changes | Request: "split into separate PRs" |
| Hallucinated function call | Request: "verify <function> exists and document its source" |
| Performance claim with no measurement | Request: "add benchmark or remove the claim" |
| Security-critical path with no human-written test | Block until written |
An AI PR with no review is a commit by an intern who has never seen production.
— An engineering manager
The big idea: AI PRs look better than human ones and fail in different ways. The structured walk, the five questions, and cross-model adversarial review catch most of it. Reviewing well is now the engineer's most leveraged skill — what you catch in review is what doesn't ship to production.
8 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-coding-debug-code-review-workflows-creators
What is the main idea of "Reviewing AI Code Like a Senior Engineer"?
Which concept is most central to "Reviewing AI Code Like a Senior Engineer"?
Which use of AI fits this topic best?
What should a careful learner remember about "Use AI to review AI"?
You want to use AI after this lesson. What is the safest next step?
How should AI output about code review be treated?
Name one way to verify an AI answer about code review.
Which action would help you apply "Reviewing AI Code Like a Senior Engineer" responsibly?