Multi-Agent Orchestration: Planner + Executor + Verifier

One smart agent is fine. Two agents checking each other's work is better. Master the canonical orchestration patterns: planner/executor, judge/worker, debate, and swarm.

48 min · Reviewed 2026

Why multi-agent

A single agent trying to do everything in one context window hits limits: context bloat, role confusion, weak self-critique. Splitting the work across specialized agents with narrow roles is often the cheapest way to add reliability. The patterns below are well attested in production use at Anthropic and OpenAI and in research papers.

Pattern 1 — Planner / Executor / Verifier

Agent	Role	Model tier
Planner	Breaks the goal into ordered steps.	Smartest model (Opus 4.7, GPT-5).
Executor	Runs each step. Uses tools.	Mid tier (Sonnet 4.6, GPT-5-mini).
Verifier	Checks result against original goal.	Smart + strict (Opus 4.7 at low temp).

    # Simplified planner/executor/verifier loop.
    # planner, executor, verifier, log, and TOOLS are assumed to be defined elsewhere.
    goal = "Migrate all CSV files in /data to parquet, preserving schemas."
    steps = list(planner(goal).steps)  # ordered steps
    history = []
    i = 0
    while i < len(steps):
        step = steps[i]
        result = executor(step, tools=TOOLS)  # has MCP + shell + file
        ok, notes = verifier(step, result, goal)
        log(step, result, ok)
        history.append((step, result, ok))
        if not ok:
            # Replace the failed step and everything after it with a fresh plan.
            fix = planner(f"Step failed: {notes}. Replan from here.")
            steps[i:] = fix.steps
            continue
        i += 1
    final_ok, summary = verifier("final", history, goal)

Planner writes the plan. Executor runs it. Verifier checks. Replan on failure.

Pattern 2 — Judge / Worker (competitive)

Spawn N workers to attempt the same task with different prompts or temperatures. A judge scores their outputs and returns the best one. Variants of this sample-and-select approach appear in AlphaCode, Anthropic's research tooling, and many top SWE-bench submissions. The tradeoff is roughly N times the compute in exchange for better results.
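The judge/worker loop can be sketched as follows. This is a minimal illustration, not a reference implementation: `worker` and `judge` are hypothetical callables standing in for model calls, and the stubs below are deterministic so the control flow runs offline.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Tuple


def best_of_n(
    task: str,
    worker: Callable[[str, float], str],
    judge: Callable[[str, str], float],
    temperatures: List[float],
) -> Tuple[str, float]:
    """Run one worker per temperature in parallel and return the top-scored output."""
    with ThreadPoolExecutor(max_workers=len(temperatures)) as pool:
        candidates = list(pool.map(lambda t: worker(task, t), temperatures))
    scored = [(candidate, judge(task, candidate)) for candidate in candidates]
    return max(scored, key=lambda pair: pair[1])


# Hypothetical stand-ins for real model calls (assumptions, not a real API).
def stub_worker(task: str, temperature: float) -> str:
    return f"{task} (draft at T={temperature})"


def stub_judge(task: str, candidate: str) -> float:
    # Toy scoring rule: prefer the draft produced at T=0.7.
    return 1.0 if "T=0.7" in candidate else 0.0


best, score = best_of_n("Summarize the report", stub_worker, stub_judge, [0.2, 0.7, 1.0])
```

In a real system the judge is usually a separate model call with its own rubric, so that its scores do not simply echo the workers' own preferences.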

Pattern 3 — Debate

Two agents argue opposite sides of a question. A third agent reads the debate and picks a winner. This is effective for subjective tasks (editorial decisions, design tradeoffs) where a single pass lacks rigor. OpenAI's 'debate' research and Anthropic's Constitutional AI (CAI) pipeline both use variants.
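A debate round can be sketched as a fixed number of alternating turns followed by one arbiter call. The names `run_debate`, `pro`, `con`, and `arbiter` are hypothetical, and the stubs use a toy rule in place of real model calls.

```python
from typing import Callable, List, Tuple

Transcript = List[Tuple[str, str]]  # (speaker, argument)


def run_debate(
    question: str,
    pro: Callable[[str, Transcript], str],
    con: Callable[[str, Transcript], str],
    arbiter: Callable[[str, Transcript], str],
    rounds: int = 2,
) -> Tuple[str, Transcript]:
    """Alternate pro/con turns for a fixed number of rounds, then ask the arbiter for a winner."""
    transcript: Transcript = []
    for _ in range(rounds):
        transcript.append(("pro", pro(question, transcript)))
        transcript.append(("con", con(question, transcript)))
    return arbiter(question, transcript), transcript


# Hypothetical stubs standing in for model calls.
def stub_pro(question: str, transcript: Transcript) -> str:
    return f"Evidence: a monolith deploys faster (round {len(transcript) // 2 + 1})"


def stub_con(question: str, transcript: Transcript) -> str:
    return f"Microservices scale teams (round {len(transcript) // 2 + 1})"


def stub_arbiter(question: str, transcript: Transcript) -> str:
    # Toy rule: the side that cites "evidence" more often wins; ties go to "con".
    cites = {"pro": 0, "con": 0}
    for speaker, argument in transcript:
        cites[speaker] += argument.lower().count("evidence")
    return "pro" if cites["pro"] > cites["con"] else "con"


winner, transcript = run_debate("Monolith or microservices?", stub_pro, stub_con, stub_arbiter)
```

Each debater sees the transcript so far, which is what lets later turns rebut earlier ones; the arbiter sees the whole transcript only once, at the end.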

Pattern 4 — Swarm (parallel specialists)

A coordinator sends the same input to specialist agents (e.g., 'legal reviewer', 'UX reviewer', 'accessibility reviewer') and merges their feedback. This beats one generalist because each specialist can have a narrower, sharper system prompt and a different MCP toolset. CrewAI and Microsoft Agent Framework lean into this pattern.
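The fan-out and merge can be sketched with `asyncio.gather`. The specialist prompts and the `call_specialist` function are assumptions made for illustration; a real version would call a model client there.

```python
import asyncio
from typing import Dict

# Hypothetical system prompts, one per specialist role.
SPECIALISTS: Dict[str, str] = {
    "legal reviewer": "Flag licensing and liability issues only.",
    "UX reviewer": "Flag confusing flows and unclear copy only.",
    "accessibility reviewer": "Flag contrast, focus, and labeling issues only.",
}


async def call_specialist(role: str, system_prompt: str, document: str) -> str:
    """Hypothetical stand-in for a model call; replace with a real client."""
    await asyncio.sleep(0)  # yield control, as a network call would
    return f"[{role}] reviewed {len(document)} chars"


async def coordinate(document: str) -> Dict[str, str]:
    """Send the same input to every specialist concurrently, then merge replies by role."""
    roles = list(SPECIALISTS)
    replies = await asyncio.gather(
        *(call_specialist(role, SPECIALISTS[role], document) for role in roles)
    )
    return dict(zip(roles, replies))


feedback = asyncio.run(coordinate("Draft landing page copy"))
```

Because the specialists do not depend on each other, total latency is close to that of the slowest single call rather than the sum of all calls.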

LangGraph state skeleton

    from typing import List, TypedDict

    from langgraph.checkpoint.memory import MemorySaver
    from langgraph.graph import END, StateGraph


    class State(TypedDict):
        goal: str
        plan: List[str]
        current_step: int
        results: List[dict]
        verdict: str


    def route_after_verify(s: State) -> str:
        # Check for failure first so a failed step is replanned, not skipped.
        if s["verdict"] == "fail":
            return "replan"
        if s["current_step"] < len(s["plan"]):
            return "execute"
        return END


    # plan_fn, execute_fn, verify_fn, and replan_fn are node functions defined elsewhere.
    graph = StateGraph(State)
    graph.add_node("plan", plan_fn)
    graph.add_node("execute", execute_fn)
    graph.add_node("verify", verify_fn)
    graph.add_node("replan", replan_fn)
    graph.set_entry_point("plan")
    graph.add_edge("plan", "execute")
    graph.add_edge("execute", "verify")
    graph.add_conditional_edges("verify", route_after_verify)
    graph.add_edge("replan", "execute")

    # MemorySaver keeps checkpoints in memory; use a database-backed checkpointer for durable state.
    app = graph.compile(checkpointer=MemorySaver())

Planner/executor/verifier as an explicit state machine. Checkpointers let you pause, rewind, and resume.
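The skeleton leaves `plan_fn`, `execute_fn`, and `verify_fn` undefined. In LangGraph a node is a callable that receives the current state and returns a dict of updates, so the stubs below follow that shape. Their bodies are placeholders with no model calls, `replan_fn` is omitted for brevity, and the loop at the end drives them by hand so the sketch runs without LangGraph installed.

```python
from typing import Dict


def plan_fn(state: Dict) -> Dict:
    # A real planner would call a model; this stub returns a fixed plan.
    return {"plan": ["list CSV files", "convert to parquet"], "current_step": 0, "results": []}


def execute_fn(state: Dict) -> Dict:
    step = state["plan"][state["current_step"]]
    return {"results": state["results"] + [{"step": step, "output": f"done: {step}"}]}


def verify_fn(state: Dict) -> Dict:
    ok = state["results"][-1]["output"].startswith("done")
    # Advance only on success so a failed step can be replanned.
    return {
        "verdict": "pass" if ok else "fail",
        "current_step": state["current_step"] + (1 if ok else 0),
    }


# Drive the nodes by hand, merging each partial update into the state.
state: Dict = {"goal": "Migrate CSVs to parquet"}
state.update(plan_fn(state))
while state["current_step"] < len(state["plan"]):
    state.update(execute_fn(state))
    state.update(verify_fn(state))
    if state["verdict"] == "fail":
        break
```

Keeping each node a pure function of the state makes it easy to unit-test the nodes before wiring them into a graph.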

Coordination pitfalls

Context duplication — N agents each get the full history → N× cost. Use summaries.
Role leakage — executor starts planning, verifier starts executing. Tighten system prompts.
Infinite replans — cap replan attempts (e.g., 3) before escalating to a human.
Verifier sycophancy — a verifier from the same lab as the executor can share its blind spots and over-approve. Mix providers.
Serial bottlenecks — if only the planner can proceed, you lose the parallelism you paid for.
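The replan cap from the list above can be sketched as a small wrapper around one step. The function name, the `EscalateToHuman` exception, and the stub callables are all hypothetical; the default cap of 3 follows the example in the list.

```python
from typing import Callable, Tuple


class EscalateToHuman(Exception):
    """Raised when the replan budget is exhausted."""


def run_step_with_replan_cap(
    step: str,
    execute: Callable[[str], str],
    verify: Callable[[str, str], Tuple[bool, str]],
    replan: Callable[[str, str], str],
    max_replans: int = 3,
) -> Tuple[str, int]:
    """Execute a step, replanning on failure at most max_replans times."""
    replans = 0
    while True:
        result = execute(step)
        ok, notes = verify(step, result)
        if ok:
            return result, replans
        if replans >= max_replans:
            raise EscalateToHuman(f"{step!r} still failing after {replans} replans: {notes}")
        step = replan(step, notes)
        replans += 1


# Hypothetical stubs: the step only passes after two replans.
def stub_execute(step: str) -> str:
    return f"ran {step}"


def stub_verify(step: str, result: str) -> Tuple[bool, str]:
    return result.count("retry") >= 2, "needs two retries"


def stub_replan(step: str, notes: str) -> str:
    return f"{step} (retry)"


outcome = run_step_with_replan_cap("convert", stub_execute, stub_verify, stub_replan)
```

Raising a dedicated exception rather than returning a failure value makes the escalation hard to ignore in the calling code.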

Next lesson: how to actually build the planner/executor/verifier in LangGraph.

End-of-lesson check

8 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-agentic-multi-agent-patterns-creators

What is the main idea of "Multi-Agent Orchestration: Planner + Executor + Verifier"?
1. Two agents checking each other's work is better.
2. Use AI as the final authority for the whole decision
3. Avoid checking the answer once it sounds polished
4. Focus only on speed instead of judgment

Which concept is most central to "Multi-Agent Orchestration: Planner + Executor + Verifier"?
1. planner-executor
2. orchestration
3. verifier
4. debate

Which use of AI fits this topic best?
1. Let the AI decide what matters without your review
2. Use the answer before checking whether it fits the situation
3. Context duplication — N agents each get the full history → N× cost. Use summaries.
4. Treat the AI output as automatically correct

What should a careful learner remember about "More agents ≠ better by default"?
1. Use AI to draft or organize ideas about orchestration, then verify before acting.
2. Skip the context so the tool can guess faster
3. Treat the output as private even after sharing it online
4. Use the answer without checking the source

You want to use AI after this lesson. What is the safest next step?
1. Act immediately because the AI answer is written clearly
2. Use AI for drafting and comparison, but verify before publishing or relying on it.
3. Hide uncertainty so the final answer looks cleaner
4. Use private or sensitive details before checking permission

How should AI output about orchestration be treated?
1. As proof that no other source is needed
2. As a replacement for context, consent, or expert review
3. As a draft or helper output that still needs human judgment and verification
4. As something that becomes correct when it sounds confident

Name one way to verify an AI answer about orchestration.

Which action would help you apply "Multi-Agent Orchestration: Planner + Executor + Verifier" responsibly?
1. Use the tool to avoid thinking through the tradeoff
2. Keep going even if the output conflicts with a trusted source
3. Treat the AI output as automatically correct
4. Role leakage — executor starts planning, verifier starts executing. Tighten system prompts.
