After this, you'll be able to describe the human/supervisor/worker architecture, write a minimal supervisor prompt, and explain why the middle tier is the one that makes scale possible.
Before you start
Complete Cross the Diff-vs-Code Threshold first; this lesson builds on calibrated trust so you can assign human/supervisor/worker roles with the right confidence levels.
The idea
Here is the before and after: You need to add input validation to 12 API endpoints. You could prompt an agent 12 times. Or you could write one supervisor prompt, dispatch it once, and come back to a summary table. The difference is the three-tier architecture.
Three tiers: you (human) set the goal and approve the final output. A supervisor agent coordinates the work, enforces quality, and decides what to surface. Worker agents each handle one task with no awareness of the larger system.
The middle tier is what makes this work. Without a supervisor, you become the coordinator. You track which agent is on which branch, review every intermediate result, and manually dispatch retries. That is more work than doing it yourself. The supervisor absorbs all of that. It creates the worker tasks, dispatches them in parallel, reads each result, runs the checks, and reports back with a summary.
The supervisor prompt is the critical artifact. It defines: what workers are allowed to do, what quality criteria a worker result must meet before it is accepted, how to handle worker failure, and how to report back. Most teams underestimate this. A vague supervisor produces vague outputs at 12x the cost.
Tier separation also maps to blast radius. Workers never touch external APIs or production state without supervisor approval. The supervisor never pushes to production without your approval. The hierarchy is not bureaucracy. It is a safety boundary.
Try it (22 min)
Watch out for
Paste this into Claude:
I want to set up a three-tier architecture for a multi-step task. Here is a task that has multiple parallel subtasks: [describe a task with at least 4-6 parallel components, e.g. 'add input validation to 6 API endpoints', 'update the docs for 8 public functions', 'add error handling to 5 service files']. Help me: (1) Write a supervisor prompt that specifies the goal, the quality criteria each worker result must meet, how to handle worker failure, and how to summarize results. (2) Define what each worker receives as its task prompt. (3) Describe how the supervisor reads and evaluates each worker output. Do not run the workers yet. Just set up the architecture.
What good looks like:
Go deeper (25 min)
Paste this into Claude:
Now run your three-tier setup on the task you designed. Dispatch the supervisor. Let it coordinate the workers. When it reports back, review only the supervisor's summary, not the individual worker outputs. Check: (1) Did the supervisor's quality gate catch anything a worker got wrong? (2) Was the summary enough to decide whether to approve the batch? (3) What would you change in the supervisor prompt for the next run?
What good looks like:
When this breaks
Claude can do it for you
Say to Claude: 'Act as a supervisor agent. Your goal is: [goal]. Workers will each handle one subtask. For each worker result, check: [your criteria]. If a result fails, request a revision. When all workers pass, summarize the results in a table with task, status, and one-line output. Do not surface individual worker outputs unless I ask.'
You can now
Write a supervisor prompt that includes explicit quality criteria, failure handling, and a reporting format, then dispatch it on a 5+ subtask job and review only the summary table.
Key takeaways
The supervisor is not overhead. It is what makes the middle tier valuable. Without it, you are doing the coordination. Write the supervisor prompt carefully before dispatching a single worker.