Forcing Opus to use subagents instead of reading files itself: no quality loss, ~4× token reduction

stonianua · April 24, 2026, 4:58pm

I tested the opposite. On a 2000-file codebase, I wrote system prompts that hard-prohibit Opus from reading files during diagnosis and planning sessions. It must dispatch subagents (cheaper fast models) to explore, map, and draft — then synthesize their short structured returns.

Every Opus row is under 750K. The large rows are the cheap fast model doing exploration.

Quality did not regress. Plans are more executable, not less. Diagnosis is sharper, not vaguer. The reason: when forced to delegate exploration, Opus spends its tokens on what it’s actually better at — framing the problem, resolving ambiguity, and making architectural decisions. When left unconstrained, it burns most of its context on grep-and-read tours any cheap model could do.

The token math: A subagent that maps 20 files and returns 300 words costs ~10K tokens on a fast model. Opus reading those same 20 files inline costs ~500K-1M tokens — and that content then rides in context for every subsequent turn, compounding. That compounding was the real leak.

The specific failure mode this prevents: Opus reasons “I need to read the spec doc to understand scope” → opens a 1000-line file → triggers 5 more reads → 7.5M tokens before any actual work starts. That row is in my logs at 2:52 PM.

The rule that matters: Dispatch is the reversible decision. Inline reading is not. When in doubt, dispatch — the downside is 10K tokens of overhead; the upside is avoiding a 5M-token session.

Topic		Replies	Views
Cursor subagents are… kinda insane Built for Cursor rules , subagents	6	1694	June 12, 2026
My Subagent List v2 Guides rules , subagents	2	1602	February 15, 2026
Fixed Gemini 3.1 pro in Cursor Guides rules , anthropic , gemini	0	148	April 29, 2026
Ralph Cursor Guide Guides terminal , plan-mode , context , cli	1	6642	July 8, 2026
Opus 4.7 Not Utilizing SubAgent Optimally? Discussions plan-mode , context , anthropic , subagents	3	146	May 24, 2026

Forcing Opus to use subagents instead of reading files itself: no quality loss, ~4× token reduction

Related topics