Lawrence Jengar
Apr 09, 2026 18:50
Anthropic’s new advisor software pairs Opus with cheaper fashions, delivering near-premium intelligence at a fraction of the associated fee for builders constructing AI brokers.
Anthropic simply dropped a software that might reshape how builders funds for AI brokers. The advisor software, introduced April 9, lets cheaper Claude fashions faucet into Opus-level intelligence solely when wanted—chopping prices by as much as 85% whereas sustaining aggressive efficiency.
The mechanic is simple: Sonnet or Haiku runs your agent end-to-end, dealing with software calls and iterations. When it hits a wall, it escalates to Opus for steerage. Opus by no means touches instruments or person output straight—it simply advises and palms management again.
The Numbers That Matter
Anthropic’s benchmarks inform an attention-grabbing story. Sonnet with an Opus advisor scored 2.7 proportion factors larger on SWE-bench Multilingual than Sonnet alone, whereas really costing 11.9% much less per process. That is higher efficiency for much less cash—not a tradeoff most builders anticipate.
Haiku customers see much more dramatic shifts. On BrowseComp, Haiku with Opus advisor hit 41.2%—greater than double its solo rating of 19.7%. Sure, it nonetheless trails Sonnet’s standalone efficiency by 29%, however here is the kicker: it prices 85% much less per process. For prime-volume operations the place you are burning by 1000’s of agent calls each day, that math will get very engaging very quick.
Why This Issues Now
The timing is not unintentional. Anthropic shipped Sonnet 4.6 in mid-February, which already matched Opus-level efficiency in lots of duties. OpenAI countered with GPT-5.4 in early March, unifying their Codex and GPT strains with million-token context. The AI agent area is getting crowded, and value effectivity is changing into the battleground.
The advisor software flips the everyday orchestration sample. As an alternative of an enormous mannequin delegating to smaller employees, an inexpensive mannequin drives every little thing and solely escalates when caught. No decomposition logic, no employee swimming pools—only a single API name with built-in handoffs.
Implementation Particulars
Builders add one software declaration to their Messages API request. The advisor_20260301 kind routes context to Opus routinely when the executor mannequin decides it wants assist. A max_uses parameter caps advisor calls per request, and tokens invoice at every mannequin’s respective price.
Since Opus usually generates simply 400-700 tokens of steerage per session whereas the executor handles full output at decrease charges, general spend stays properly under operating Opus end-to-end.
The software slots alongside current capabilities—net search, code execution, no matter you are already utilizing. No architectural overhaul required.
For groups already operating Claude brokers at scale, it is a simple optimization. For these evaluating Anthropic in opposition to OpenAI’s increasing lineup, it is one other variable within the cost-performance equation that is value modeling earlier than committing infrastructure.
Picture supply: Shutterstock

