claude-code - 💡(How to fix) Fix Feature request: fast-mode dispatch for Agent() subagents (model=opus-fast or fastMode flag) [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
anthropics/claude-code#54514Fetched 2026-04-30 06:43:33
View on GitHub
Comments
0
Participants
1
Timeline
2
Reactions
0
Participants
Timeline (top)
labeled ×2

/fast mode toggles the main session to Opus 4.6 with faster output (no model downgrade). There is no equivalent for subagent dispatch via the Agent tool — model accepts sonnet | opus | haiku, with no way to request the Opus-4.6/fast-output variant for a spawned subagent.

In the orchestrator/worker pattern that Claude Code increasingly encourages — a thin coordinator dispatching specialized subagents in parallel for heterogeneous workloads — most of the substantive compute happens in the subagents. Without fast-mode dispatch, the parent can be on /fast while every dispatched worker reverts to default Opus 4.7 latency. Fast-mode latency wins land where the work isn't.

Root Cause

/fast mode toggles the main session to Opus 4.6 with faster output (no model downgrade). There is no equivalent for subagent dispatch via the Agent tool — model accepts sonnet | opus | haiku, with no way to request the Opus-4.6/fast-output variant for a spawned subagent.

In the orchestrator/worker pattern that Claude Code increasingly encourages — a thin coordinator dispatching specialized subagents in parallel for heterogeneous workloads — most of the substantive compute happens in the subagents. Without fast-mode dispatch, the parent can be on /fast while every dispatched worker reverts to default Opus 4.7 latency. Fast-mode latency wins land where the work isn't.

Fix Action

Fix / Workaround

Feature request: model=opus-fast (or equivalent) for Agent() subagent dispatch

/fast mode toggles the main session to Opus 4.6 with faster output (no model downgrade). There is no equivalent for subagent dispatch via the Agent tool — model accepts sonnet | opus | haiku, with no way to request the Opus-4.6/fast-output variant for a spawned subagent.

In the orchestrator/worker pattern that Claude Code increasingly encourages — a thin coordinator dispatching specialized subagents in parallel for heterogeneous workloads — most of the substantive compute happens in the subagents. Without fast-mode dispatch, the parent can be on /fast while every dispatched worker reverts to default Opus 4.7 latency. Fast-mode latency wins land where the work isn't.

RAW_BUFFERClick to expand / collapse

Feature request: model=opus-fast (or equivalent) for Agent() subagent dispatch

Summary

/fast mode toggles the main session to Opus 4.6 with faster output (no model downgrade). There is no equivalent for subagent dispatch via the Agent tool — model accepts sonnet | opus | haiku, with no way to request the Opus-4.6/fast-output variant for a spawned subagent.

In the orchestrator/worker pattern that Claude Code increasingly encourages — a thin coordinator dispatching specialized subagents in parallel for heterogeneous workloads — most of the substantive compute happens in the subagents. Without fast-mode dispatch, the parent can be on /fast while every dispatched worker reverts to default Opus 4.7 latency. Fast-mode latency wins land where the work isn't.

Use case

Common pattern: a session-level orchestrator partitions a job into heterogeneous subtasks and fans them out to subagents differentiated by role and capability. Subtasks that need frontier-tier reasoning (multi-step analysis, structured synthesis, expert-domain judgment) are routed to Opus subagents; mechanical work (search, formatting, extraction, summarization) is routed to Sonnet. These dispatches are typically backgrounded — wall-clock latency for the longest Opus worker dominates time-to-result.

The natural API shape would let the orchestrator pick the fast variant per dispatch — exactly as /fast does for the main session — without sacrificing model class.

Proposed shape

Either:

  • model accepts opus-fast as an additional symbolic value (or opus-4-6 if exposing the model ID directly is preferred), OR
  • Agent({ fastMode: true, ... }) parameter that pairs with model: "opus" to select the fast variant.

Should compose cleanly with run_in_background: true (where these dispatches typically live) and isolation: "worktree".

Why this matters now

Subagent fan-out is the dominant scaling axis for Claude Code usage as fleets and agent teams expand. Fast-mode benefit on the parent applies to roughly 1/N of the actual reasoning compute when N specialized subagents are in flight — exactly inverse to what a power user wants. Closing this gap is a single API surface change with broad benefit across orchestration-style usage.

Workarounds considered

  • Routing reasoning through Sonnet: real quality regression on tasks that genuinely require Opus class.
  • Pre-decomposing Opus-class work into a Sonnet pass + an Opus pass: partial mitigation; the Opus pass still pays full latency.
  • Running the reasoning in main context: defeats the orchestration architecture, blocks the parent, and burns parent context window.
  • Pinning a model ID literal: model enum validation rejects model-ID strings; no documented escape hatch.

Affects

Agent tool dispatch, the run_in_background workflow (subagent latency dominates background turnaround), and any orchestration framework built on the Claude Agent SDK.

extent analysis

TL;DR

To address the issue, consider adding support for opus-fast or opus-4-6 as an additional value for the model parameter in the Agent() tool.

Guidance

  • Review the proposed API shape changes, such as adding opus-fast or opus-4-6 to the model enum, to enable fast-mode dispatch for subagents.
  • Evaluate the feasibility of introducing a fastMode parameter in the Agent() constructor to select the fast variant when model is set to opus.
  • Assess the compatibility of these changes with existing workflows, particularly those using run_in_background: true and isolation: "worktree".
  • Consider the potential impact on performance and latency in subagent dispatch and the overall orchestration architecture.

Notes

The proposed solution requires modifications to the Agent tool and its API, which may involve updates to the underlying implementation and documentation.

Recommendation

Apply a workaround by exploring the possibility of adding opus-fast or opus-4-6 as a valid value for the model parameter, as this change seems to be a straightforward solution to the problem, allowing for fast-mode dispatch without sacrificing model class.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

claude-code - 💡(How to fix) Fix Feature request: fast-mode dispatch for Agent() subagents (model=opus-fast or fastMode flag) [1 participants]