claude-code - 💡(How to fix) Fix Opus 4.7 regression vs 4.6: ~2x token burn, mid-session stalls, no quality gain

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…

Paying Claude Code user reporting concrete regressions on Opus 4.7 (1M context) vs Opus 4.6 in daily use.

Root Cause

Paying Claude Code user reporting concrete regressions on Opus 4.7 (1M context) vs Opus 4.6 in daily use.

RAW_BUFFERClick to expand / collapse

Summary

Paying Claude Code user reporting concrete regressions on Opus 4.7 (1M context) vs Opus 4.6 in daily use.

Observed regressions

  1. No perceptible capability gain over 4.6. For daily coding/agent tasks (TypeScript, Shopify app work, CI debugging) 4.7 produces output of similar or lower quality than 4.6.
  2. ~2× token consumption for equivalent tasks. Usage cost has roughly doubled without a matching quality improvement.
  3. Mid-session stalls. 4.7 gets stuck mid-session (tool loop / no forward progress) noticeably more often than 4.6. Session restart required to recover.
  4. UI/harness regression. The recent Claude Code interface changes coincide with models behaving worse — shorter reasoning, more truncation, more confident-but-wrong answers. Whatever changed in the harness/system prompt around the UI refresh appears to have hurt model behavior.

Request

  • Investigate the 4.7 regression vs 4.6.
  • Keep 4.6 selectable as a stable fallback while 4.7 is being addressed.

Environment

  • Claude Code CLI (Windows)
  • Model: claude-opus-4-7 (1M context)
  • Account email: [email protected]

extent analysis

TL;DR

Reverting to Opus 4.6 may mitigate the regressions observed in Opus 4.7, including no perceptible capability gain, increased token consumption, mid-session stalls, and UI/harness regression.

Guidance

  • Investigate the differences in model behavior between Opus 4.6 and 4.7 to identify the root cause of the regressions.
  • Compare the system prompts and harness changes introduced in the UI refresh to determine if they are contributing to the model's worse behavior.
  • Test Opus 4.6 with the same tasks and inputs as Opus 4.7 to verify that the regressions are specific to the newer version.
  • Consider collecting more data on the mid-session stalls to understand the patterns and triggers of this issue.

Example

No code snippet is provided as the issue does not involve specific code changes.

Notes

The investigation should focus on the changes introduced in Opus 4.7 and the UI refresh, as these are the most likely causes of the regressions. The fact that Opus 4.6 is still available as a fallback suggests that it may be possible to revert to the previous version while the issues with Opus 4.7 are being addressed.

Recommendation

Apply workaround: Keep Opus 4.6 selectable as a stable fallback while Opus 4.7 is being addressed, as this will allow users to continue using a version that is known to work without the regressions.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

claude-code - 💡(How to fix) Fix Opus 4.7 regression vs 4.6: ~2x token burn, mid-session stalls, no quality gain