claude-code - 💡(How to fix) Fix Opus 4.8 burns 5h limit in ~30min on operational work; no signal to downgrade to Sonnet

claude-code2026-06-01 00:36:43

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

A 1.5-hour session of routine operational work (run script → read log → fix bug → restart) on Claude Code with claude-opus-4-8 consumed:

26% of the 1M context window (263k tokens) in a single session
93% of the 5-hour rate limit in ~30 minutes
94% of the weekly limit in a few hours

The work itself was not Opus-grade: shell orchestration, reading 80-line log tails, editing two Python files, running diagnostic SQL. Sonnet 4.5 (or even Haiku) would have done it identically at ~⅓ the cost.

Error Message

No model-tier guidance. Running /model claude-opus-4-8 should warn: "This is a premium tier. For shell orchestration / log reading / routine edits, consider claude-sonnet-4-5 to preserve your budget." Currently you only find out after the limit is gone.
Worktree state isn't persisted between sessions. Sessions resumed in stale .claude/worktrees/claude-* directories that diverged from main by ~100 commits. The agent then "discovered" and "fixed" things that have been on main for weeks (table renames, calendar-token JWT, dead method removal). Pure token waste + hallucinated work summaries. Either (a) prune worktrees on session end or (b) on resume, warn "this worktree is N commits behind main".

Error Messages/Logs

Root Cause

A 1.5-hour session of routine operational work (run script → read log → fix bug → restart) on Claude Code with claude-opus-4-8 consumed:

26% of the 1M context window (263k tokens) in a single session
93% of the 5-hour rate limit in ~30 minutes
94% of the weekly limit in a few hours

Code Example

Above

RAW_BUFFERClick to expand / collapse

Preflight Checklist

I have searched existing issues and this hasn't been reported yet
This is a single bug report (please file separate reports for different bugs)
I am using the latest version of Claude Code

What's Wrong?

Summary

A 1.5-hour session of routine operational work (run script → read log → fix bug → restart) on Claude Code with claude-opus-4-8 consumed:

26% of the 1M context window (263k tokens) in a single session
93% of the 5-hour rate limit in ~30 minutes
94% of the weekly limit in a few hours

Concrete pain points from the session

System-reminder spam. Every tool call received a 71-item completed-tasks list appended via system-reminder, regardless of whether the Task API was being used. Conservative estimate: 30-40k tokens of redundant context across the session.
No model-tier guidance. Running /model claude-opus-4-8 should warn: "This is a premium tier. For shell orchestration / log reading / routine edits, consider claude-sonnet-4-5 to preserve your budget." Currently you only find out after the limit is gone.
Worktree state isn't persisted between sessions. Sessions resumed in stale .claude/worktrees/claude-* directories that diverged from main by ~100 commits. The agent then "discovered" and "fixed" things that have been on main for weeks (table renames, calendar-token JWT, dead method removal). Pure token waste + hallucinated work summaries. Either (a) prune worktrees on session end or (b) on resume, warn "this worktree is N commits behind main".
Read-on-large-files is expensive. Reading an 80-line log tail via the Read tool costs ~3k tokens. Multiplied by 20-30 progress checks during a long-running background task, that's a chunk of the context window gone to polling. A built-in tail -N / log-stream primitive would be significantly cheaper.

What would have helped

Default model = Sonnet 4.5 when invoked without --model, Opus only on explicit ask
Or: a soft per-session token meter visible to the user (e.g. "Opus session at 50k tokens — consider switching to Sonnet")
Worktree GC on session end
Cheaper "tail log" primitive

Environment

Claude Code: (your version — claude --version)
Model: claude-opus-4-8
Platform: Windows 11 / PowerShell 5.1
Session length: ~1.5h interactive

What Should Happen?

Above

Error Messages/Logs

Above

Steps to Reproduce

Above

Claude Model

Opus

Is this a regression?

Yes, this worked in a previous version

Last Working Version

No response

Claude Code Version

latest

Platform

Anthropic API

Operating System

Windows

Terminal/Shell

Other

Additional Information

No response

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix Opus 4.8 burns 5h limit in ~30min on operational work; no signal to downgrade to Sonnet

Recommended Tools

GitHub issue graph ai analysis

Error Message

Error Messages/Logs

Root Cause

Code Example

Preflight Checklist

What's Wrong?

Summary

Concrete pain points from the session

What would have helped

Environment

What Should Happen?

Error Messages/Logs

Steps to Reproduce

Claude Model

Is this a regression?

Last Working Version

Claude Code Version

Platform

Operating System

Terminal/Shell

Additional Information

Still need to ship something?

TRENDING