claude-code - 💡(How to fix) Fix [BUG] Opus 4.7 cache hit rate collapse after May 27 incident — Messages 1.1k→88.9k in 9 minutes, $630/session

claude-code2026-05-28 18:54:24

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

Error Message

Error Messages/Logs

No error messages — the issue is silent.

Code Example

No error messages — the issue is silent.
  Token consumption explodes without any visible warnings.
  Only observable through Context usage panel or external tools like CodeBurn.

RAW_BUFFERClick to expand / collapse

Preflight Checklist

I have searched existing issues and this hasn't been reported yet
This is a single bug report (please file separate reports for different bugs)
I am using the latest version of Claude Code

What's Wrong?

After the May 27 Opus 4.7 "elevated errors" incident (status.claude.com), cache hit rate collapsed from ~95% to ~50%, causing extreme token consumption.

After running /compact, Messages started at 1.1k tokens
9 minutes later it reached 88.9k tokens — with only one short user input ("finish the previous edit")
The project is a lightweight mobile app (small Kotlin codebase)
Single session cost: $630.42 API-equivalent (377 calls)
Full day (May 28): $1,142.35 (4,226 calls) on Max 20x ($200/month) plan

Cache hit rate comparison (CodeBurn dashboard):

Opus 4.7, 7-day average: 95.7%
Opus 4.7, May 29 only: 50.9% ← broken
Opus 4.6, same period: 89.9% ← normal

Autocompact buffer stayed fixed at 33.0k regardless of Messages growth. A fresh session on the same project shows Messages at 13 tokens (normal baseline).

Environment: Windows 11, VS Code extension, claude-opus-4-7, Max 20x plan

What Should Happen?

Messages should grow proportionally to actual tool calls and user input. Cache hit rate should remain ~95% as observed before the May 27 incident. A short instruction on a small project should not consume 88k tokens in 9 minutes.

Error Messages/Logs

No error messages — the issue is silent.
  Token consumption explodes without any visible warnings.
  Only observable through Context usage panel or external tools like CodeBurn.

Steps to Reproduce

Use Claude Code with Opus 4.7 in VS Code extension (Max 20x plan)
Open any small project
Run /compact to reset Messages to ~1k
Give a short follow-up instruction (e.g. "finish the previous edit")
Open Context usage panel repeatedly over the next 10 minutes
Observe Messages growing ~10k/minute without proportional user input

Claude Model

Opus

Is this a regression?

Yes, this worked in a previous version

Last Working Version

Worked normally until May 27, 2026

Claude Code Version

2.1.80 (Claude Code)

Platform

Anthropic API

Operating System

Windows

Terminal/Shell

VS Code integrated terminal

Additional Information

The May 27 "elevated errors on Claude Opus 4.7" incident (status.claude.com) was marked resolved, but cache behavior has not recovered.

Observed across multiple projects — not limited to one session or project. Messages reached 385k+ in other projects on the same day.

Screenshots attached: Context usage timeline (3:17→3:26), CodeBurn dashboard (today + 7-day), fresh session baseline.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix [BUG] Opus 4.7 cache hit rate collapse after May 27 incident — Messages 1.1k→88.9k in 9 minutes, $630/session

Recommended Tools

GitHub issue graph ai analysis

Error Message

Error Messages/Logs

Code Example

Preflight Checklist

What's Wrong?

What Should Happen?

Error Messages/Logs

Steps to Reproduce

Claude Model

Is this a regression?

Last Working Version

Claude Code Version

Platform

Operating System

Terminal/Shell

Additional Information

Still need to ship something?

TRENDING