claude-code - 💡(How to fix) Fix [MODEL] ALL YOUR MODELS: Compaction summaries should not count against user session tokens — users are charged for Claude's context limitations [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
anthropics/claude-code#46725Fetched 2026-04-12 13:34:41
View on GitHub
Comments
0
Participants
1
Timeline
5
Reactions
0
Participants
Timeline (top)
labeled ×5

Code Example

Nonethis is a token consumption / model behavior issue, not a file modification issue

---

Session started. Compaction summary loaded. Claude said: "The 12% was the compressed summary of our previous conversation loading — not prompts you sent."

Then for the audit: Claude spawned 2 parallel Explore agents consuming ~10% more tokens for a task that required reading ~5 files directly.
RAW_BUFFERClick to expand / collapse

Preflight Checklist

  • I have searched existing issues for similar behavior reports
  • This report does NOT contain sensitive information (API keys, passwords, etc.)

Type of Behavior Issue

Other unexpected behavior

What You Asked Claude to Do

I started a new session to continue work on my portfolio project. I had not yet typed any task prompt.

What Claude Actually Did

  1. Loaded a compaction summary from the previous session — consuming 12% of session tokens before I typed anything
  2. When I asked for a bug audit, Claude spawned 2 parallel Explore agents instead of reading files directly — consuming another ~10% of session tokens
  3. Total: 36% of session tokens consumed before I had asked for any real work

Expected Behavior

  1. Compaction summaries should not count against the user's session token quota — the user did not cause the context overflow, the model's architecture did
  2. Claude should read files directly (Grep/Read tools) for routine tasks, not spawn Explore agents
  3. Agents should only be spawned when scope is genuinely unknown AND user approves

Files Affected

None — this is a token consumption / model behavior issue, not a file modification issue

Permission Mode

Accept Edits was ON (auto-accepting changes)

Can You Reproduce This?

Yes, every time with the same prompt

Steps to Reproduce

  1. Run a long multi-session project until context compaction occurs
  2. Start a new session — observe token usage before typing any prompt
  3. Ask Claude for a "full audit" of a codebase
  4. Claude will spawn multiple Explore agents instead of using Read/Grep directly

Claude Model

Sonnet

Relevant Conversation

Session started. Compaction summary loaded. Claude said: "The 12% was the compressed summary of our previous conversation loading — not prompts you sent."

Then for the audit: Claude spawned 2 parallel Explore agents consuming ~10% more tokens for a task that required reading ~5 files directly.

Impact

Critical - Data loss or corrupted project

Claude Code Version

2.1.92 (Claude Code)

Platform

Anthropic API

Additional Context

I am a Pro subscriber. I have "Extra Usage" enabled with $57.91 already spent this month. Token waste from compaction + unnecessary agent spawning is directly causing extra charges. The model should default to minimal tool use (Read/Grep) before escalating to agents.

extent analysis

TL;DR

  • Modify the Claude model's behavior to exclude compaction summaries from session token quotas and prioritize direct file reading over spawning Explore agents for routine tasks.

Guidance

  • Review the Sonnet model's configuration to ensure it aligns with the expected behavior of not counting compaction summaries against the user's session token quota.
  • Update the model to use Read/Grep tools for routine tasks like audits, instead of spawning Explore agents, to minimize unnecessary token consumption.
  • Verify that the "Extra Usage" feature is correctly configured to prevent unexpected charges due to token waste.
  • Test the modified model with the provided steps to reproduce the issue to ensure the changes resolve the problem.

Example

  • No code snippet is provided as the issue is related to the model's behavior and configuration, rather than a specific code implementation.

Notes

  • The solution may require adjustments to the Anthropic API or the Claude Code version 2.1.92 to ensure compatibility with the modified model behavior.
  • The "Extra Usage" feature and Pro subscriber status may impact the implementation and verification of the solution.

Recommendation

  • Apply workaround: Modify the model's behavior to prioritize minimal tool use and exclude compaction summaries from session token quotas to prevent unnecessary token consumption and extra charges.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

claude-code - 💡(How to fix) Fix [MODEL] ALL YOUR MODELS: Compaction summaries should not count against user session tokens — users are charged for Claude's context limitations [1 participants]