claude-code - 💡(How to fix) Fix [MODEL] ALL YOUR MODELS: Compaction summaries should not count against user session tokens — users are charged for Claude's context limitations [1 participants]

claude-code2026-04-11 17:27:37

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

anthropics/claude-code#46725•Fetched 2026-04-12 13:34:41

View on GitHub

Comments

Participants

Timeline

Reactions

Author

RobynAwesome

Participants

RobynAwesome

Timeline (top)

labeled ×5

Code Example

None — this is a token consumption / model behavior issue, not a file modification issue

---

Session started. Compaction summary loaded. Claude said: "The 12% was the compressed summary of our previous conversation loading — not prompts you sent."

Then for the audit: Claude spawned 2 parallel Explore agents consuming ~10% more tokens for a task that required reading ~5 files directly.

RAW_BUFFERClick to expand / collapse

Preflight Checklist

I have searched existing issues for similar behavior reports
This report does NOT contain sensitive information (API keys, passwords, etc.)

Type of Behavior Issue

Other unexpected behavior

What You Asked Claude to Do

I started a new session to continue work on my portfolio project. I had not yet typed any task prompt.

What Claude Actually Did

Loaded a compaction summary from the previous session — consuming 12% of session tokens before I typed anything
When I asked for a bug audit, Claude spawned 2 parallel Explore agents instead of reading files directly — consuming another ~10% of session tokens
Total: 36% of session tokens consumed before I had asked for any real work

Expected Behavior

Compaction summaries should not count against the user's session token quota — the user did not cause the context overflow, the model's architecture did
Claude should read files directly (Grep/Read tools) for routine tasks, not spawn Explore agents
Agents should only be spawned when scope is genuinely unknown AND user approves

Files Affected

None — this is a token consumption / model behavior issue, not a file modification issue

Permission Mode

Accept Edits was ON (auto-accepting changes)

Can You Reproduce This?

Yes, every time with the same prompt

Steps to Reproduce

Run a long multi-session project until context compaction occurs
Start a new session — observe token usage before typing any prompt
Ask Claude for a "full audit" of a codebase
Claude will spawn multiple Explore agents instead of using Read/Grep directly

Claude Model

Sonnet

Relevant Conversation

Session started. Compaction summary loaded. Claude said: "The 12% was the compressed summary of our previous conversation loading — not prompts you sent."

Then for the audit: Claude spawned 2 parallel Explore agents consuming ~10% more tokens for a task that required reading ~5 files directly.

Impact

Critical - Data loss or corrupted project

Claude Code Version

2.1.92 (Claude Code)

Platform

Anthropic API

Additional Context

I am a Pro subscriber. I have "Extra Usage" enabled with $57.91 already spent this month. Token waste from compaction + unnecessary agent spawning is directly causing extra charges. The model should default to minimal tool use (Read/Grep) before escalating to agents.

extent analysis

TL;DR

Modify the Claude model's behavior to exclude compaction summaries from session token quotas and prioritize direct file reading over spawning Explore agents for routine tasks.

Guidance

Review the Sonnet model's configuration to ensure it aligns with the expected behavior of not counting compaction summaries against the user's session token quota.
Update the model to use Read/Grep tools for routine tasks like audits, instead of spawning Explore agents, to minimize unnecessary token consumption.
Verify that the "Extra Usage" feature is correctly configured to prevent unexpected charges due to token waste.
Test the modified model with the provided steps to reproduce the issue to ensure the changes resolve the problem.

Example

No code snippet is provided as the issue is related to the model's behavior and configuration, rather than a specific code implementation.

Notes

The solution may require adjustments to the Anthropic API or the Claude Code version 2.1.92 to ensure compatibility with the modified model behavior.
The "Extra Usage" feature and Pro subscriber status may impact the implementation and verification of the solution.

Recommendation

Apply workaround: Modify the model's behavior to prioritize minimal tool use and exclude compaction summaries from session token quotas to prevent unnecessary token consumption and extra charges.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #latency issue #model loading #dependency error #configuration error

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix [MODEL] ALL YOUR MODELS: Compaction summaries should not count against user session tokens — users are charged for Claude's context limitations [1 participants]

Recommended Tools

GitHub issue graph ai analysis

Code Example

Preflight Checklist

Type of Behavior Issue

What You Asked Claude to Do

What Claude Actually Did

Expected Behavior

Files Affected

Permission Mode

Can You Reproduce This?

Steps to Reproduce

Claude Model

Relevant Conversation

Impact

Claude Code Version

Platform

Additional Context

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

TRENDING

claude-code - 💡(How to fix) Fix [MODEL] ALL YOUR MODELS: Compaction summaries should not count against user session tokens — users are charged for Claude's context limitations [1 participants]

Recommended Tools

GitHub issue graph ai analysis

Code Example

Preflight Checklist

Type of Behavior Issue

What You Asked Claude to Do

What Claude Actually Did

Expected Behavior

Files Affected

Permission Mode

Can You Reproduce This?

Steps to Reproduce

Claude Model

Relevant Conversation

Impact

Claude Code Version

Platform

Additional Context

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING