claude-code - 💡(How to fix) Fix Excessive token consumption on Opus 4.6 — thinking tokens disproportionate to output [4 comments, 2 participants]

Filament61 · 2026-04-08T16:51:18Z

[claude-code] Simple tasks 3 v8 ignore annotations + ~20 unit tests = 3 commits consumed 55% → 89% of a 5-hour session quota on Claude Code with Opus 4.6 1M co… Simple tasks (3 `v8 ignore` annotations + ~20 unit tests = 3 commits) consumed **55% → 89% of a 5-hour session quota** on Claude Code with Opus 4.6 (1M context). ## Summary Simple tasks (3 `v8 ignore` annotations + ~20 unit tests = 3 commits) consumed **55% → 89% of a 5-hour session quota** on Claude Code with Opus 4.6 (1M context). ## Evidence from transcript ``` "Churned for 3m 46s" → produced: nothing (waiting for user input) "Crunched for 1m 39s" → produced: "Commit efcf14f2. D'autres fichiers à couvrir ?" "Churned for 37s" → produced: 3 lines of text Agent Sonnet: 56 tool uses for branch coverage tests → interrupted by user ``` ## Quota impact - **Before:** 55% session quota used - **After (~20 min of work):** 89% session quota used - Context window: 58% filled, 38h58m accumulated session time - Paid overage: 44.62€ / 45€ limit (99% consumed) ## Screenshots (See attached — Settings > Usage page showing 72% then 89% shortly after) ![Usage 72%](https://github.com/user-attachments/assets/placeholder-screenshot-72pct) ![Usage 89%](https://github.com/user-attachments/assets/placeholder-screenshot-89pct) **Note:** Screenshots will be added in a follow-up comment as GitHub CLI does not support image uploads. ## Questions 1. Is it expected that "thinking tokens" (churning/crunching) consume quota even when no output is produced? 2. Does cost scale non-linearly with context size (38h of conversation)? 3. When a Sonnet sub-agent makes 56 tool calls, does each tool call also cost Opus tokens on the parent? ## Environment - Claude Code CLI (latest) - Model: Opus 4.6 (1M context) - Plan: Max with paid overage enabled - OS: macOS Darwin 24.6.0

claude-code2026-04-08 16:51:18

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

anthropics/claude-code#45333•Fetched 2026-04-09 08:07:47

View on GitHub

Comments

Participants

Timeline

Reactions

Author

Filament61

Participants

Filament61

github-actions[bot]

Timeline (top)

commented ×4cross-referenced ×3labeled ×3

Simple tasks (3 v8 ignore annotations + ~20 unit tests = 3 commits) consumed 55% → 89% of a 5-hour session quota on Claude Code with Opus 4.6 (1M context).

Root Cause

Simple tasks (3 v8 ignore annotations + ~20 unit tests = 3 commits) consumed 55% → 89% of a 5-hour session quota on Claude Code with Opus 4.6 (1M context).

Code Example

"Churned for 3m 46s" → produced: nothing (waiting for user input)
"Crunched for 1m 39s" → produced: "Commit efcf14f2. D'autres fichiers à couvrir ?"
"Churned for 37s" → produced: 3 lines of text
Agent Sonnet: 56 tool uses for branch coverage tests → interrupted by user

RAW_BUFFERClick to expand / collapse

Summary

Simple tasks (3 v8 ignore annotations + ~20 unit tests = 3 commits) consumed 55% → 89% of a 5-hour session quota on Claude Code with Opus 4.6 (1M context).

Evidence from transcript

"Churned for 3m 46s" → produced: nothing (waiting for user input)
"Crunched for 1m 39s" → produced: "Commit efcf14f2. D'autres fichiers à couvrir ?"
"Churned for 37s" → produced: 3 lines of text
Agent Sonnet: 56 tool uses for branch coverage tests → interrupted by user

Quota impact

Before: 55% session quota used
After (~20 min of work): 89% session quota used
Context window: 58% filled, 38h58m accumulated session time
Paid overage: 44.62€ / 45€ limit (99% consumed)

Screenshots

(See attached — Settings > Usage page showing 72% then 89% shortly after)

Note: Screenshots will be added in a follow-up comment as GitHub CLI does not support image uploads.

Questions

Is it expected that "thinking tokens" (churning/crunching) consume quota even when no output is produced?
Does cost scale non-linearly with context size (38h of conversation)?
When a Sonnet sub-agent makes 56 tool calls, does each tool call also cost Opus tokens on the parent?

Environment

Claude Code CLI (latest)
Model: Opus 4.6 (1M context)
Plan: Max with paid overage enabled
OS: macOS Darwin 24.6.0

extent analysis

TL;DR

Review and optimize the usage of "thinking tokens" and tool calls in Claude Code to reduce quota consumption.

Guidance

Investigate if the "thinking tokens" (churning/crunching) are necessary for the tasks and if there are any optimization options available to reduce their usage.
Verify the cost scaling with context size to understand if the 38h of conversation is contributing to the high quota consumption.
Check the documentation or contact support to clarify if each tool call made by a Sonnet sub-agent also costs Opus tokens on the parent.

Example

No specific code snippet is provided as the issue is related to the usage and configuration of Claude Code and Opus 4.6.

Notes

The provided information suggests that the quota consumption is high due to the "thinking tokens" and tool calls, but more investigation is needed to understand the root cause and optimize the usage.

Recommendation

Apply workaround: Review and optimize the usage of "thinking tokens" and tool calls to reduce quota consumption, as the current usage pattern is leading to high quota consumption and paid overage.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#model compatibility #GPU setup #container setup #orchestration issue #cache issue

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix Excessive token consumption on Opus 4.6 — thinking tokens disproportionate to output [4 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Code Example

Summary

Evidence from transcript

Quota impact

Screenshots

Questions

Environment

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

TRENDING

claude-code - 💡(How to fix) Fix Excessive token consumption on Opus 4.6 — thinking tokens disproportionate to output [4 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Code Example

Summary

Evidence from transcript

Quota impact

Screenshots

Questions

Environment

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING