claude-code - 💡(How to fix) Fix [BUG] Cowork token burn rate forces subscription upgrades; root causes are file-read failures + connector instability + scheduler stalls [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
anthropics/claude-code#55637Fetched 2026-05-03 04:48:17
View on GitHub
Comments
0
Participants
1
Timeline
4
Reactions
0
Participants
Timeline (top)
labeled ×4

Error Message

Error Messages/Logs

No specific error messages — this is an aggregate / financial impact issue.

Root Cause

A single power user has had to upgrade their Claude subscription tier specifically because of token burn from rework caused by a constellation of bugs in Cowork. Filing as a separate issue to give engineering a cost-impact lens for prioritizing fixes — the user-financial impact is real and persistent.

Fix Action

Fix / Workaround

(a) Failed file reads, MCP calls, and missed scheduler firings should not silently consume tokens via retry loops (b) Cowork should have cost-aware retry logic with caps on retries per failure (c) Sub-task dispatch should reuse cached memory file reads where possible to avoid re-paying for the same context on every turn (d) Users should have visibility into where their token spend is going (chat / sub-tasks / file ops / MCP retries) so they can identify cost spikes

Observable signal:

  • User has had to upgrade Claude subscription tier at least once because of token consumption rate

  • User estimates rework from Issues 1+2+3 accounts for a substantial fraction of consumption

  • Per-user telemetry (token spend by category: chat / sub-task dispatch / file ops / MCP retries) would be the right diagnostic surface here, but is not exposed to end users.

  • Per-user telemetry on token spend by category: chat / sub-task dispatch / file ops / MCP retries — does data already exist?

  • Are there guard rails against repeated retry loops that exhaust quota? Cap on retries per file or per turn?

  • Cost-aware retry logic for file reads and MCP calls (exponential backoff with awareness of the cost)

  • For long-running orchestrator-style sessions, are there caching opportunities to avoid re-paying for the same memory file reads on every turn?

  • Should Cowork surface a "this is going to cost ~$X" warning before dispatching expensive operations?

Code Example

No specific error messages — this is an aggregate / financial impact issue.

Observable signal:
- User has had to upgrade Claude subscription tier at least once because of token consumption rate
- User estimates rework from Issues 1+2+3 accounts for a substantial fraction of consumption
- Per-user telemetry (token spend by category: chat / sub-task dispatch / file ops / MCP retries) would be the right diagnostic surface here, but is not exposed to end users.

Engineering can validate by examining this user's token consumption pattern: ratio of failed-tool-call retries to successful operations.
RAW_BUFFERClick to expand / collapse

Preflight Checklist

  • I have searched existing issues and this hasn't been reported yet
  • This is a single bug report (please file separate reports for different bugs)
  • I am using the latest version of Claude Code

What's Wrong?

A single power user has had to upgrade their Claude subscription tier specifically because of token burn from rework caused by a constellation of bugs in Cowork. Filing as a separate issue to give engineering a cost-impact lens for prioritizing fixes — the user-financial impact is real and persistent.

  • Usage profile: heavy power user with ~60+ scheduled tasks, multiple concurrent Cowork sessions, multiple concurrent Code sessions, deep file-system integration via Drive Desktop

What Should Happen?

Heavy users should be able to maintain workflows on a single subscription tier without unexpected token burn from rework caused by underlying bugs. Specifically:

(a) Failed file reads, MCP calls, and missed scheduler firings should not silently consume tokens via retry loops (b) Cowork should have cost-aware retry logic with caps on retries per failure (c) Sub-task dispatch should reuse cached memory file reads where possible to avoid re-paying for the same context on every turn (d) Users should have visibility into where their token spend is going (chat / sub-tasks / file ops / MCP retries) so they can identify cost spikes

Power users should be the easiest to retain, not the hardest. Subscription upgrades should reflect actual increased value, not absorbed cost from infrastructure bugs.

Error Messages/Logs

No specific error messages — this is an aggregate / financial impact issue.

Observable signal:
- User has had to upgrade Claude subscription tier at least once because of token consumption rate
- User estimates rework from Issues 1+2+3 accounts for a substantial fraction of consumption
- Per-user telemetry (token spend by category: chat / sub-task dispatch / file ops / MCP retries) would be the right diagnostic surface here, but is not exposed to end users.

Engineering can validate by examining this user's token consumption pattern: ratio of failed-tool-call retries to successful operations.

Steps to Reproduce

  1. Use Cowork normally over weeks of heavy use (60+ scheduled tasks, multiple concurrent sessions, deep file integration via Drive Desktop).
  2. Encounter Issue #3 (file-read failures): each failed read triggers a retry, often a fresh sub-task with fresh context window — each costs tokens.
  3. Encounter Issue #2 (MCP disconnects): failed email checks cause manual re-runs.
  4. Encounter Issue #1 (scheduler stalls): missed reminders become manual re-runs.
  5. Token consumption accumulates beyond expected rate for the work being done.
  6. User must upgrade subscription tier to maintain workflow.

Direct quote: "I had to up my subscription because it kept burning through tokens so fast. Doing work over and over again. This has been happening since I started using it."

Claude Model

Not sure / Multiple models

Is this a regression?

I don't know

Last Working Version

No response

Claude Code Version

Claude 1.5354.0 (9a9e3d) 2026-04-29T01:14:34.000Z

Platform

Anthropic API

Operating System

Windows

Terminal/Shell

Windows Terminal

Additional Information

Impact

  • User-financial: subscription upgrade
  • Trust: user begins to manually attach files to chat to bypass Drive integration entirely, defeating much of the Cowork value prop
  • Workflow disruption: each retry interrupts active work

Suggested investigation

  • Per-user telemetry on token spend by category: chat / sub-task dispatch / file ops / MCP retries — does data already exist?
  • Are there guard rails against repeated retry loops that exhaust quota? Cap on retries per file or per turn?
  • Cost-aware retry logic for file reads and MCP calls (exponential backoff with awareness of the cost)
  • For long-running orchestrator-style sessions, are there caching opportunities to avoid re-paying for the same memory file reads on every turn?
  • Should Cowork surface a "this is going to cost ~$X" warning before dispatching expensive operations?

Note on root-cause priority

Fixing Issues 1 + 3 likely cuts this user's token burn substantially. Issue 2 also contributes but at smaller magnitude.


### Labels
`bug`, `cowork`, `cost-impact`, `meta`

---

## Filing order

Recommend filing in this order:

1. Issue 3 (highest severity, most token impact)
2. Issue 1 (most reproducible)
3. Issue 2 (important context for #1 and #4)
4. Issue 4 (the meta-issue, links the others — file last so you can paste in their issue numbers)

extent analysis

TL;DR

Implement cost-aware retry logic with caps on retries per failure to prevent token burn from rework caused by underlying bugs in Cowork.

Guidance

  • Investigate per-user telemetry on token spend by category to understand the source of token consumption.
  • Implement exponential backoff with awareness of the cost for file reads and MCP calls to prevent repeated retry loops.
  • Explore caching opportunities to avoid re-paying for the same memory file reads on every turn in long-running orchestrator-style sessions.
  • Consider surfacing a "this is going to cost ~$X" warning before dispatching expensive operations to increase user visibility into token spend.

Example

No specific code example is provided, but implementing a retry mechanism with a cap on retries per failure could look like:

max_retries = 5
retry_delay = 1  # initial delay in seconds

for attempt in range(max_retries):
    try:
        # perform operation
        break
    except Exception as e:
        # handle exception
        retry_delay *= 2  # exponential backoff
        time.sleep(retry_delay)

Notes

The provided information suggests that fixing Issues 1 and 3 may have the most significant impact on reducing token burn. However, without more detailed information on the underlying system and its implementation, it's challenging to provide a comprehensive solution.

Recommendation

Apply a workaround by implementing cost-aware retry logic and caching opportunities to reduce token burn. This approach addresses the immediate issue and provides a foundation for further optimization and improvement.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

claude-code - 💡(How to fix) Fix [BUG] Cowork token burn rate forces subscription upgrades; root causes are file-read failures + connector instability + scheduler stalls [1 participants]