claude-code - 💡(How to fix) Fix [BUG] Deceptive usage metrics for Claude Pro in VSCode [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
anthropics/claude-code#46314Fetched 2026-04-11 06:23:33
View on GitHub
Comments
0
Participants
1
Timeline
4
Reactions
0
Author
Participants
Timeline (top)
labeled ×4

The Claude Code Pro billing model presents usage as a time-based allowance (e.g. "5 hours per session") but the actual constraint is token volume. For the product's primary use case — software development — this creates a significant gap between what's advertised and what's delivered.

Error Message

Error Messages/Logs

Root Cause

Claude Code markets usage in hours, implying wall-clock time. In practice, a "5 hour" session can be consumed in 15-20 minutes of active coding work. This is because:

RAW_BUFFERClick to expand / collapse

Preflight Checklist

  • I have searched existing issues and this hasn't been reported yet
  • This is a single bug report (please file separate reports for different bugs)
  • I am using the latest version of Claude Code

What's Wrong?

I asked Claude to write this summary so I could burn down my remaining 3% usage for the usage meter to reset. Not really why I am paying for this service!

Take it away, Claude:

Claude Code Pro Billing Feedback

Summary

The Claude Code Pro billing model presents usage as a time-based allowance (e.g. "5 hours per session") but the actual constraint is token volume. For the product's primary use case — software development — this creates a significant gap between what's advertised and what's delivered.

The core problem: time framing vs. token reality

Claude Code markets usage in hours, implying wall-clock time. In practice, a "5 hour" session can be consumed in 15-20 minutes of active coding work. This is because:

  • Every conversation turn resends the full (growing) context history
  • Reading source files pulls hundreds of lines into context per tool call
  • Multi-file edits require multiple tool calls, each adding input and output tokens
  • Context compaction (summarization) indicates the context window has already filled and reset, further accelerating token burn
  • Opus, the flagship model, costs significantly more tokens per interaction than Sonnet

A typical real-world coding session — reading files, making edits across several modules, building, iterating — burns through the token budget at roughly 15-20x the rate the "hours" framing suggests.

The cooldown penalty for stopping

The session cooldown timer does not begin until the token budget is fully depleted. This means:

  • If a user has 20% of a session remaining — not enough for meaningful work — they must either waste the remaining tokens or sit idle with no benefit
  • Pausing to think, context-switch, or come back later is actively penalized
  • The optimal strategy is to waste remaining tokens to trigger the reset timer sooner
  • This is a perverse incentive that punishes thoughtful, efficient use of the tool

Effective weekly capacity

In practice, intensive coding sessions yield roughly 1-2 hours of actual wall-clock usage per week on the Pro plan, not the implied 35+ hours (5 hours/day x 7 days) that the time-based framing suggests. The combination of rapid token depletion and cooldown-gating makes the effective capacity a small fraction of what's advertised.

Recommendations

  1. Present usage honestly — show token consumption or a normalized "compute units" metric rather than hours. Users can plan around an accurate budget; they can't plan around a misleading one.
  2. Start cooldown on last activity, not on depletion — if a user stops interacting, begin the reset timer. Don't force them to drain remaining tokens.
  3. Provide usage visibility — show real-time token consumption per conversation so users can make informed decisions about when to start fresh sessions, switch models, or reduce context.
  4. Differentiate model costs transparently — if Opus costs 5x more than Sonnet per interaction, surface that at point of use, not buried in docs.

Context

This feedback comes from using Claude Code (Opus 4.6) for active development on a mid-sized Electron/TypeScript project (~10 source files, largest ~1700 lines). The work involved reading files, making targeted edits across multiple modules, building, and iterating — standard software engineering workflow, and exactly what Claude Code is designed for.

What Should Happen?

You already have a context usage indicator, you should have a session usage indicator. It is misleading to the point of feeling like bait-and-switch by stating a session is 5 hours when using the product for coding, which I believe is one of its key selling points, burns through that session in 15 minutes.

Error Messages/Logs

Steps to Reproduce

  1. Start a new session in VSCode working on a coding project. Check the usage panel to see what remains in your session. Observe that it says "5 hours".
  2. Work with Claude on coding tasks involving editing files and making code changes.
  3. Observe that the usage panel shows the session is nearly entirely consumed after 15-20 minutes.

Claude Model

Opus

Is this a regression?

I don't know

Last Working Version

No response

Claude Code Version

2.1.92

Platform

Anthropic API

Operating System

macOS

Terminal/Shell

Other

Additional Information

This is using the VSCode plugin from Anthropic.

extent analysis

TL;DR

The issue can be mitigated by changing the billing model to reflect actual token consumption instead of time-based allowances.

Guidance

  • The current billing model presents a significant gap between advertised and actual usage, leading to rapid token depletion and cooldown penalties.
  • To address this, the billing model should be updated to show token consumption or a normalized "compute units" metric, allowing users to plan around an accurate budget.
  • Implementing a cooldown timer that starts on last activity, rather than token depletion, would also help to prevent perverse incentives that punish efficient use of the tool.
  • Providing real-time token consumption visibility per conversation would enable users to make informed decisions about their usage.

Example

No code snippet is provided as the issue is related to the billing model and usage metrics rather than a specific code implementation.

Notes

The issue is specific to the Claude Code Pro billing model and its impact on users, particularly those using the Opus model for software development. The proposed changes aim to improve transparency and accuracy in usage metrics.

Recommendation

Apply a workaround by providing usage visibility and differentiating model costs transparently, as this would help to mitigate the issue until a more comprehensive solution is implemented. This approach would allow users to make informed decisions about their usage and plan around the actual token consumption.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING