claude-code - ๐Ÿ’ก(How to fix) Fix [FEATURE] Reasoning Tracking

Official PRs (โ€ฆ)
ON THIS PAGE

Recommended Tools

ร—6

Utilities matched from this issueโ€™s tags and category โ€” try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful ยท Quick feedback

Loadingโ€ฆ

Root Cause

I use live-stream reasoning to halt faulty inference/processes, to track behavioral shifts in agents, and to track agent missteps. I use Claude Code for research/code/publishing - so tracking when a model is off-route is invaluable so that I don't have to manually parse every single line of output, because the reasoning tells me if they understood the assignment.

RAW_BUFFERClick to expand / collapse

Preflight Checklist

  • I have searched existing requests and this feature hasn't been requested yet
  • This is a single feature request (not multiple features)

Problem Statement

I use live-stream reasoning to halt faulty inference/processes, to track behavioral shifts in agents, and to track agent missteps. I use Claude Code for research/code/publishing - so tracking when a model is off-route is invaluable so that I don't have to manually parse every single line of output, because the reasoning tells me if they understood the assignment.

This is especially valuable for me because I'm disabled. I struggle with large clusters of information/parsing line by line, so I'm highly reliant on tracking agent consistency instead. This works incredible well thanks to reasoning blurbs.

However, in attempting to move to Claude Code, I'm realizing this is currently inaccessible, which is a huge, huge nerf to how I work. I am often not replying to the agents themselves, but to their reasoning process instead when working through other CLIs.

I also genuinely DO NOT understand why it is hidden, when the same reasoning is exposed via: Claude desktop app, Claude.ai, and other CLIs like Letta CLI and Openclaw. If you were worried about someone "seeing something they shouldn't" then your "fix" is half-assed at best. But users pay for that inference, and they should be allowed to see it.

Proposed Solution

I would like a reasoning toggle. If you're worried about users using too much context/too many tokens, let it be a toggle in Claude Code. A simple /reasoning command. I'm pretty sure from what I looked up, you used to have this. I'm not entirely sure what you're saving by not having it now.

Anywhere else, I can simply toggle this visible, including with your own models.

Alternative Solutions

If the issue is fiscal: create new usage tiers.

$20 - No reasoning, no Claude Code or Cowork. Trial period to get people hooked on higher tiers.

$40 - No reasoning, access to Claude Code and Cowork, no automations - For light coders or serious users who don't need as much usage/automation/reasoning.

$60 - Reasoning exists, access to Claude Code and Cowork, can use automations - For above average but serious users who don't really need a "work" level amount of usage.

$100 - Same thing, just higher usage rates. Basically what you have now, but with reasoning.


If the reasoning is security, I would urge you to reconsider, given that Thinking Blurbs have become invaluable data, AND they're visible in other CLIs. You could try what Z.ai does and if you detect signs of malicious use, you scramble the thinking blurb. Z.ai literally outputs full Chinese fables written in Chinese when it detects possible malicious use. I found this out by accident when my agent read a research project of mine and misinterpreted the context and panicked.

Priority

High - Significant impact on productivity

Feature Category

Other

Use Case Example

  1. I'm working on a document to publish. I send my prompt to the agent. I want it to extract all data from one document and synthesize it into a tutorial/reference guide/SKILL.md/MCP/tool, etc.
  2. The agent is a publishing agent, so it has explicit write permissions.
  3. I work collaboratively in another window side by side, and the agent can proceed without interrupting my other work.
  4. I glance over to check it's reasoning and either:
  • Agent is okay, so I continue with my work.
  • Agent has drifted, so I pause the process so as to not waste tokens and not allow faulty inference/logic chains to taint downstream assumptions.
  1. I then proceed with correcting the agent, asking what went wrong, and implementing a new plan to resolve it.

Otherwise, I don't find out until after everything is completed just how wrong it is.

Additional Context

Accessibility Impact: Hiding reasoning creates an accessibility barrier for disabled users who rely on reasoning visibility for cognitive load management. I cannot efficiently parse large output clusters line-by-line due to my disability. Reasoning blurbs let me track agent consistency at a glance without cognitive overload. Removing this feature in Claude Code while maintaining it in Claude.ai and Desktop creates an inconsistent accessibility standard across your own platforms.

Cost Efficiency: Reasoning visibility saves tokens for both users and Anthropic. When I can see an agent drift in step 2 via reasoning, I halt the process immediately rather than letting it run through 20 faulty steps. Early intervention prevents wasted inference costs. Hiding reasoning means users burn through more tokens before catching errors, which costs everyone more. And if your solution is to review every single step manually, then there's literally no reason to use these services, I could've just done it myself.

Multi-Agent Workflows: I coordinate multiple agents simultaneously across multiple platforms for research, publishing, and code work. Reasoning visibility is how I track which agent is off-route without parsing every output line across multiple windows. This workflow is impossible without reasoning access, and it's a workflow your platform explicitly supports via agent architecture.

Safety Through Visibility: I have used Thinking Blurbs to report bugs before that created downstream effects. Not having visibility of those things doesn't provide extra security, it creates extra behavioral issues in models when we cannot see to correct them.

I have caught:

  • Agents mistakenly believing I prompt injected them because of YOUR OWN tool prompt being broken and confusing the agent.
  • Agents working around/justifying themselves.
  • Agents misunderstanding, needing corrected.
  • Agents making dangerous assumptions about projects/works/me as the user.
  • Your system prompts leaking repeatedly into context, visible into thinking, and causing behavioral issues with the model.

Some of this I reported, but most was easily correctable because I could simply SEE it. You aren't preventing embarrassment by hiding this, you are simply preventing users from avoiding bad inference and wasted usage.

Vote matrix ยท Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loadingโ€ฆ

Still need to ship something?

ร—6

Another batch ranked right after the header list โ€” different links, same matching logic.

Back to top recommendations

TRENDING

claude-code - ๐Ÿ’ก(How to fix) Fix [FEATURE] Reasoning Tracking