claude-code: [DOCS] `/model` docs omit the uncached full-history warning for mid-conversation switches

GitHub stats

anthropics/claude-code#48087 (fetched 2026-04-15 06:33:36)
Comments: 0 · Participants: 1 · Timeline events: 4 (labeled ×4) · Reactions: 0

Error Message

Improved /model to warn before switching models mid-conversation, since the next response re-reads the full history uncached

Root Cause

Because model-config, commands, and costs all present /model switching as a normal mid-session workflow, users are missing the cautionary context that motivated the new warning in v2.1.108.


Documentation Type

Missing documentation (feature not documented)

Documentation Location

https://code.claude.com/docs/en/model-config

Section/Topic

Setting your model / during-session /model switching guidance

Current Documentation

The model configuration page currently says:

You can configure your model in several ways, listed in order of priority:

  1. During session - Use /model <alias|name> to switch models mid-session

The command reference says:

/model [model] | Select or change the AI model. For models that support it, use left/right arrows to adjust effort level. The change takes effect immediately without waiting for the current response to finish

The cost guide also recommends mid-session switching:

Sonnet handles most coding tasks well and costs less than Opus. Reserve Opus for complex architectural decisions or multi-step reasoning. Use /model to switch models mid-session, or set a default in /config.

What's Wrong or Missing?

Changelog v2.1.108 says:

Improved /model to warn before switching models mid-conversation, since the next response re-reads the full history uncached

The current docs never explain this user-visible behavior.

They do not tell users that switching models after a conversation already has history makes the next response resend the full conversation context uncached, which can increase token usage and latency. They also do not mention that Claude Code now warns before this happens.

Because model-config, commands, and costs all present /model switching as a normal mid-session workflow, users are missing the cautionary context that motivated the new warning in v2.1.108.
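
To make the cost consequence concrete, here is a minimal sketch of the arithmetic, assuming the only difference on the turn after a switch is that the existing history is billed at an uncached input rate instead of a cached-read rate. The function name and both prices are hypothetical placeholders, not published rates, and the sketch ignores cache-write surcharges and any price difference between the old and new model.

```python
def switch_penalty(history_tokens: int,
                   uncached_price_per_mtok: float,
                   cached_price_per_mtok: float) -> float:
    """Extra input cost (in currency units) for one turn when the
    conversation history is re-read uncached instead of from cache.

    Prices are per million input tokens and are supplied by the caller;
    this sketch does not know any real pricing.
    """
    per_token_delta = (uncached_price_per_mtok - cached_price_per_mtok) / 1_000_000
    return history_tokens * per_token_delta


# Hypothetical example: a 200,000-token history, with placeholder prices.
extra = switch_penalty(200_000,
                       uncached_price_per_mtok=3.0,
                       cached_price_per_mtok=0.3)
print(f"Extra input cost on the next response: {extra:.2f}")
```

The penalty grows linearly with the length of the history, which is why switching at session start (empty history) avoids it.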

Suggested Improvement

Add a short note anywhere /model mid-session switching is introduced, especially on model-config and the /model command reference.

Suggested wording:

Switching models in the middle of an existing conversation can increase cost and latency on the next response, because Claude Code must resend the full conversation history uncached for the new model. Claude Code warns before this happens. If you know you'll need a different model, prefer selecting it at session start when possible.

The costs page should also stop recommending mid-session switching without this caveat.

Impact

Medium - Makes feature difficult to understand

Additional Context

Affected Pages:

Page | Context
https://code.claude.com/docs/en/model-config | Primary guidance that explicitly tells users to switch models mid-session
https://code.claude.com/docs/en/commands | /model command reference omits the warning and uncached-history consequence
https://code.claude.com/docs/en/costs | Recommends using /model mid-session as a cost optimization without caveat

Total scope: 3 pages affected

Source: Changelog v2.1.108

Exact changelog entry:

Improved /model to warn before switching models mid-conversation, since the next response re-reads the full history uncached

Related documented precedent:

https://code.claude.com/docs/en/fast-mode already documents a similar cost/latency tradeoff for another mid-conversation mode switch:

When you switch into fast mode mid-conversation, you pay the full fast mode uncached input token price for the entire conversation context. This costs more than if you had enabled fast mode from the start.

extent analysis

TL;DR

Add a note to documentation pages about the potential cost and latency increase when switching models mid-session using /model, and consider revising the costs page to include this caveat.

Guidance

  • Review the affected pages (model-config, commands, and costs) to ensure consistent documentation of the /model command's behavior and its implications on cost and latency.
  • Add a warning or note on these pages to inform users about the potential increase in cost and latency when switching models mid-session, as suggested in the issue.
  • Consider revising the costs page so that its recommendation to use /model mid-session as a cost optimization either carries the caveat or is removed.
  • Verify that the added notes and revisions are consistent with the changelog entry for v2.1.108 and the related documented precedent for fast mode.
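
The review steps above could be partly automated with a small text check. The following is a rough heuristic sketch, not an official tool: it takes the text of a docs page (obtained however you like) and reports which parts of the caveat appear to be missing. The pattern names, the patterns themselves, and the function name are all assumptions made for illustration.

```python
import re

# Hypothetical keyword patterns for the three parts of the caveat:
# the uncached re-read, its cost/latency effect, and the new warning.
REQUIRED_PATTERNS = {
    "uncached history": re.compile(r"uncached", re.IGNORECASE),
    "cost or latency": re.compile(r"\b(cost|latency)\b", re.IGNORECASE),
    "warning": re.compile(r"\bwarns?\b", re.IGNORECASE),
}


def missing_caveats(page_text: str) -> list[str]:
    """Return the names of caveat parts that page_text does not mention."""
    return [name for name, pattern in REQUIRED_PATTERNS.items()
            if not pattern.search(page_text)]


suggested_note = (
    "Switching models in the middle of an existing conversation can increase "
    "cost and latency on the next response, because Claude Code must resend "
    "the full conversation history uncached for the new model. "
    "Claude Code warns before this happens."
)
current_costs_text = (
    "Sonnet handles most coding tasks well and costs less than Opus. "
    "Use /model to switch models mid-session, or set a default in /config."
)

print(missing_caveats(suggested_note))
print(missing_caveats(current_costs_text))
```

A keyword match only shows that the topic is mentioned; a human reviewer still needs to confirm the wording agrees with the v2.1.108 changelog entry.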

Example

A possible note to add to the documentation could be:

Switching models in the middle of an existing conversation can increase cost and latency on the next response, because Claude Code must resend the full conversation history uncached for the new model. Claude Code warns before this happens. If you know you'll need a different model, prefer selecting it at session start when possible.

Notes

The suggested improvement should be applied to all affected pages to ensure consistent documentation and to prevent user confusion. The exact wording of the note can be adjusted to fit the tone and style of the existing documentation.

Recommendation

Add the suggested note to each of the three affected documentation pages. This is a documentation fix rather than a workaround: it gives users the information they need to make informed decisions about model switching and cost optimization.
