claude-code - 💡(How to fix) Fix [DOCS] `/model` docs omit the uncached full-history warning for mid-conversation switches [1 participants]

claude-code2026-04-14 19:58:29

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

anthropics/claude-code#48087•Fetched 2026-04-15 06:33:36

View on GitHub

Comments

Participants

Timeline

Reactions

Author

coygeek

Participants

coygeek

Timeline (top)

labeled ×4

Error Message

Improved /model to warn before switching models mid-conversation, since the next response re-reads the full history uncached Improved /model to warn before switching models mid-conversation, since the next response re-reads the full history uncached

Root Cause

Because model-config, commands, and costs all present /model switching as a normal mid-session workflow, users are missing the cautionary context that motivated the new warning in v2.1.108.

RAW_BUFFERClick to expand / collapse

Documentation Type

Missing documentation (feature not documented)

Documentation Location

https://code.claude.com/docs/en/model-config

Section/Topic

Setting your model / during-session /model switching guidance

Current Documentation

The model configuration page currently says:

You can configure your model in several ways, listed in order of priority:

During session - Use /model <alias|name> to switch models mid-session

The command reference says:

/model [model] | Select or change the AI model. For models that support it, use left/right arrows to adjust effort level. The change takes effect immediately without waiting for the current response to finish

The cost guide also recommends mid-session switching:

Sonnet handles most coding tasks well and costs less than Opus. Reserve Opus for complex architectural decisions or multi-step reasoning. Use /model to switch models mid-session, or set a default in /config.

What's Wrong or Missing?

Changelog v2.1.108 says:

Improved /model to warn before switching models mid-conversation, since the next response re-reads the full history uncached

The current docs never explain this user-visible behavior.

They do not tell users that switching models after a conversation already has history makes the next response resend the full conversation context uncached, which can increase token usage and latency. They also do not mention that Claude Code now warns before this happens.

Because model-config, commands, and costs all present /model switching as a normal mid-session workflow, users are missing the cautionary context that motivated the new warning in v2.1.108.

Suggested Improvement

Add a short note anywhere /model mid-session switching is introduced, especially on model-config and the /model command reference.

Suggested wording:

Switching models in the middle of an existing conversation can increase cost and latency on the next response, because Claude Code must resend the full conversation history uncached for the new model. Claude Code warns before this happens. If you know you'll need a different model, prefer selecting it at session start when possible.

The costs page should also stop recommending mid-session switching without this caveat.

Impact

Medium - Makes feature difficult to understand

Additional Context

Affected Pages:

Page	Context
https://code.claude.com/docs/en/model-config	Primary guidance that explicitly tells users to switch models mid-session
https://code.claude.com/docs/en/commands	`/model` command reference omits the warning and uncached-history consequence
https://code.claude.com/docs/en/costs	Recommends using `/model` mid-session as a cost optimization without caveat

Total scope: 3 pages affected

Source: Changelog v2.1.108

Exact changelog entry:

Improved /model to warn before switching models mid-conversation, since the next response re-reads the full history uncached

Related documented precedent:

https://code.claude.com/docs/en/fast-mode already documents a similar cost/latency tradeoff for another mid-conversation mode switch:

When you switch into fast mode mid-conversation, you pay the full fast mode uncached input token price for the entire conversation context. This costs more than if you had enabled fast mode from the start.

extent analysis

TL;DR

Add a note to documentation pages about the potential cost and latency increase when switching models mid-session using /model, and consider revising the costs page to include this caveat.

Guidance

Review the affected pages (model-config, commands, and costs) to ensure consistent documentation of the /model command's behavior and its implications on cost and latency.
Add a warning or note on these pages to inform users about the potential increase in cost and latency when switching models mid-session, as suggested in the issue.
Consider revising the costs page to remove or modify the recommendation to use /model mid-session as a cost optimization without including the necessary caveat.
Verify that the added notes and revisions are consistent with the changelog entry for v2.1.108 and the related documented precedent for fast mode.

Example

A possible note to add to the documentation could be:

Switching models in the middle of an existing conversation can increase cost and latency on the next response, because Claude Code must resend the full conversation history uncached for the new model. Claude Code warns before this happens. If you know you'll need a different model, prefer selecting it at session start when possible.

Notes

The suggested improvement should be applied to all affected pages to ensure consistent documentation and to prevent user confusion. The exact wording of the note can be adjusted to fit the tone and style of the existing documentation.

Recommendation

Apply the workaround by adding the suggested note to the affected documentation pages, as this will provide users with the necessary information to make informed decisions about model switching and cost optimization.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#optimization #prompt formatting #chain error #conversation history #tool integration

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix [DOCS] `/model` docs omit the uncached full-history warning for mid-conversation switches [1 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Documentation Type

Documentation Location

Section/Topic

Current Documentation

What's Wrong or Missing?

Suggested Improvement

Impact

Additional Context

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

TRENDING

claude-code - 💡(How to fix) Fix [DOCS] `/model` docs omit the uncached full-history warning for mid-conversation switches [1 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Documentation Type

Documentation Location

Section/Topic

Current Documentation

What's Wrong or Missing?

Suggested Improvement

Impact

Additional Context

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING