gemini-cli - 💡(How to fix) Fix [Bug] Gemini CLI ignores model choices made on command line at startup and in settings, is not aware which model is used [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
google-gemini/gemini-cli#25898Fetched 2026-04-24 06:13:22
View on GitHub
Comments
0
Participants
1
Timeline
1
Reactions
0
Author
Participants
Timeline (top)
labeled ×1

Error Message

At the same time usage quota are far from even dented for Gemini Pro - never seen more than 15%, but often when Gemini Flash is in frenzy Pro usage is at 0% (rounding error) while Flash is at 15% of daily quota. This happens disregard for time of day and night - I work sometimes very long hours.

RAW_BUFFERClick to expand / collapse

What happened?

Over several last days Gemini CLI started to ignore any settings used to set used models - my choice of Gemini 3.1 Pro Preview stems from desire to get good quality code and efficient coding session. Over last days I observed tendency for Gemini to route requests at the beginning to pro model and after some time to route requests to flash model. As code base I work on is complex and multi platform change of routing is immediately visible in presumably lost context - model does not even remember last user actions or requests, or looses ability to follow implementation plan and jumps to edit code on which work was earlier already finished with that same agent. Moreover agent just couple of minutes earlier confirmed after user request that code which is now edited in fact was finished earlier.

Loss of context and forgetting about execution / implementation plans often results in introduction of bugs and bug quashing frenzy - when agent after every fast compilation attempt comes with new code modification proposal after just several seconds its clear he is using flash model and after checking stats i observed that during some sessions almost all requests were routed to Gemini flash.

At the same time usage quota are far from even dented for Gemini Pro - never seen more than 15%, but often when Gemini Flash is in frenzy Pro usage is at 0% (rounding error) while Flash is at 15% of daily quota. This happens disregard for time of day and night - I work sometimes very long hours.

What did you expect to happen?

Just stick to the contract Google has entered to with customer and respect configuration decisions. Session UUID which was attached may not be representative for that problem but earlier session had to be terminated early due to this routing. At the same time it seems Flash model has a much lower ability to follow any instructions, mandatory rules or security limitations.

Client information

Client information

CLI Version: 0.39.0
Git Commit: https://github.com/google-gemini/gemini-cli/commit/398f78dcaa8fd2396684add19933916f7b87d349
Session ID: 6df14344-1210-4a6d-81cd-4355a0318e6c
Operating System: win32 v2 5.9.0
Sandbox Environment: no sandbox
Model Version: gemini-3.1-pro-preview
Auth Type: oauth-personal
Memory Usage: 393.5 MB
Terminal Name: Unknown
Terminal Background: #0c0 c0c
Kitty Keyboard Protocol: Unsupported

Login information

No response

Anything else we need to know?

No response

extent analysis

TL;DR

The Gemini CLI may be ignoring settings for the used model, causing requests to be routed to the flash model instead of the configured Gemini 3.1 Pro Preview model.

Guidance

  • Verify the model usage statistics to confirm that the requests are being routed to the flash model despite the configuration settings.
  • Check the CLI version and Git commit to ensure that there are no known issues with the current version (0.39.0) that may be causing this behavior.
  • Review the session logs to see if there are any errors or warnings that may indicate why the requests are being routed to the flash model.
  • Consider testing with a different model version or configuration to see if the issue persists.

Notes

The issue may be related to a bug or limitation in the current CLI version or model implementation, and further investigation is needed to determine the root cause.

Recommendation

Apply workaround: Try configuring the model settings again or testing with a different model version to see if the issue persists, as the root cause is unclear and may require further investigation.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING