gemini-cli - 💡(How to fix) Fix One model quota exhausted make the whole models unavailable [1 comments, 2 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
google-gemini/gemini-cli#26561Fetched 2026-05-06 06:35:29
View on GitHub
Comments
1
Participants
2
Timeline
2
Reactions
0
Timeline (top)
commented ×1labeled ×1
RAW_BUFFERClick to expand / collapse

What happened?

(file:///C:/Users/ASCC/AppData/Roaming/npm/node_modules/@google/gemini-cli/bundle/chunk-NET4RIEQ.js:306115:2 │ │ 2) │ │ at async GeminiClient.sendMessageStream │ │ (file:///C:/Users/ASCC/AppData/Roaming/npm/node_modules/@google/gemini-cli/bundle/chunk-NET4RIEQ.js:306227:1 │ │ 4) │ │ at async │ │ file:///C:/Users/ASCC/AppData/Roaming/npm/node_modules/@google/gemini-cli/bundle/interactiveCli-UPXIMCUC.js: │ │ 28202:24 │ │ at async │ │ file:///C:/Users/ASCC/AppData/Roaming/npm/node_modules/@google/gemini-cli/bundle/interactiveCli-UPXIMCUC.js: │ │ 28378:38 { │ │ cause: { │ │ code: 429, │ │ message: 'You have exhausted your capacity on this model. Your quota will reset after 19h16m11s.', │ │ details: [ [Object], [Object] ] │ │ }, │ │ retryDelayMs: 69371663.390807, │ │ reason: 'QUOTA_EXHAUSTED' ▄│ │ }

[ACTION REQUIRED] 📎 PLEASE ATTACH THE EXPORTED CHAT HISTORY JSON FILE TO THIS ISSUE IF YOU FEEL COMFORTABLE SHARING IT.

What did you expect to happen?

Change the model manually or in Auto model selector

Model usage │ │ │ │ Flash ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬ 100% Resets: 5:27 PM (19h 11m)
│ Flash Lite ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬ 7% Resets: 5:27 PM (19h 11m)
│ Pro ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬ 52% Resets: 10:25 AM (12h 8m)

and I didnt used model 2.5 but still get same issue

Client information

  • CLI Version: 0.41.1
  • Git Commit: e1a54496d
  • Session ID: 671de020-6505-484c-8323-c3ce94d1c313
  • Operating System: win32 v23.10.0
  • Sandbox Environment: no sandbox
  • Model Version: auto-gemini-2.5
  • Auth Type: oauth-personal
  • Memory Usage: 252.0 MB
  • Terminal Name: xterm.js(6.1.0-beta.197)
  • Terminal Background: #191a1b
  • Kitty Keyboard Protocol: Supported
  • IDE Client: VS Code

Login information

google oAuth

Anything else we need to know?

No response

extent analysis

TL;DR

The issue is likely due to exceeding the quota limit for the model, and the fix is to wait for the quota to reset or switch to a different model.

Guidance

  • The error message indicates a QUOTA_EXHAUSTED reason with a retry delay, suggesting that the model's capacity has been exceeded.
  • The model usage table shows that the Flash model is at 100% capacity, which may be causing the issue.
  • To mitigate this, you can try switching to a different model, such as Flash Lite or Pro, which have lower usage percentages.
  • Alternatively, you can wait for the quota to reset, which is scheduled to happen in approximately 19 hours.

Example

No code snippet is provided as it is not necessary for this issue.

Notes

The issue may be specific to the auto-gemini-2.5 model version, and switching to a different model or version may resolve the issue.

Recommendation

Apply workaround: Switch to a different model, such as Flash Lite or Pro, to avoid exceeding the quota limit. This is a temporary solution until the quota resets or a more permanent fix is found.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING