openclaw - 💡(How to fix) Fix Embedded agent retry on 429 rate limit does not follow fallback chain

openclaw2026-05-24 13:53:30

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

When the embedded agent receives a 429 (Too Many Requests) rate limit from a model provider, it retries the same rate-limited model instead of falling back to the next model in the configured fallback chain.

Root Cause

Fix Action

Fix / Workaround

Workaround: Remove duplicate/rate-limited models from config and restart gateway. This is not sustainable.

Code Example

config-audit.jsonl confirms fallback chain was correctly set and written:
- 2026-05-24T00:24:57Z: fallbacks = [qwen3.6-plus, gemini-flash, gemini-flash-2]
- 2026-05-24T00:30:00Z: fallbacks = [gemini-flash, gemini-flash-2, qwen3.6-plus]

Despite correct configuration, 429 retries did not traverse the fallback chain. Session file became corrupted and required manual deletion.

RAW_BUFFERClick to expand / collapse

Bug type

Behavior bug (incorrect output/state without crash)

Beta release blocker

Summary

Steps to reproduce

Configure agents.defaults.model.fallbacks with multiple models:
- primary: google-gemini/gemini-2.5-flash
- fallbacks: [google-gemini-2/gemini-2.5-flash, custom-ai-sumopod-com/qwen3.6-plus]
Send enough requests to hit Google Gemini FREE tier rate limit (15 RPM).
Observe that subsequent requests receive 429 Too Many Requests.
The embedded agent retries on the same rate-limited model (google-gemini/gemini-2.5-flash) repeatedly instead of attempting the next fallback model.

Expected behavior

When a 429 rate limit is received, the agent should recognize it as a provider-level rate limit and automatically proceed to the next model in the configured fallbacks chain, consistent with how OpenClaw handles other non-recoverable errors.

Actual behavior

The agent retries the same rate-limited model until all retry attempts are exhausted. Session becomes stuck and unresponsive despite valid fallback models being configured. Manual config edit and gateway restart were required to restore functionality. Confirmed by config-audit.jsonl showing correct fallback chain configuration on 2026-05-24.

OpenClaw version

2026.5.18

Operating system

Linux 6.6.117-45.1.oc9.x86_64 (OpenCloudOS, x64)

Install method

npm install (Node.js v22.22.1)

Model

google-gemini/gemini-2.5-flash

Provider / routing chain

Embedded agent → google-gemini provider → Gemini 2.5 Flash (FREE tier, 15 RPM limit) → Fallback chain: google-gemini-2/gemini-2.5-flash → custom-ai-sumopod-com/qwen3.6-plus

Additional provider/model setup details

Gateway mode: local
Config path: /root/.openclaw/openclaw.json
Telegram surface (direct chat)
Fallback chain was correctly configured via config.set command, confirmed by config-audit.jsonl entries at 2026-05-24T00:24:57Z and 2026-05-24T00:30:00Z
Session became stuck during Gemini rate limit window without attempting any fallback

Logs, screenshots, and evidence

config-audit.jsonl confirms fallback chain was correctly set and written:
- 2026-05-24T00:24:57Z: fallbacks = [qwen3.6-plus, gemini-flash, gemini-flash-2]
- 2026-05-24T00:30:00Z: fallbacks = [gemini-flash, gemini-flash-2, qwen3.6-plus]

Despite correct configuration, 429 retries did not traverse the fallback chain. Session file became corrupted and required manual deletion.

Impact and severity

Affected users: Any user with fallback chains using rate-limited providers (e.g. Gemini FREE tier 15 RPM)
Severity: Blocks workflow — session becomes unresponsive, requires manual intervention (config edit + gateway restart)
Frequency: Always — reproducible by hitting the rate limit on the primary provider
Consequence: Missed messages, failed automation (cron jobs), potential extra cost from failed retries

Additional information

Workaround: Remove duplicate/rate-limited models from config and restart gateway. This is not sustainable.

Reporter: Wahyu Maulana Ishaq (@wmaulanaaishq) Affiliation: Student, Universitas Alma Ata, Yogyakarta, Indonesia

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

FAQ

Expected behavior

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

openclaw - 💡(How to fix) Fix Embedded agent retry on 429 rate limit does not follow fallback chain

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Fix / Workaround

Code Example

Bug type

Beta release blocker

Summary

Steps to reproduce

Expected behavior

Actual behavior

OpenClaw version

Operating system

Install method

Model

Provider / routing chain

Additional provider/model setup details

Logs, screenshots, and evidence

Impact and severity

Additional information

FAQ

Expected behavior

Still need to ship something?

TRENDING