hermes - 💡(How to fix) Fix Gateway: prevent silent fallback on high-trust routes and expose effective model [1 comments, 1 participants]

hermes2026-05-10 15:20:57

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

NousResearch/hermes-agent#23265•Fetched 2026-05-11 03:30:19

View on GitHub

Comments

Participants

Timeline

Reactions

Author

wangrenzhu-ola

Participants

wangrenzhu-ola

Timeline (top)

labeled ×4closed ×1commented ×1

Error Message

For selected gateway routes/chats/platforms, if the configured provider/credential label is unavailable, fail closed with a clear error instead of silently using fallback.
A gateway route can be configured so that an unavailable primary credential produces a clear user-visible error and does not call fallback providers.

Root Cause

Problem

High-trust gateway channels can silently fall back from the configured primary model/provider to a weaker fallback provider when the primary credential pool is unavailable. In a real incident, a Weixin DM route configured for openai-codex / gpt-5.5 / gpt-pro-account-5 repeatedly fell back to kimi-coding / kimi-k2.6 because the gateway runtime saw credential pool: no available entries / No Codex credentials stored. The user only noticed after task quality degraded.

Code Example

Primary provider auth failed: No Codex credentials stored. Run `hermes auth` to authenticate. — trying fallback
Fallback provider resolved: kimi-coding model=kimi-k2.6
Runtime provider supplied explicit model override: gpt-5.5 -> kimi-k2.6

---

gateway_credential_routing:
       rules:
         - platform: weixin
           provider: openai-codex
           allow_labels: [gpt-pro-account-5]
           exclusive: true
           direct: true
           fallback_policy: fail_closed   # or strict: true / allow_fallback: false

RAW_BUFFERClick to expand / collapse

Problem

Relevant log pattern:

Primary provider auth failed: No Codex credentials stored. Run `hermes auth` to authenticate. — trying fallback
Fallback provider resolved: kimi-coding model=kimi-k2.6
Runtime provider supplied explicit model override: gpt-5.5 -> kimi-k2.6

This is dangerous for high-authority channels because fallback changes the effective model capability and behavior without an obvious runtime signal.

Current Behavior

model.default remains gpt-5.5 and gateway routing can be configured to target openai-codex with an allow-listed credential label.
When the gateway cannot load/use the primary provider credential pool, it falls back to the configured fallback provider.
The fallback can persist across messages/runs until gateway/auth state is refreshed.
Replies do not consistently include the actually used provider/model/credential label, so users cannot easily detect silent downgrade.

Expected Behavior

Add first-class controls for high-trust gateway routing:

Strict routing / no silent fallback

For selected gateway routes/chats/platforms, if the configured provider/credential label is unavailable, fail closed with a clear error instead of silently using fallback.

Example policy shape:

gateway_credential_routing:
  rules:
    - platform: weixin
      provider: openai-codex
      allow_labels: [gpt-pro-account-5]
      exclusive: true
      direct: true
      fallback_policy: fail_closed   # or strict: true / allow_fallback: false

Runtime model/provider watermark
- Optional per-platform/per-chat footer showing the actual runtime model/provider/credential label.
- Normal example: — gpt-5.5 · openai-codex · gpt-pro-account-5
- Fallback example: — ⚠ fallback: kimi-k2.6 · kimi-coding · codex unavailable
Explicit downgrade/restore events
- If fallback is allowed, the response should make the downgrade visible.
- When the primary provider/credential becomes available again, the gateway should prefer/restore the primary route on the next message and optionally show restored once.
Observability and audit logs
- Log selected runtime provider/model/credential label per gateway response.
- Log route policy decisions: strict fail-closed, fallback allowed, fallback reason, restored primary.

Acceptance Criteria

A gateway route can be configured so that an unavailable primary credential produces a clear user-visible error and does not call fallback providers.
A gateway route can optionally append a concise runtime model footer to responses.
If fallback is allowed, fallback responses are visibly marked as fallback.
Gateway logs include the effective provider/model/credential label for each response.
Tests cover:
- strict route + missing primary credential => no fallback call
- non-strict route + missing primary credential => fallback call with visible warning
- footer/watermark formatting for normal and fallback responses
- primary restoration after credential pool becomes available again, if restoration logic is implemented

Notes

This belongs in Hermes runtime/gateway rather than skills: the issue is provider selection, credential routing, fallback policy, and gateway response observability.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#dependency error #configuration error #environment variable #network issue #logging issue

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

hermes - 💡(How to fix) Fix Gateway: prevent silent fallback on high-trust routes and expose effective model [1 comments, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Problem

Code Example

Problem

Current Behavior

Expected Behavior

Acceptance Criteria

Notes

Still need to ship something?

TRENDING

hermes - 💡(How to fix) Fix Gateway: prevent silent fallback on high-trust routes and expose effective model [1 comments, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Problem

Code Example

Problem

Current Behavior

Expected Behavior

Acceptance Criteria

Notes

Still need to ship something?

RELATED_DISCOVERY

TRENDING