hermes - 💡(How to fix) Fix Auxiliary tasks silently fall back to paid OpenRouter models, bypassing user's free-only configuration [1 pull requests]

hermes2026-05-11 22:03:08

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

Users who explicitly configure only OpenRouter :free models in fallback_providers still get billed (or blocked by key monthly limits) because auxiliary tasks (title_generation, compression, vision, etc.) silently fall back to a hardcoded paid model (google/gemini-3-flash-preview) when their primary provider fails.

Error Message

INFO agent.auxiliary_client: Auxiliary title_generation: connection error on auto WARNING agent.title_generator: Title generation failed: Error code: 403 - {'error': {'message': 'Key limit exceeded (monthly limit)...'}}

Root Cause

The auxiliary fallback chain is independent of the user's fallback_providers config and ignores :free model variant constraints. The default model per provider is a hardcoded constant (_OPENROUTER_MODEL, _NOUS_MODEL), not derived from the user's configured models.

Fix Action

Fixed

Fixed by PR: fix(agent): use fallback_providers model for OpenRouter auxiliary tasks (https://github.com/NousResearch/hermes-agent/pull/24036)

Code Example

model:
  default: moonshotai/kimi-k2.6
  provider: nvidia

fallback_providers:
  - provider: openrouter
    model: inclusionai/ring-2.6-1t:free   # explicitly :free
  # ...all :free models

auxiliary:
  title_generation:
    provider: auto                          # default

---

INFO agent.auxiliary_client: Auxiliary title_generation: connection error on auto
INFO agent.auxiliary_client: Auxiliary title_generation: ... falling back to openrouter (google/gemini-3-flash-preview)
WARNING agent.title_generator: Title generation failed: Error code: 403 - {'error': {'message': 'Key limit exceeded (monthly limit)...'}}

RAW_BUFFERClick to expand / collapse

Summary

Reproduce

~/.hermes/config.yaml:

model:
  default: moonshotai/kimi-k2.6
  provider: nvidia

fallback_providers:
  - provider: openrouter
    model: inclusionai/ring-2.6-1t:free   # explicitly :free
  # ...all :free models

auxiliary:
  title_generation:
    provider: auto                          # default

OPENROUTER_API_KEY is set (used by fallback_providers).

What Happens

NVIDIA primary times out
_resolve_auto() enters Step 2 (_try_payment_fallback)
Reaches OpenRouter, uses hardcoded _OPENROUTER_MODEL = \"google/gemini-3-flash-preview\" (agent/auxiliary_client.py:391) — a PAID model
Hits user's per-key monthly limit → HTTP 403 Key limit exceeded (monthly limit)

Logs:

INFO agent.auxiliary_client: Auxiliary title_generation: connection error on auto
INFO agent.auxiliary_client: Auxiliary title_generation: ... falling back to openrouter (google/gemini-3-flash-preview)
WARNING agent.title_generator: Title generation failed: Error code: 403 - {'error': {'message': 'Key limit exceeded (monthly limit)...'}}

Root Cause

Expected Behavior

When the user's fallback_providers list contains only :free models for OpenRouter (or has documented their free-only intent), auxiliary tasks should:

Use those same :free models, OR
Refuse to fall back to OpenRouter at all (current behavior of explicit provider: nvidia is the correct shape), OR
At minimum, log a clear warning that a paid model is being used

Suggested Fix

Option A (least invasive): when picking the aux fallback model for a provider, prefer the user's fallback_providers[provider].model if any entry exists for that provider, before falling back to the hardcoded constant.

Option B: add a top-level auxiliary.free_only: true flag that filters out paid defaults across all aux tasks.

Option C: document this trap prominently in auxiliary.title_generation.provider and recommend explicit provider: <main_provider> for free-tier users.

Environment

Hermes Agent v0.13.0 (2026.5.7)
macOS / Python 3.11
Primary: NVIDIA NIM (free)
Fallback: OpenRouter (`:free` only) with $10 credit but $0/low per-key monthly limit

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #cache issue #memory leak #API versioning #request timeout

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

hermes - 💡(How to fix) Fix Auxiliary tasks silently fall back to paid OpenRouter models, bypassing user's free-only configuration [1 pull requests]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Fixed

Code Example

Summary

Reproduce

What Happens

Root Cause

Expected Behavior

Suggested Fix

Environment

Still need to ship something?

TRENDING

hermes - 💡(How to fix) Fix Auxiliary tasks silently fall back to paid OpenRouter models, bypassing user's free-only configuration [1 pull requests]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Fixed

Code Example

Summary

Reproduce

What Happens

Root Cause

Expected Behavior

Suggested Fix

Environment

Still need to ship something?

RELATED_DISCOVERY

TRENDING