hermes - 💡(How to fix) Fix provider: nous falls back to 32,768-token context, blocking boot with model.context_length workaround required

hermes2026-05-11 20:31:42

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

A model configured under provider: nous (e.g. moonshotai/kimi-k2.6) cannot boot Hermes Agent because agent.model_metadata.get_model_context_length() returns the hardcoded 32,768 fallback, which is below the 64,000-token minimum enforced at run_agent.py:2254.

Error Message

At minimum, surface a clearer error: distinguish "model's real context is below 64k" from "metadata not found for provider, falling back to 32,768" — the current message implies the model has a 32k window, which is wrong and misleads users into switching models unnecessarily.

#12440 — same error string, but root-caused to delegate_task ignoring model config; PR #12503.

Root Cause

agent/model_metadata.py::get_model_context_length() reads ~/.hermes/models_dev_cache.json. The cache has no nous provider entries at top level. Lookups for provider: nous therefore miss and the function returns its hardcoded 32_768 default.

Per the providers doc, Nous Portal is documented as suffix-matching Nous model IDs against OpenRouter metadata — i.e. there is no standalone nous provider in models.dev; it piggybacks on OpenRouter. But when the user writes provider: nous literally in config, that suffix-match path isn't exercised by the context-length lookup, and the lookup falls through to the 32k default.

The 32k figure is not a real upstream API cap — moonshotai/kimi-k2.6 is 262,144 tokens per Moonshot's spec, and OpenRouter's cache correctly carries limit.context: 262144 for the same model.

Fix Action

Workaround

Add model.context_length to config.yaml:

model:
  default: moonshotai/kimi-k2.6
  provider: nous
  base_url: https://inference-api.nousresearch.com/v1
  context_length: 262144

The override path at run_agent.py:2220 (config_context_length=_config_context_length) feeds this directly into get_model_context_length(), bypassing the missing-metadata fallback.

Code Example

model:
  default: moonshotai/kimi-k2.6
  provider: nous
  base_url: https://inference-api.nousresearch.com/v1

---

Model moonshotai/kimi-k2.6 has a context window of 32,768 tokens, which is below the minimum 64,000 required by Hermes Agent.  Choose a model with at least 64K context, or set model.context_length in config.yaml to override.

---

model:
  default: moonshotai/kimi-k2.6
  provider: nous
  base_url: https://inference-api.nousresearch.com/v1
  context_length: 262144

RAW_BUFFERClick to expand / collapse

Summary

Repro

~/.hermes/config.yaml:

model:
  default: moonshotai/kimi-k2.6
  provider: nous
  base_url: https://inference-api.nousresearch.com/v1

Result on startup:

Model moonshotai/kimi-k2.6 has a context window of 32,768 tokens, which is below the minimum 64,000 required by Hermes Agent.  Choose a model with at least 64K context, or set model.context_length in config.yaml to override.

Root cause

Workaround

Add model.context_length to config.yaml:

model:
  default: moonshotai/kimi-k2.6
  provider: nous
  base_url: https://inference-api.nousresearch.com/v1
  context_length: 262144

The override path at run_agent.py:2220 (config_context_length=_config_context_length) feeds this directly into get_model_context_length(), bypassing the missing-metadata fallback.

Suggested fix (one or more)

When provider: nous, have get_model_context_length() apply the same OpenRouter suffix-match fallback the wizard/providers doc describes — so the context lookup matches the model-resolution path.
Or: populate models_dev_cache.json with a nous provider section that mirrors OpenRouter's Moonshot/Kimi entries.
At minimum, surface a clearer error: distinguish "model's real context is below 64k" from "metadata not found for provider, falling back to 32,768" — the current message implies the model has a 32k window, which is wrong and misleads users into switching models unnecessarily.

#5173 — same pattern (cache returns bogus 32k for a model that should be much bigger), different provider; PR #5179.
#12440 — same error string, but root-caused to delegate_task ignoring model config; PR #12503.
#31 — Nous Portal not first-class in the setup wizard; relates to the missing-provider gap.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #autograd error #model save/load #optimization #mixed precision

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

hermes - 💡(How to fix) Fix provider: nous falls back to 32,768-token context, blocking boot with model.context_length workaround required

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Workaround

Code Example

Summary

Repro

Root cause

Workaround

Suggested fix (one or more)

Related

Still need to ship something?

TRENDING

hermes - 💡(How to fix) Fix provider: nous falls back to 32,768-token context, blocking boot with model.context_length workaround required

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Workaround

Code Example

Summary

Repro

Root cause

Workaround

Suggested fix (one or more)

Related

Still need to ship something?

RELATED_DISCOVERY

TRENDING