hermes - ✅(Solved) Fix Add per-channel model override (`channel_models`) — parallel to `channel_prompts` [1 pull requests, 3 comments, 2 participants]

hermes2026-04-30 08:31:09

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

NousResearch/hermes-agent#17834•Fetched 2026-05-01 05:55:37

View on GitHub

Comments

Participants

Timeline

Reactions

Author

PaulBlackSwan

Participants

alt-glitch

PaulBlackSwan

Timeline (top)

labeled ×4commented ×3closed ×1cross-referenced ×1

Fix Action

Fix / Workaround

Today there's no built-in way to express "this channel/topic uses model X" in config. Workarounds (multiple Hermes profiles per channel set, skills with model directives, smart routing) are either heavy (separate process trees and DBs), brittle, or content-based rather than channel-based.

PR fix notes

PR #1991: feat: per-channel model and system prompt overrides for gateway platforms (Fixes #1955)

Repository: NousResearch/hermes-agent
Author: crazywriter1
State: open | merged: False
Link: https://github.com/NousResearch/hermes-agent/pull/1991

Description (problem / solution / changelog)

#1955

Problem

The gateway uses a single global model and system prompt for all channels. Different channels (e.g. Discord #daily vs #dev) cannot use different models or personas without running separate bot instances.

Solution

Config: Add channel_overrides under each platform in gateway config. Keys are channel IDs; values are model, optional provider, and optional system_prompt.
Resolution: When dispatching a message, resolve model and system prompt in order: (1) /model / session still applies; (2) channel_overrides[channel_id] for this platform; (3) global model.default / agent.system_prompt.
Runtime: If a channel override specifies provider, credentials are resolved for that provider via resolve_runtime_provider(override.provider).

Changes

gateway/config.py: Added ChannelOverride dataclass (model, provider, system_prompt). Extended PlatformConfig with channel_overrides: Dict[str, ChannelOverride] and roundtrip in from_dict / to_dict (channel IDs normalized to str).
gateway/run.py: Added _get_channel_override(config, platform, chat_id). Added _resolve_runtime_agent_kwargs_for_provider(provider) for channel-specific provider. Added GatewayRunner._resolve_model_for_channel(platform, chat_id) and _get_system_prompt_for_channel(platform, chat_id). Main agent dispatch uses these when building model, runtime_kwargs, and combined_ephemeral system prompt.
tests/gateway/test_config.py: Tests for ChannelOverride and PlatformConfig.channel_overrides roundtrip and numeric channel ID normalization.
tests/gateway/test_channel_overrides.py: Tests for _get_channel_override, _resolve_model_for_channel, and _get_system_prompt_for_channel.

Testing

tests/gateway/test_config.py
- TestChannelOverride: from_dict empty, to_dict omits None.
- TestPlatformConfigRoundtrip: test_channel_overrides_roundtrip, test_channel_overrides_from_dict_normalizes_channel_id_to_str.
tests/gateway/test_channel_overrides.py (new)
- TestGetChannelOverride: no override when empty config / platform not configured / channel not in overrides; returns override when channel matches; int-like chat_id normalized to str.
- TestResolveModelForChannel: uses channel override when present; falls back to global when no override.
- TestGetSystemPromptForChannel: uses channel override when present; falls back to global when no override.
Bare-runner safety: _resolve_model_for_channel, _get_system_prompt_for_channel, and run_sync use getattr(self, "config", None) so tests that create GatewayRunner with object.__new__(GatewayRunner) (no config) do not raise AttributeError.
Suite hygiene: tests/conftest.py — asyncio DeprecationWarning filter and event loop fixture using get_running_loop/new_event_loop; test_slack.py default users_info mock and module-level RuntimeWarning filter; test_whatsapp_connect.py filter on TestFileHandleClosedOnError. Gateway test run: 1215 passed, 21 skipped, 0 warnings.

Example config

platforms:
  discord:
    enabled: true
    channel_overrides:
      "1234567890":
        model: openrouter/healer-alpha
        provider: openrouter
        system_prompt: "You are a daily news summarizer."
      "9876543210":
        model: anthropic/claude-opus-4.6
        provider: anthropic
        system_prompt: "You are a coding assistant."

## Changed files

- `gateway/config.py` (modified, +53/-3)
- `gateway/run.py` (modified, +66/-7)
- `tests/conftest.py` (modified, +14/-3)
- `tests/gateway/test_channel_overrides.py` (added, +128/-0)
- `tests/gateway/test_config.py` (modified, +50/-0)
- `tests/gateway/test_slack.py` (modified, +7/-1)
- `tests/gateway/test_whatsapp_connect.py` (modified, +1/-0)
- `tools/delegate_tool.py` (modified, +5/-0)

Code Example

platforms:
  telegram:
    channel_models:
      "-1009999999999:101": anthropic/claude-opus-4.7    # cron-watchdog topic — deep reasoning
      "-1009999999999:102": ollama/gemma4-heretic         # daily-chatter topic — local
      "-1001234567890":     openai/gpt-5.5                # default for whole chat (no thread match)

  discord:
    channel_models:
      "1234567890": anthropic/claude-opus-4.7
      "9876543210": ollama/gemma4-heretic

  slack:
    channel_models:
      C0123456789: anthropic/claude-opus-4.7

---

if "channel_models" in platform_cfg:
    channel_models = platform_cfg["channel_models"]
    if isinstance(channel_models, dict):
        bridged["channel_models"] = {str(k): str(v).strip() for k, v in channel_models.items() if v}
    # silently ignore non-dict; misconfig surfaces in logs at resolution time

---

def resolve_channel_model(
    config_extra: dict,
    channel_id: str,
    parent_id: str | None = None,
) -> str | None:
    """Resolve a per-channel model override from platform config.

    Looks up ``channel_models`` in the adapter's ``config.extra`` dict.
    Prefers an exact match on *channel_id*; falls back to *parent_id*
    (useful for forum threads / topics inheriting a parent channel's model).

    Returns the model identifier string, or None if no match is found.
    """
    models = config_extra.get("channel_models") or {}
    if not isinstance(models, dict):
        return None
    for key in (channel_id, parent_id):
        if not key:
            continue
        model = models.get(key)
        if model is None:
            continue
        model = str(model).strip()
        if model:
            return model
    return None

---

# Per-channel ephemeral model override (e.g. Telegram channel_models).
# Applied at AIAgent construction time, never persisted to session model state.
channel_model: Optional[str] = None

---

def _resolve_channel_model(self, channel_id: str, parent_id: str | None = None) -> str | None:
    return resolve_channel_model(self.config.extra, channel_id, parent_id)

# in MessageEvent construction:
channel_model=self._resolve_channel_model(channel_id, parent_id or None),

---

model = _resolve_gateway_model(user_config)
if event.channel_model:                      # NEW
    model = event.channel_model              # NEW
override = self._session_model_overrides.get(resolved_session_key) if resolved_session_key else None
if override:
    ...  # existing session override logic still wins (explicit user intent)

RAW_BUFFERClick to expand / collapse

Use case

The gateway today resolves model in this order: per-session runtime override (set via /model or workspace UI, scoped to one session) → model: in config.yaml. For users running Hermes against multiple Telegram/Discord topics or Slack channels with very different needs, the global default forces a compromise and /model doesn't carry over to the next thread.

Concrete example from a current Hermes user (Telegram with topics):

#dev / #code-review / #architecture topics → want Claude Opus 4.7 for deep reasoning, willing to pay token costs.
#daily / #personal / #journal topics → want a local Ollama model (e.g. gemma4-heretic on the user's Mac mini) for low-stakes chatter, zero token cost, lower latency.
Switching globally each time you change topics is impractical and using the workspace model picker doesn't persist across new threads.

Existing precedent

Hermes already supports per-channel ephemeral system prompts via channel_prompts (Discord, Slack — docs). The mechanism is clean:

Config maps channel_id → prompt string
gateway/config.py bridges it from platforms.<plat>.channel_prompts into adapter config.extra
gateway/platforms/base.py::resolve_channel_prompt() looks up exact channel ID first, then falls back to parent (forum / thread → parent channel)
Adapter calls the resolver when constructing MessageEvent, which carries channel_prompt to the agent run

The same pattern extended to channel_models would solve the use case without inventing new abstractions.

Proposed config

platforms:
  telegram:
    channel_models:
      "-1009999999999:101": anthropic/claude-opus-4.7    # cron-watchdog topic — deep reasoning
      "-1009999999999:102": ollama/gemma4-heretic         # daily-chatter topic — local
      "-1001234567890":     openai/gpt-5.5                # default for whole chat (no thread match)

  discord:
    channel_models:
      "1234567890": anthropic/claude-opus-4.7
      "9876543210": ollama/gemma4-heretic

  slack:
    channel_models:
      C0123456789: anthropic/claude-opus-4.7

Lookup mirrors channel_prompts:

Exact chat_id:thread_id match (if message is in a thread/topic)
Fall back to chat_id (parent chat / channel)
Fall back to model: from top-level config

Proposed code (sketch — full PR can follow if maintainers want)

1. `gateway/config.py` — bridge config (mirror of `channel_prompts` block at L710)

if "channel_models" in platform_cfg:
    channel_models = platform_cfg["channel_models"]
    if isinstance(channel_models, dict):
        bridged["channel_models"] = {str(k): str(v).strip() for k, v in channel_models.items() if v}
    # silently ignore non-dict; misconfig surfaces in logs at resolution time

2. `gateway/platforms/base.py` — resolver helper (mirror of `resolve_channel_prompt` at L1036)

def resolve_channel_model(
    config_extra: dict,
    channel_id: str,
    parent_id: str | None = None,
) -> str | None:
    """Resolve a per-channel model override from platform config.

    Looks up ``channel_models`` in the adapter's ``config.extra`` dict.
    Prefers an exact match on *channel_id*; falls back to *parent_id*
    (useful for forum threads / topics inheriting a parent channel's model).

    Returns the model identifier string, or None if no match is found.
    """
    models = config_extra.get("channel_models") or {}
    if not isinstance(models, dict):
        return None
    for key in (channel_id, parent_id):
        if not key:
            continue
        model = models.get(key)
        if model is None:
            continue
        model = str(model).strip()
        if model:
            return model
    return None

3. `MessageEvent` field (parallel to `channel_prompt`, ~L872)

# Per-channel ephemeral model override (e.g. Telegram channel_models).
# Applied at AIAgent construction time, never persisted to session model state.
channel_model: Optional[str] = None

4. Per-platform wiring — Discord example (mirror of `_resolve_channel_prompt` at L2700)

def _resolve_channel_model(self, channel_id: str, parent_id: str | None = None) -> str | None:
    return resolve_channel_model(self.config.extra, channel_id, parent_id)

# in MessageEvent construction:
channel_model=self._resolve_channel_model(channel_id, parent_id or None),

Same wiring in telegram.py (using chat_id + message_thread_id) and slack.py (using channel).

5. `gateway/run.py` — apply override in model resolution

The exact insertion point is wherever _resolve_gateway_model() is consumed before AIAgent(model=...). Sketch (precedence: session runtime override > channel_model > config default):

model = _resolve_gateway_model(user_config)
if event.channel_model:                      # NEW
    model = event.channel_model              # NEW
override = self._session_model_overrides.get(resolved_session_key) if resolved_session_key else None
if override:
    ...  # existing session override logic still wins (explicit user intent)

Threading channel_model through to all AIAgent(...) call sites mirrors how channel_prompt is already threaded.

Precedence (proposed)

Most specific → least specific:

Session-level model override — user explicitly switched model mid-conversation (workspace UI / /model command). Sticky for the session.
channel_models[channel_id:thread_id] — exact topic match.
channel_models[channel_id] — parent fallback.
config.yaml::model — gateway default.
fallback_providers — on auth failure / 429 / etc.

Rationale: the session override is an explicit runtime intent ("for THIS conversation, use X") and should not be silently overridden by config the next turn. channel_models is the new layer for "the default starting point for any new conversation in this channel".

Open design questions

Provider/runtime resolution: a channel_models entry is a model identifier string (anthropic/claude-opus-4.7). The provider/api_key/base_url are still resolved through the existing runtime_provider.resolve_runtime_provider() flow. Should channel_models accept a richer object form ({model: ..., provider: ..., api_key_env: ...}) for cases where the channel needs a non-default provider? My instinct: start with string-only, add object form later if requested.
Reasoning config / toolset overrides: same question — should channel_models also take reasoning_config and enabled_toolsets? Probably out of scope for this issue, but worth mentioning. Could be future channel_overrides: block.
Workspace UI surface: would be nice to expose this in the workspace's Smart Routing UI, but that's outsourc-e/hermes-workspace territory — separate concern.

Alternatives considered

Profiles (one Hermes profile per channel set): heavy, separate state DBs, per-channel session continuity lost.
Skills with model directives: only works if the skill is invoked, not for free-form chat.
Smart Routing (workspace-side content matching): channel-agnostic, harder to predict.
A separate chat_routes config layer: more flexible but reinvents the channel_prompts pattern. Reusing the established pattern keeps cognitive load low.

Backward compatibility

New optional field; absent config = identical behavior to today.
Schema change is additive in gateway/config.py.
No DB migration.

I'd be happy to send a PR if maintainers signal interest in this direction. Wanted to align on shape (string-vs-object, precedence) before opening one.

extent analysis

TL;DR

To address the issue of per-channel model configuration, implement a channel_models feature that allows specifying models per channel, mirroring the existing channel_prompts mechanism.

Guidance

Extend the config.yaml structure: Add a channel_models section under each platform (e.g., Telegram, Discord, Slack) to specify models per channel.
Implement resolve_channel_model: Create a function similar to resolve_channel_prompt to look up the model for a given channel, following the same fallback logic (exact match, parent channel, then default).
Update MessageEvent: Add a channel_model field to store the resolved model for the channel, which will be used in model resolution.
Apply the channel model override: Modify the model resolution logic in gateway/run.py to use the channel_model from the MessageEvent if present.

Example

The proposed config.yaml structure:

platforms:
  telegram:
    channel_models:
      "-1009999999999:101": anthropic/claude-opus-4.7
      "-1009999999999:102": ollama/gemma4-heretic

And the resolve_channel_model function:

def resolve_channel_model(
    config_extra: dict,
    channel_id: str,
    parent_id: str | None = None,
) -> str | None:
    # ...

Notes

This solution builds upon the existing channel_prompts mechanism, keeping the cognitive load low and avoiding the introduction of new abstractions. However, open design questions regarding provider/runtime resolution and reasoning config/toolset overrides should be addressed in future iterations.

Recommendation

Apply the proposed workaround by implementing the channel_models feature, as it provides a flexible and scalable solution for per-channel model configuration without introducing significant changes to the existing

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #installation #tensor shape #autograd error #model save/load

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.