hermes - ✅(Solved) Fix [Bug]: 使用telegram和微信会话，会话着会自动自动中断，往往一个问答还没结束就中断了，需要很久才能恢复会话，但是也会再次中断 [1 pull requests, 1 participants]

hermes2026-05-19 03:47:22

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

NousResearch/hermes-agent#28417•Fetched 2026-05-20 04:03:51

View on GitHub

Comments

Participants

Timeline

Reactions

Author

SongweiCao

Participants

SongweiCao

Timeline (top)

labeled ×4cross-referenced ×1

Error Message

Additional Logs / Traceback (optional)

Root Cause

Root Cause Analysis (optional)

Fix Action

Fixed

Fixed by PR: fix(gateway): extend follow-up grace to WeChat/WeCom + bump default to 5s (#28417) (https://github.com/NousResearch/hermes-agent/pull/28723)

PR fix notes

PR #28723: fix(gateway): extend follow-up grace to WeChat/WeCom + bump default to 5s (#28417)

Repository: NousResearch/hermes-agent
Author: xxxigm
State: open | merged: False
Link: https://github.com/NousResearch/hermes-agent/pull/28723

Description (problem / solution / changelog)

What does this PR do?

#28417 reports that on Telegram and WeChat (微信), a single Q&A "自动自动中断" — auto-interrupts before the agent has finished its reply, takes a long time to recover, then breaks again on retry.

Root-cause walk-through (see tests/gateway/test_messaging_followup_grace.py for the contract pinned by this PR):

The runner's default busy_input_mode is "interrupt" (gateway/run.py:1411). Any second user text that arrives while an agent task is running for that session_key fires interrupt_event.set(), aborts the run, and the partial reply is suppressed via the stale-response path in gateway/platforms/base.py:3143-3156.
To stop normal user behavior (typo correction, IME segmentation, voice-to-text fixups) from tripping that interrupt, there's a 3-second "follow-up grace window" in _handle_message that merges the second message into the pending buffer instead.
But that grace was Telegram-only (source.platform == Platform.TELEGRAM). WeChat had no grace at all — every burst follow-up interrupted the run. That's the exact asymmetry that made #28417 hit WeChat just as hard as Telegram.
The 3-second default is also too short for normal mobile bursts: Chinese IMEs and voice-to-text often emit a second short text 2–4s after the first.

Fix (smallest defensible change):

Generalize the grace window to a _FOLLOWUP_GRACE_PLATFORMS frozenset covering Telegram, Weixin, WeCom, WeCom callback — the four platforms in scope for the burst-follow-up pattern that triggered this bug. Slack/Discord/WhatsApp/Feishu intentionally stay on the legacy path; future widenings should be an explicit edit of the set, not a drive-by.
New env var HERMES_GATEWAY_FOLLOWUP_GRACE_SECONDS is the canonical knob. The legacy HERMES_TELEGRAM_FOLLOWUP_GRACE_SECONDS is still honoured as a fallback so existing deployments that tuned it don't see their value reset.
Default bumped from 3.0 → 5.0 seconds. Still well below "the user typed a fresh question" territory, but wide enough to swallow IME / voice-to-text bursts.
Invalid env values fail open to the default instead of crashing request handling; negative values clamp to zero.
Debug log now names the actual platform instead of always saying "Telegram", which had hidden the WeChat case in production logs.

The fix touches one branch in _handle_message plus two new static helpers — the rest of the busy-session machinery is unchanged.

Related Issue

Fixes #28417

Type of Change

🐛 Bug fix (non-breaking change that fixes an issue)
✅ Tests (adding or improving test coverage)
📝 Documentation update

Changes Made

gateway/run.py —
- Replace the inlined Telegram-only branch in _handle_message with a call to two new static helpers: _platform_has_followup_grace() (checks the _FOLLOWUP_GRACE_PLATFORMS frozenset of TELEGRAM / WEIXIN / WECOM / WECOM_CALLBACK) and _messaging_followup_grace_seconds() (resolves env vars, fails open on bad input, clamps negative values to zero).
- Update the debug log to include the actual platform name.
- 75 added / 7 removed lines.
tests/gateway/test_messaging_followup_grace.py — new 267-line test module with 20 tests over four axes:
- Platform coverage — Telegram + Weixin + WeCom + WeCom callback all queue-without-interrupt within the window.
- Negative coverage — Slack stays on the legacy path so this PR doesn't accidentally widen scope.
- Timing semantics — outside the window, the grace branch defers to normal busy-session handling; 0 disables the window entirely.
- Env var resolution — generalised wins over legacy, legacy still honoured, default 5.0, malformed values fall back, negative values clamp.
- Parametrised membership test over 8 platforms so future additions are an explicit fail-the-test diff.
website/docs/reference/environment-variables.md —
- Add a row for HERMES_GATEWAY_FOLLOWUP_GRACE_SECONDS linking to #28417.
- Fix the existing HERMES_TELEGRAM_FOLLOWUP_GRACE_SECONDS description, which incorrectly said the knob delayed outbound messages "to avoid racing the last stream chunk" — it actually controls the inbound follow-up merge window.

How to Test

# New regression suite.
scripts/run_tests.sh tests/gateway/test_messaging_followup_grace.py -q
# Expected: 20 passed in ~1s.

# Adjacent suites that exercise the surrounding busy-session paths.
scripts/run_tests.sh \
  tests/gateway/test_telegram_photo_interrupts.py \
  tests/gateway/test_active_session_text_merge.py \
  tests/gateway/test_busy_session_ack.py \
  tests/gateway/test_command_bypass_active_session.py \
  -q
# Expected: 58 passed (no regressions).

Manual reproduction (Telegram or WeChat):

Start a long-running agent turn (e.g. ask for a multi-paragraph summary).
Within 3–5s of starting, send a short follow-up ("oops, in English please").
Before this PR (WeChat): the run is interrupted, the partial answer is suppressed, the bot starts over from scratch with the merged text — exactly the symptom in #28417.
After this PR: the follow-up is merged into pending and the in-flight run completes uninterrupted; the merged context is consumed on the next turn.

Checklist

Code

My commit messages follow Conventional Commits (fix(gateway):, test(gateway):, docs(env):)
I searched for existing PRs — no duplicate
My PR contains only changes related to this fix
I've run scripts/run_tests.sh tests/gateway/test_messaging_followup_grace.py -q (20 passed)
I've added regression tests covering the asymmetry that caused the bug
I've tested on my platform: macOS 15.x (darwin 24.6.0)

Documentation & Housekeeping

I've updated relevant documentation (website/docs/reference/environment-variables.md — added the new knob, fixed the misleading existing description)
N/A — no config.yaml schema changes (env-only knob, with documented config.yaml route via display.busy_input_mode already present)
N/A — no architecture or workflow change requiring CONTRIBUTING.md / AGENTS.md updates
I've considered cross-platform impact — pure Python in a _handle_message branch, no platform-specific syscalls. The fix runs identically on Linux/macOS/Windows.
N/A — no tool description/schema changes

Changed files

gateway/run.py (modified, +75/-7)
tests/gateway/test_messaging_followup_grace.py (added, +267/-0)
website/docs/reference/environment-variables.md (modified, +2/-1)

Code Example

这个报告还没申请

---

RAW_BUFFERClick to expand / collapse

Bug Description

使用telegram和微信会话，会话着会自动自动中断，往往一个问答还没结束就中断了，需要很久才能恢复会话，但是也会再次中断

Steps to Reproduce

就是普通会话就行

Expected Behavior

最起码能够流畅会话吧

Actual Behavior

这个我暂时不清楚

Affected Component

Gateway (Telegram/Discord/Slack/WhatsApp), Skills (skill loading, skill hub, skill guard), Agent Core (conversation loop, context compression, memory), Configuration (config.yaml, .env, hermes setup), Setup / Installation

Messaging Platform (if gateway-related)

Debug Report

这个报告还没申请

Operating System

macOS 26.5

Python Version

3.11.15

Hermes Version

0.14.0

Additional Logs / Traceback (optional)

Root Cause Analysis (optional)

No response

Proposed Fix (optional)

No response

Are you willing to submit a PR for this?

I'd like to fix this myself and submit a PR

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#installation #parallel task #integration issue #index setup #retrieval issue

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

hermes - ✅(Solved) Fix [Bug]: 使用telegram和微信会话，会话着会自动自动中断，往往一个问答还没结束就中断了，需要很久才能恢复会话，但是也会再次中断 [1 pull requests, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Additional Logs / Traceback (optional)

Root Cause

Root Cause Analysis (optional)

Fix Action

Fixed

PR fix notes

PR #28723: fix(gateway): extend follow-up grace to WeChat/WeCom + bump default to 5s (#28417)

Description (problem / solution / changelog)

What does this PR do?

Related Issue

Type of Change

Changes Made

How to Test

Checklist

Code

Documentation & Housekeeping

Changed files

Code Example

Bug Description

Steps to Reproduce

Expected Behavior

Actual Behavior

Affected Component

Messaging Platform (if gateway-related)

Debug Report

Operating System

Python Version

Hermes Version

Additional Logs / Traceback (optional)

Root Cause Analysis (optional)

Proposed Fix (optional)

Are you willing to submit a PR for this?

Still need to ship something?

RELATED_DISCOVERY

TRENDING