hermes - 💡(How to fix) Fix Bug: Bridge-worker silently drops assistant replies from state.db (streamed responses not in messages list)

hermes2026-05-24 02:52:33

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

When chatting through the TypeScript Hermes WebUI (Agent Bridge path), assistant (AI) replies are frequently missing from state.db (Hermes Agent's SQLite session store). The data IS correctly stored in the WebUI's own session-store (TypeScript SQLite), but the Hermes Agent's state.db only has the user message, not the corresponding assistant reply. This causes data inconsistency between the Python WebUI (8787) and TypeScript WebUI (5173).

Error Message

Root Cause

The conversation loop's streaming callback delivers the assistant response to the bridge/frontend without appending a structured assistant message dict to the AIAgent's internal messages list.

At the end of run_conversation() (in agent/conversation_loop.py):

# Line ~3960
agent._drop_trailing_empty_response_scaffolding(messages)
agent._persist_session(messages, conversation_history)  # Writes messages to state.db

The _drop_trailing_empty_response_scaffolding() removes flagged scaffolding messages from the tail. The resulting messages list only contains conversation_history + user_input (no assistant reply). Since the bridge's _align_prepersist_flush_cursor() set _last_flushed_db_idx = len(conversation_history) + 1, _flush_messages_to_session_db() finds flush_from = max(history_len, history_len+1) = len(messages) and writes nothing.

The assistant reply exists as the final_response string returned from run_conversation(), but not as a structured message dict in the messages list that _persist_session() writes.

Fix Action

Workaround

Use the Python WebUI (port 8787) or Hermes CLI for sessions where state.db data integrity is critical.

Code Example

session-store (dev):  491 messages, last at 10:11:43  ✅ assistant reply present
state.db:             819 messages, last at 10:11:34  ❌ assistant reply missing

---

[hermes-bridge-worker:default] [DBG-BUG2] NOTHING TO FLUSH! flush_from=490 msg_len=490 last_asst_role=assistant

---

PRAGMA wal_checkpoint(TRUNCATE)  → (0, 0, 0)  # SQLITE_OK, 0 pages

---

# Line ~3960
agent._drop_trailing_empty_response_scaffolding(messages)
agent._persist_session(messages, conversation_history)  # Writes messages to state.db

---

User sends message
  ↓
TypeScript backend (Socket.IO)
  ↓
handle-bridge-run.ts → addMessage() → session-store ✅ (immediate)
  ↓
hermes_bridge.py → _prepersist_user_message() → state.db ✅ (writes user msg)
  ↓
Worker process → AIAgent.run_conversation()
  ↓  (streams response to frontend via callback - user SEES it ✅)
  ↓  (but assistant NOT added to messages[] structured list ❌)
  ↓
conversation_loop.py:3961 → _persist_session(messages, ...)
  ↓
_flush_messages_to_session_db()
  ↓
flush_from=490, messages[490:] = [] → writes NOTHING to state.db ❌

---

agent._drop_trailing_empty_response_scaffolding(messages)
# Inject streaming response into messages for persistence
if final_response and isinstance(final_response, str) and final_response.strip():
    if not (messages and isinstance(messages[-1], dict) and 
            messages[-1].get("role") == "assistant" and 
            messages[-1].get("content") == final_response):
        messages.append({"role": "assistant", "content": final_response})
agent._persist_session(messages, conversation_history)

RAW_BUFFERClick to expand / collapse

Bug: Bridge-worker assistant replies silently missing from state.db

Summary

Severity

Environment

WebUI: TypeScript ~/hermes-web-ui/ (Vite dev server on port 8648, Socket.IO chat-run)
Agent Bridge: hermes_bridge.py running as a child process managed by agent-bridge/manager.ts
Hermes Agent: NousResearch/hermes-agent (via systemd gateway service + bridge worker)
Session: mph7v2r9tdeata — a single session spanning ~900+ messages

Evidence

1. Database state comparison

After sending a message, the assistant reply exists in session-store but NOT in state.db:

session-store (dev):  491 messages, last at 10:11:43  ✅ assistant reply present
state.db:             819 messages, last at 10:11:34  ❌ assistant reply missing

2. Debug probe confirms nothing to flush

Added diagnostic probe at conversation_loop.py:3961 (after _persist_session()). Output from worker stderr:

[hermes-bridge-worker:default] [DBG-BUG2] NOTHING TO FLUSH! flush_from=490 msg_len=490 last_asst_role=assistant

This confirms:

The messages list has len=490 (including the assistant reply at index 489)
_last_flushed_db_idx=490 (set by bridge's _align_prepersist_flush_cursor)
flush_from = max(history_len, _last_flushed_db_idx) = 490 ≥ len(messages) = 490
_flush_messages_to_session_db() writes messages[490:] which is an empty list

3. No exceptions thrown at any catch point

Four except blocks were instrumented with [DBG-BUG] probes — none fired:

Location	Probe	Result
`run_agent.py:1296` (append_message exception)	`[DBG-BUG]`	❌ Not triggered
`hermes_bridge.py:1148` (prepersist exception)	`[DBG-BUG]`	❌ Not triggered
`hermes_bridge.py:1154` (align cursor exception)	`[DBG-BUG]`	❌ Not triggered
`hermes_bridge.py:1212` (sync tail exception)	`[DBG-BUG]`	❌ Not triggered
`run_agent.py:1247` (_session_db is None)	`[DBG-BUG3]`	❌ Not triggered

4. WAL checkpoint confirms data never written

PRAGMA wal_checkpoint(TRUNCATE)  → (0, 0, 0)  # SQLITE_OK, 0 pages

Zero uncheckpointed pages. The data was genuinely never written to state.db.

5. Streamed responses reach the frontend

The user SEES the assistant reply in the WebUI frontend. The reply is correctly streamed through the bridge's Socket.IO events and saved to the TypeScript session-store via flushResponseRunToDb(). The problem is ONLY in the state.db write path.

Root Cause

The conversation loop's streaming callback delivers the assistant response to the bridge/frontend without appending a structured assistant message dict to the AIAgent's internal messages list.

At the end of run_conversation() (in agent/conversation_loop.py):

# Line ~3960
agent._drop_trailing_empty_response_scaffolding(messages)
agent._persist_session(messages, conversation_history)  # Writes messages to state.db

The assistant reply exists as the final_response string returned from run_conversation(), but not as a structured message dict in the messages list that _persist_session() writes.

Why this doesn't affect the Python WebUI (8787)

The Python WebUI handles sessions differently — it calls AIAgent in-process via streaming.py and uses state_sync.py to independently sync session metadata to state.db. It doesn't go through the bridge → worker architecture that has the _last_flushed_db_idx issue.

Data Flow

User sends message
  ↓
TypeScript backend (Socket.IO)
  ↓
handle-bridge-run.ts → addMessage() → session-store ✅ (immediate)
  ↓
hermes_bridge.py → _prepersist_user_message() → state.db ✅ (writes user msg)
  ↓
Worker process → AIAgent.run_conversation()
  ↓  (streams response to frontend via callback - user SEES it ✅)
  ↓  (but assistant NOT added to messages[] structured list ❌)
  ↓
conversation_loop.py:3961 → _persist_session(messages, ...)
  ↓
_flush_messages_to_session_db()
  ↓
flush_from=490, messages[490:] = [] → writes NOTHING to state.db ❌

Proposed Fix

In agent/conversation_loop.py, before _persist_session() at line ~3961, ensure final_response is injected into messages:

agent._drop_trailing_empty_response_scaffolding(messages)
# Inject streaming response into messages for persistence
if final_response and isinstance(final_response, str) and final_response.strip():
    if not (messages and isinstance(messages[-1], dict) and 
            messages[-1].get("role") == "assistant" and 
            messages[-1].get("content") == final_response):
        messages.append({"role": "assistant", "content": final_response})
agent._persist_session(messages, conversation_history)

This fix was tested and resolved the issue in a single-turn test.

Files Involved

File	Role
`~/.hermes/hermes-agent/agent/conversation_loop.py`	Conversation loop — final `_persist_session()` at ~L3961
`~/.hermes/hermes-agent/run_agent.py`	`_flush_messages_to_session_db()` — uses `_last_flushed_db_idx` to skip writes
`web-ui/.../agent-bridge/hermes_bridge.py`	`_align_prepersist_flush_cursor()` — sets `_last_flushed_db_idx` too aggressively

Additional diagnostic probes added during debugging:

[DBG-BUG] at 4 silent-catch except blocks
[DBG-BUG2] at final _persist_session() exit
[DBG-BUG3] at _session_db is None check
[DBG-BUG4] at final_response injection point

Reproduction

Chat through TypeScript WebUI (port 5173 → backend 8648)
Send any message
Compare state.db vs session-store for the same session
Observe missing assistant replies in state.db
Session-store has complete data

Workaround

Use the Python WebUI (port 8787) or Hermes CLI for sessions where state.db data integrity is critical.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

hermes - 💡(How to fix) Fix Bug: Bridge-worker silently drops assistant replies from state.db (streamed responses not in messages list)

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Workaround

Code Example

Bug: Bridge-worker assistant replies silently missing from state.db

Summary

Severity

Environment

Evidence

1. Database state comparison

2. Debug probe confirms nothing to flush

3. No exceptions thrown at any catch point

4. WAL checkpoint confirms data never written

5. Streamed responses reach the frontend

Root Cause

Why this doesn't affect the Python WebUI (8787)

Data Flow

Proposed Fix

Files Involved

Reproduction

Workaround

Still need to ship something?

TRENDING