hermes - 💡(How to fix) Fix [Bug]: SessionDB silently skips current turn when message repair shortens conversation history [4 pull requests]

hermes2026-05-12 05:20:01

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

AIAgent._repair_message_sequence(messages) can shorten the in-memory messages list before persistence, but _flush_messages_to_session_db(messages, conversation_history) still uses len(conversation_history) as the skip offset.

If repair removes or merges enough messages from the historical portion, flush_from can become greater than len(messages). Python slicing then returns an empty list, so the current user turn and assistant response are silently not persisted to SessionDB.

Error Message

No exception is raised, and the current turn is skipped. 4. Add a warning/error if flush_from > len(messages).

Root Cause

The relevant flow is:

Gateway loads persisted history from SessionDB.

run_conversation() builds:

messages = list(conversation_history)
messages.append(current_user_message)

Before the API call, Hermes runs:

repaired_seq = self._repair_message_sequence(messages)

That repair can mutate messages in place by:
- dropping stray/orphan tool messages
- merging consecutive user messages

On exit, persistence still computes:

start_idx = len(conversation_history)
flush_from = max(start_idx, self._last_flushed_db_idx)
for msg in messages[flush_from:]:
    self._session_db.append_message(...)

If conversation_history had 120 entries, but repair shortens messages to 116, then:

messages[120:] == []

No exception is raised, and the current turn is skipped.

Fix Action

Fixed

Fixed by PR: fix(agent): prevent silent data loss when message repair shortens history (https://github.com/NousResearch/hermes-agent/pull/24196)
Fixed by PR: fix(agent): persist repaired current turn history (https://github.com/NousResearch/hermes-agent/pull/24211)
Fixed by PR: fix(agent): clamp flush_from to len(messages) after repair shortens the list (https://github.com/NousResearch/hermes-agent/pull/24326)
Fixed by PR: fix(agent): rewrite SessionDB transcript after message-sequence repair (https://github.com/NousResearch/hermes-agent/pull/24419)

Code Example

messages = list(conversation_history)
   messages.append(current_user_message)

---

repaired_seq = self._repair_message_sequence(messages)

---

start_idx = len(conversation_history)
   flush_from = max(start_idx, self._last_flushed_db_idx)
   for msg in messages[flush_from:]:
       self._session_db.append_message(...)

---

messages[120:] == []

---

history_len=120
messages before repair = 122
repair removes/merges 6 historical entries
messages after repair = 116
flush_from = len(conversation_history) = 120
messages[120:] = []
flushed_rows = 0

---

history_len=120 before_repair=122 repairs=6 after_repair=116 flushed_rows=0

---

flush_from > len(messages)

RAW_BUFFERClick to expand / collapse

Summary

Impact

Gateway-style integrations that create a fresh AIAgent per inbound message rely on SessionDB for continuity. Once this happens, the same session keeps loading stale history, causing follow-up messages like "yes", "check again", or "continue" to resolve against old context.

Observed symptom:

user asks about weather
assistant asks whether to check again
user replies "check"
model answers an unrelated old topic because the recent weather turn was never persisted

Root Cause

The relevant flow is:

Gateway loads persisted history from SessionDB.

run_conversation() builds:

messages = list(conversation_history)
messages.append(current_user_message)

Before the API call, Hermes runs:

repaired_seq = self._repair_message_sequence(messages)

That repair can mutate messages in place by:
- dropping stray/orphan tool messages
- merging consecutive user messages

On exit, persistence still computes:

start_idx = len(conversation_history)
flush_from = max(start_idx, self._last_flushed_db_idx)
for msg in messages[flush_from:]:
    self._session_db.append_message(...)

If conversation_history had 120 entries, but repair shortens messages to 116, then:

messages[120:] == []

No exception is raised, and the current turn is skipped.

Minimal Reproduction Shape

A simplified reproduction:

history_len=120
messages before repair = 122
repair removes/merges 6 historical entries
messages after repair = 116
flush_from = len(conversation_history) = 120
messages[120:] = []
flushed_rows = 0

Observed local reproduction output:

history_len=120 before_repair=122 repairs=6 after_repair=116 flushed_rows=0

Expected Behavior

The current user message and assistant response should always be persisted, even if historical messages are repaired before the model call.

At minimum, _flush_messages_to_session_db() should not silently skip persistence when:

flush_from > len(messages)

Suggested Fix

Do not use the original len(conversation_history) as the only persistence boundary after in-place repair.

Possible approaches:

Track the current-turn boundary explicitly after repair.
Adjust the persistence offset when _repair_message_sequence() mutates messages.
Persist the current turn separately from historical replay.
Add a warning/error if flush_from > len(messages).

Suggested Regression Test

Add a test where:

conversation_history contains malformed historical entries.
messages = conversation_history + [current_user, assistant_reply].
_repair_message_sequence(messages) shortens messages.
_flush_messages_to_session_db(messages, conversation_history) is called.
Assert the current user and assistant reply are still written to SessionDB.

This should prevent silent context loss in gateway integrations.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #ssr #memory leak #API versioning #request timeout

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

hermes - 💡(How to fix) Fix [Bug]: SessionDB silently skips current turn when message repair shortens conversation history [4 pull requests]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Fixed

Code Example

Summary

Impact

Root Cause

Minimal Reproduction Shape

Expected Behavior

Suggested Fix

Suggested Regression Test

Still need to ship something?

TRENDING

hermes - 💡(How to fix) Fix [Bug]: SessionDB silently skips current turn when message repair shortens conversation history [4 pull requests]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Fixed

Code Example

Summary

Impact

Root Cause

Minimal Reproduction Shape

Expected Behavior

Suggested Fix

Suggested Regression Test

Still need to ship something?

RELATED_DISCOVERY

TRENDING