hermes - 💡(How to fix) Fix feat: pre-turn memory health hook in run_conversation() [1 comments, 1 participants]

hermes2026-05-13 15:18:16

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

NousResearch/hermes-agent#25061•Fetched 2026-05-14 03:49:22

View on GitHub

Comments

Participants

Timeline

Reactions

Author

frambu

Participants

frambu

Timeline (top)

labeled ×4commented ×1

Root Cause

run_conversation() in run_agent.py (~L11713) has no pre-processing hook that checks memory health before each turn. The only mechanism is a text instruction in the system prompt, which depends on the LLM remembering to follow it — something it demonstrably fails at.

Fix Action

Fix / Workaround

memory-management-and-cleanup skill (user-created) documents this pattern
Current workaround: cron job that polls every 6h (rejected as insufficient — the right fix is code-level)

Code Example

if self._memory_store and self._memory_store.get_usage_pct() > 70:
    compacted = self._memory_store.compact(target_pct=60)
    if compacted:
        logger.info("Pre-turn memory compaction: freed %d chars", compacted)

RAW_BUFFERClick to expand / collapse

Problem

Standing rules in the system prompt that require proactive LLM behavior (e.g. "if memory exceeds 70%, compact entries") are consistently ignored. The LLM prioritizes the user's latest message over static rules, making these instructions unreliable regardless of how clearly they are written.

Root Cause

Proposed Fix

Add a pre-turn hook before the main LLM call in run_conversation() that:

Checks self._memory_store.get_usage_pct()
If > 70%, automatically runs compaction targeting < 60%
Logs the action for observability

Something like:

if self._memory_store and self._memory_store.get_usage_pct() > 70:
    compacted = self._memory_store.compact(target_pct=60)
    if compacted:
        logger.info("Pre-turn memory compaction: freed %d chars", compacted)

This ensures memory health is enforced mechanically, not by LLM compliance with a text rule.

memory-management-and-cleanup skill (user-created) documents this pattern
Current workaround: cron job that polls every 6h (rejected as insufficient — the right fix is code-level)

Opened via feedback from a user session: standing rules in system prompt that require proactive LLM behavior are unreliable.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#prompt template #agent execution #callback error #memory management #API rate limit

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

hermes - 💡(How to fix) Fix feat: pre-turn memory health hook in run_conversation() [1 comments, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Fix / Workaround

Code Example

Problem

Root Cause

Proposed Fix

Related

Still need to ship something?

TRENDING

hermes - 💡(How to fix) Fix feat: pre-turn memory health hook in run_conversation() [1 comments, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Fix / Workaround

Code Example

Problem

Root Cause

Proposed Fix

Related

Still need to ship something?

RELATED_DISCOVERY

TRENDING