hermes - 💡(How to fix) Fix [Memory System] No importance-scoring mechanism — what should be remembered vs. forgotten? [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
NousResearch/hermes-agent#12883Fetched 2026-04-20 12:16:24
View on GitHub
Comments
0
Participants
1
Timeline
0
Reactions
0

Error Message

  • One-off observations (e.g., a specific error message)

Root Cause

Root Cause

RAW_BUFFERClick to expand / collapse

markdown ## Core Problem

 The Hermes memory system has a fundamental meta-cognitive flaw: **there is no mechanism to determine what information is worth preserving
 long-term vs. temporary context that should expire.**

 ---

 ## 1. No Importance/Relevance Scoring

 - **"Save everything" is the only strategy**: When in doubt, agents err on the side of saving. Over time, memory becomes bloated with low-value
 trivia while genuinely critical insights get buried.
 - **No "signal vs. noise" discrimination**: The system cannot distinguish between:
   - One-off observations (e.g., a specific error message)
   - Durable patterns (e.g., a user's working style, recurring failure modes)
   - Ephemeral context (current task state) vs. permanent knowledge (user preferences)
 - **No decay/importance recalculation**: Once saved, nothing ever gets promoted (important) or demoted (forgotten). Memory is write-only.

 **The paradox**: Users must manually tell the AI "remember this" — but users shouldn't have to manage memory. The AI should know what matters.

 ---

 ## 2. Fragmented Memory Surfaces (Architecture Problem)

 Four independent systems, each storing different types of information with no unified retrieval:

 | System | Content | Problem |
 |--------|---------|---------|
 | `memory` (memory.md) | Persistent notes about environment, conventions, tool quirks | Becomes bloated, 95%+ capacity |
 | `user` (user profile) | User preferences, communication style, personal context | Mixed with system-level facts |
 | `session/` | Full conversation transcripts | Never auto-summarized, grows forever |
 | `skills/` | Procedural knowledge (skills) | Separate from memory, no cross-reference |
 | gbrain/mempalace | External knowledge brains | Syncs separately, embedding failures cause silent degradation |

 **Result**: The AI has no single queryable memory — it must check multiple places with different interfaces, and the data is inconsistent across
 them.

 ---

 ## 3. Session Breaks = Context Loss

 - On reconnect, key context from before the break is lost unless it was explicitly saved to memory
 - What's actually preserved after a disconnect is unclear and unpredictable to the user
 - "I told you this last session" has no guarantee of working
 - `session_search` exists but is unreliable — searches miss relevant past sessions or return irrelevant ones

 ---

 ## 4. gbrain Sync Degrades Silently

 - Embedding failures (503 errors) cause gbrain to fall back to keyword-only search without alerting the user
 - The user perceives "search is working" when it's actually operating in a severely degraded mode
 - No health check, no warning, no self-healing

 ---

 ## 5. Skills and Memory Are Separate Silos

 - Skills store procedural knowledge (HOW to do things)
 - Memory stores facts/context (WHAT the user wants, WHY they want it)
 - These should inform each other but don't — no cross-pollination

 ---

 ## Root Cause

 This is a **meta-cognition problem**: the system has no internal model of "what is valuable to remember." It needs:

 1. **Importance scoring** — real-time assessment of whether a piece of information is worth long-term storage
 2. **Memory lifecycle management** — promote, demote, expire, consolidate
 3. **Consolidated retrieval** — one queryable interface, not 4 fragmented systems
 4. **Self-awareness of degradation** — know when gbrain/etc. is failing and tell the user

 ---

 ## Suggested Directions (Not Prescriptive)

 - Add a "memory pressure" metric that triggers consolidation when capacity is high
 - Tag each memory entry with a confidence/relevance score that decays over time
 - Build a unified memory API so the agent queries one place, not four
 - Add a `memory health` command showing which systems are degraded and how much data is "stale"
 - Auto-summarize old sessions into compressed memory entries

 ---

 *This issue captures systemic memory architecture problems identified through user feedback.*

extent analysis

TL;DR

Implementing an importance scoring system and unified memory API can help address the meta-cognitive flaw in the Hermes memory system.

Guidance

  • Introduce a real-time importance scoring mechanism to assess the value of information for long-term storage, potentially using a confidence/relevance score that decays over time.
  • Develop a unified memory API to consolidate retrieval across the four fragmented systems, enabling the agent to query one interface instead of multiple.
  • Consider adding a "memory pressure" metric to trigger consolidation when capacity is high, and implement auto-summarization of old sessions to reduce memory bloat.
  • Create a memory health command to provide visibility into system degradation and data staleness.

Example

A potential implementation could involve tagging each memory entry with a relevance score, such as a timestamped confidence level, to facilitate decay and prioritization:

memory_entry = {
    'content': 'user preference',
    'relevance_score': 0.8,  # initial confidence level
    'timestamp': datetime.now()
}

This example is simplified and may require adaptation to the actual system architecture.

Notes

The suggested directions provided in the issue are not prescriptive, and the actual implementation will depend on the specific requirements and constraints of the Hermes memory system. Additionally, addressing the meta-cognitive flaw will likely require ongoing evaluation and refinement to ensure the system effectively prioritizes valuable information.

Recommendation

Apply a workaround by introducing a basic importance scoring system and unified memory API to begin addressing the meta-cognitive flaw, as this will provide a foundation for further refinement and improvement.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING