hermes - 💡(How to fix) Fix [Feature]: Tiered Memory Storage Architecture — Replace flat memory with 4-layer storage

StepCodex · 2026-05-26T17:18:15Z

[hermes] Problem or Use Case The current memory tool has a ~2200 character limit. Agents naturally dump everything into it — project configs, position data, wo… ### Problem or Use Case The current `memory` tool has a ~2200 character limit. Agents naturally dump everything into it — project configs, position data, workflow details, user preferences, version histories — all mixed together. This leads to: 1. **Memory fills up fast** (hitting 99% regularly) 2. **Hard to update** (needs exact string matching for replace) 3. **Hard to find** (behavior rules mixed with technical configs) 4. **Wastes tokens** (full memory injected every turn even when most is irrelevant) When memory is full, agents start forgetting things they were explicitly told. For users running complex workflows (trading systems, multi-project management, CI/CD pipelines), this is a critical limitation. ### Proposed Solution Instead of treating `memory` as the only persistent storage, adopt a **tiered architecture** with 4 layers: ### Layer 0: Root Index File (`~/.hermes/index.json`) A single JSON file that maps all active projects to their storage locations. Agents only need **one** memory entry: `"project index → ~/.hermes/index.json"`. ### Layer 1: Project Files (Data Layer) - Structured data (positions, configs, output data) - Scripts and automation - No character limit, direct read/write via terminal/execute_code ### Layer 2: Skills (Logic Layer) - Reusable workflows and procedures - Configuration parameters and version histories - Managed via `skill_manage` ### Layer 3: Memory (Index + Rules Layer) - Only two types of content: a) **Global behavior rules** — user preferences, communication style, dos/donts b) **One pointer** to the root index file - NOT for: detailed data, workflow steps, version histories ### Layer 4: session_search (Fallback) - Full-text search across past conversation sessions - Last resort when above layers don't have the answer **Implementation suggestions:** - Add a **memory quota warning** when usage exceeds 80% - Document this tiered approach as a best practice in Hermes docs - Optionally ship a default `~/.hermes/index.json` template - Update the memory tool's system prompt to explicitly recommend project files over memory for large data ### Alternatives Considered 1. **Expand memory limit** — Simpler but doesn't solve the "everything mixed together" problem; still wastes tokens per turn. 2. **Use a single universal approach** — What agents do today (cram everything into memory). It has already been proven to fail at scale. 3. **Database backend (SQLite)** — More powerful but more complex to ship and maintain. The tiered approach uses existing infrastructure (file system + skills + memory) without new dependencies. ### Feature Type Performance / reliability ### Scope Small (single file, < 50 lines) ### Contribution - [ ] I'd like to implement this myself and submit a PR ### Debug Report (optional) ```shell ```

hermes2026-05-26 17:18:15

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

RAW_BUFFERClick to expand / collapse

Problem or Use Case

The current memory tool has a ~2200 character limit. Agents naturally dump everything into it — project configs, position data, workflow details, user preferences, version histories — all mixed together. This leads to:

Memory fills up fast (hitting 99% regularly)
Hard to update (needs exact string matching for replace)
Hard to find (behavior rules mixed with technical configs)
Wastes tokens (full memory injected every turn even when most is irrelevant)

When memory is full, agents start forgetting things they were explicitly told. For users running complex workflows (trading systems, multi-project management, CI/CD pipelines), this is a critical limitation.

Proposed Solution

Instead of treating memory as the only persistent storage, adopt a tiered architecture with 4 layers:

Layer 0: Root Index File (`~/.hermes/index.json`)

A single JSON file that maps all active projects to their storage locations. Agents only need one memory entry: "project index → ~/.hermes/index.json".

Layer 1: Project Files (Data Layer)

Structured data (positions, configs, output data)
Scripts and automation
No character limit, direct read/write via terminal/execute_code

Layer 2: Skills (Logic Layer)

Reusable workflows and procedures
Configuration parameters and version histories
Managed via skill_manage

Layer 3: Memory (Index + Rules Layer)

Only two types of content: a) Global behavior rules — user preferences, communication style, dos/donts b) One pointer to the root index file
NOT for: detailed data, workflow steps, version histories

Layer 4: session_search (Fallback)

Full-text search across past conversation sessions
Last resort when above layers don't have the answer

Implementation suggestions:

Add a memory quota warning when usage exceeds 80%
Document this tiered approach as a best practice in Hermes docs
Optionally ship a default ~/.hermes/index.json template
Update the memory tool's system prompt to explicitly recommend project files over memory for large data

Alternatives Considered

Expand memory limit — Simpler but doesn't solve the "everything mixed together" problem; still wastes tokens per turn.
Use a single universal approach — What agents do today (cram everything into memory). It has already been proven to fail at scale.
Database backend (SQLite) — More powerful but more complex to ship and maintain. The tiered approach uses existing infrastructure (file system + skills + memory) without new dependencies.

Feature Type

Performance / reliability

Scope

Small (single file, < 50 lines)

Contribution

I'd like to implement this myself and submit a PR

Debug Report (optional)

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

hermes - 💡(How to fix) Fix [Feature]: Tiered Memory Storage Architecture — Replace flat memory with 4-layer storage

Recommended Tools

GitHub issue graph ai analysis

Problem or Use Case

Proposed Solution

Layer 0: Root Index File (`~/.hermes/index.json`)

Layer 1: Project Files (Data Layer)

Layer 2: Skills (Logic Layer)

Layer 3: Memory (Index + Rules Layer)

Layer 4: session_search (Fallback)

Alternatives Considered

Feature Type

Scope

Contribution

Debug Report (optional)

Still need to ship something?

TRENDING

hermes - 💡(How to fix) Fix [Feature]: Tiered Memory Storage Architecture — Replace flat memory with 4-layer storage

Recommended Tools

GitHub issue graph ai analysis

Problem or Use Case

Proposed Solution

Layer 0: Root Index File (~/.hermes/index.json)

Layer 1: Project Files (Data Layer)

Layer 2: Skills (Logic Layer)

Layer 3: Memory (Index + Rules Layer)

Layer 4: session_search (Fallback)

Alternatives Considered

Feature Type

Scope

Contribution

Debug Report (optional)

Still need to ship something?

TRENDING

Layer 0: Root Index File (`~/.hermes/index.json`)