hermes - 💡(How to fix) Fix Feature: Allow memory provider to serve as session search backend (bypass local FTS5)

StepCodex · 2026-05-21T15:35:50Z

[hermes] Problem Hermes Agent has two independent retrieval paths for past conversations, and they never talk to each other: | Path | Tool | Backend | Search T… ## Problem Hermes Agent has **two independent retrieval paths** for past conversations, and they never talk to each other: | Path | Tool | Backend | Search Type | |------|------|---------|-------------| | Built-in | session_search | Local SQLite FTS5 (hermes_state.py) | Keyword matching | | Memory Provider | viking_search (OpenViking) / provider-specific | External vector DB | Semantic / RAG | When a user configures an external memory provider like OpenViking, the provider already receives **all session data** via sync_turn() and on_session_end()/commit. The OpenViking plugin stores every conversation turn and runs VLM-based memory extraction on commit. This means the memory provider has **richer, semantically searchable** archives of the same conversations. However, session_search always hits the local SQLite FTS5 — it has no awareness of the active memory provider. The agent system prompt (SESSION_SEARCH_GUIDANCE in prompt_builder.py) directs it to use session_search by default for cross-session recall, which **completely bypasses the memory provider**. ## Why This Matters 1. **Token efficiency**: Semantic vector search retrieves relevant context in 1 query, while FTS5 keyword matching may need multiple attempts with different query phrasing 2. **Result quality**: FTS5 returns raw message snippets; memory providers like OpenViking return VLM-extracted, structured memories 3. **Data duplication**: Session data is stored in two places (local DB + provider) but only the local DB is searchable via session_search 4. **Wasted potential**: Memory providers like OpenViking already have prefetch() and queue_prefetch() methods in their plugin interface — suggesting someone already envisioned this integration, but it was never wired up ## Proposed Solutions ### Option A — Memory provider registers a session search backend Extend the BaseMemoryProvider interface with an optional `session_search(query, limit, ...)` method. When the active memory provider implements it, the session_search tool delegates to the provider instead of hitting local FTS5. Backwards-compatible: providers without it fall back to current FTS5. ### Option B — Config-level switch Add a config key like `memory.search_backend: auto | local | provider`: - `auto` (default): use provider if it supports search, fallback to local FTS5 - `local`: always use FTS5 (current behavior) - `provider`: force delegate to memory provider ### Option C — Unified search tool Replace session_search + provider-specific search tools (e.g. viking_search) with a single unified search tool that knows which backend to query. Agent uses one tool for all cross-session retrieval regardless of backend. ## Implementation Notes - OpenViking plugin already exposes viking_search and has prefetch()/queue_prefetch() lifecycle hooks — well positioned for Option A - session_search_tool.py operates purely on hermes_state.SessionDB (SQLite) — needs to accept a provider reference at construction or via a registry - memory_manager.py already orchestrates provider lifecycle — could also manage search delegation - prompt_builder.py SESSION_SEARCH_GUIDANCE would need updating ## Related - PR #10463 / Issue #8705: OpenViking session lifecycle fixes — baseline for making OpenViking a full search backend - OpenViking plugin already has prefetch(query, session_id) — the hook point for session-aware retrieval

hermes2026-05-21 15:35:50

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

Root Cause

Token efficiency: Semantic vector search retrieves relevant context in 1 query, while FTS5 keyword matching may need multiple attempts with different query phrasing
Result quality: FTS5 returns raw message snippets; memory providers like OpenViking return VLM-extracted, structured memories
Data duplication: Session data is stored in two places (local DB + provider) but only the local DB is searchable via session_search
Wasted potential: Memory providers like OpenViking already have prefetch() and queue_prefetch() methods in their plugin interface — suggesting someone already envisioned this integration, but it was never wired up

RAW_BUFFERClick to expand / collapse

Problem

Hermes Agent has two independent retrieval paths for past conversations, and they never talk to each other:

Path	Tool	Backend	Search Type
Built-in	session_search	Local SQLite FTS5 (hermes_state.py)	Keyword matching
Memory Provider	viking_search (OpenViking) / provider-specific	External vector DB	Semantic / RAG

When a user configures an external memory provider like OpenViking, the provider already receives all session data via sync_turn() and on_session_end()/commit. The OpenViking plugin stores every conversation turn and runs VLM-based memory extraction on commit. This means the memory provider has richer, semantically searchable archives of the same conversations.

However, session_search always hits the local SQLite FTS5 — it has no awareness of the active memory provider. The agent system prompt (SESSION_SEARCH_GUIDANCE in prompt_builder.py) directs it to use session_search by default for cross-session recall, which completely bypasses the memory provider.

Why This Matters

Token efficiency: Semantic vector search retrieves relevant context in 1 query, while FTS5 keyword matching may need multiple attempts with different query phrasing
Result quality: FTS5 returns raw message snippets; memory providers like OpenViking return VLM-extracted, structured memories
Data duplication: Session data is stored in two places (local DB + provider) but only the local DB is searchable via session_search
Wasted potential: Memory providers like OpenViking already have prefetch() and queue_prefetch() methods in their plugin interface — suggesting someone already envisioned this integration, but it was never wired up

Proposed Solutions

Option A — Memory provider registers a session search backend

Extend the BaseMemoryProvider interface with an optional session_search(query, limit, ...) method. When the active memory provider implements it, the session_search tool delegates to the provider instead of hitting local FTS5. Backwards-compatible: providers without it fall back to current FTS5.

Option B — Config-level switch

Add a config key like memory.search_backend: auto | local | provider:

auto (default): use provider if it supports search, fallback to local FTS5
local: always use FTS5 (current behavior)
provider: force delegate to memory provider

Option C — Unified search tool

Replace session_search + provider-specific search tools (e.g. viking_search) with a single unified search tool that knows which backend to query. Agent uses one tool for all cross-session retrieval regardless of backend.

Implementation Notes

OpenViking plugin already exposes viking_search and has prefetch()/queue_prefetch() lifecycle hooks — well positioned for Option A
session_search_tool.py operates purely on hermes_state.SessionDB (SQLite) — needs to accept a provider reference at construction or via a registry
memory_manager.py already orchestrates provider lifecycle — could also manage search delegation
prompt_builder.py SESSION_SEARCH_GUIDANCE would need updating

PR #10463 / Issue #8705: OpenViking session lifecycle fixes — baseline for making OpenViking a full search backend
OpenViking plugin already has prefetch(query, session_id) — the hook point for session-aware retrieval

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering