Error Message

Gateway log: no error, just retry timing

(a) Surface plugin failures at WARN. Independently of the env-config fix, the silent 30-min retry is its own UX cliff and overlaps with the observability gap of #27474. Logging the upstream HTTP status once at WARN level on the first memory-provider failure of a session would let operators attribute a freeze to the memory plugin without tcpdump or backend-side logs. Happy to scope a separate issue.

Fix Action

Fix / Workaround

Root cause is a two-line gap in plugins/memory/engram/__init__.py: the URL is hardcoded and no auth header is emitted. Patch is ready and fully backwards compatible.

Proposed patch â€” minimal, backwards compatible

Patch ready locally and validated against both default-localhost and remote-authenticated setups. Will open a PR if maintainers confirm the approach (env names, single-header X-API-Key vs also accepting Authorization: Bearer, etc.).

Code Example

# 1. Run Engram Core 0.3.0 with CLOUD_MODE=true (the documented secure default)
# 2. Configure any Hermes profile with memory.provider: engram in config.yaml
# 3. Send any message that triggers memory read/write (most do â€” session search runs every turn)

# Backend log:   POST /v1/search HTTP/1.1 401 Unauthorized
# Gateway log:   no error, just retry timing
# User sees:     â³ Still working... iteration 1/90, waiting for stream response
#                (~30 minutes per message before timeout)

---

ENGRAM_BASE = "http://localhost:8100"

# in _api_request():
req.add_header("X-Namespace", ns)
with urllib.request.urlopen(req, timeout=timeout) as r:
    ...

---

ENGRAM_BASE = os.environ.get("ENGRAM_BASE", "http://localhost:8100")
ENGRAM_API_KEY = os.environ.get("ENGRAM_API_KEY", "")

# in _api_request() and _list_memories():
req.add_header("X-Namespace", ns)
if ENGRAM_API_KEY:
    req.add_header("X-API-Key", ENGRAM_API_KEY)
with urllib.request.urlopen(req, timeout=timeout) as r:
    ...

TL;DR

Enabling Engram Core 0.3.0's recommended CLOUD_MODE posture turns the bundled engram memory plugin into a silent failure: every memory op returns 401 Unauthorized, the gateway retries for ~30 minutes per message, and no log surfaces the cause. To the end-user, Hermes looks frozen.

Root cause is a two-line gap in plugins/memory/engram/__init__.py: the URL is hardcoded and no auth header is emitted. Patch is ready and fully backwards compatible.

Investigating also surfaces a wider pattern worth surfacing for triage: engram is the only bundled memory plugin that doesn't read endpoint + credentials from env vars. The other four (mem0, supermemory, honcho, hindsight) all do â€” so this isn't a new convention to introduce, it's an existing one that engram has drifted away from.

Triage context: the user-visible behavior (gateway hangs ~30 min on the default secure posture of Engram 0.3.0, with no actionable log) is closer to the bug pattern of #27226 / #27474 than to a feature gap â€” flagged here in case the current type/feature + P3 labels (auto-applied at open time) deserve a second look.

Reproduction

# 1. Run Engram Core 0.3.0 with CLOUD_MODE=true (the documented secure default)
# 2. Configure any Hermes profile with memory.provider: engram in config.yaml
# 3. Send any message that triggers memory read/write (most do â€” session search runs every turn)

# Backend log:   POST /v1/search HTTP/1.1 401 Unauthorized
# Gateway log:   no error, just retry timing
# User sees:     â³ Still working... iteration 1/90, waiting for stream response
#                (~30 minutes per message before timeout)

/v1/health returns 200 without auth, so basic probes pass. The failure is invisible unless the operator tails the Engram-side logs directly.

Related open issues â€” this fits an existing cluster

Three sibling issues opened the same day (2026-05-17), all under tool/memory + comp/plugins, point at the same broader pattern of memory-plugin config/observability gaps:

#27314 â€” Honcho memory provider: support runtime peer alias/prefix mapping for multi-user gateways. Same class of concern: a memory plugin can't be configured for multi-tenant / multi-user deployments without source forks.
#27226 â€” Supermemory default/My Space config can create a literal default container. Plugin treats hardcoded sentinels incorrectly, silently splits user memory. Labeled type/bug P2.
#27474 â€” hindsight embedded daemon profile mismatch causes false 'not available' status. Memory plugin silent observability failure â€” status reports wrong without explaining why. Labeled type/bug P3.

The engram silent-401 + 30-min hang is the same family of failure: a bundled memory plugin whose configuration model can't keep up with how the upstream service evolves (or with non-default deployment shapes), surfacing as silent UX cliffs at the gateway. Treating these as a cluster (rather than as four isolated tickets) might be worth a maintainer-side decision.

Who is affected by the engram instance specifically

Three concrete classes of user are blocked or about to be:

1. Anyone upgrading Engram Core to 0.3.0 with auth enabled. This is the documented Engram recommendation and the default in fresh installs. After upgrade, every Hermes profile using memory.provider: engram enters the 30-min retry loop on every message.

2. Anyone running Engram on a non-localhost host. A reasonable production split (gateway and memory backend on different VMs / containers) is impossible today without forking the plugin into $HERMES_HOME/plugins/ â€” the URL is hardcoded to http://localhost:8100.

3. Anyone running multiple Hermes profiles against different memory tenants (dev/staging/prod, multi-org, isolating an experimental profile from production memory). Today this requires duplicating the plugin source per profile â€” the exact pain that #27314 reports for Honcho.

Why the fix is more than a one-line tweak â€” engram is an outlier in this repo

A quick grep of plugins/memory/*/__init__.py:

Plugin	Endpoint via env	Credential via env
`mem0`	`MEM0_BASE_URL`	`MEM0_API_KEY`
`supermemory`	`SUPERMEMORY_BASE_URL`	`SUPERMEMORY_API_KEY`
`honcho`	`HONCHO_BASE_URL`	`HONCHO_API_KEY`
`hindsight`	`HINDSIGHT_BASE_URL`	`HINDSIGHT_API_KEY`
`engram`	âŒ hardcoded `http://localhost:8100`	âŒ no auth header emitted at all

Four of five bundled memory plugins already follow 12-factor â€” env-driven endpoint + credential is the de facto convention. Engram is the only one without that safety net, which is exactly why it broke silently when its upstream service evolved its auth posture. Bringing engram back into the convention closes the immediate bug and removes the seam where this class of failure can re-emerge for the next plugin.

Current plugin code (`plugins/memory/engram/init.py`)

ENGRAM_BASE = "http://localhost:8100"

# in _api_request():
req.add_header("X-Namespace", ns)
with urllib.request.urlopen(req, timeout=timeout) as r:
    ...

Proposed patch â€” minimal, backwards compatible

ENGRAM_BASE = os.environ.get("ENGRAM_BASE", "http://localhost:8100")
ENGRAM_API_KEY = os.environ.get("ENGRAM_API_KEY", "")

# in _api_request() and _list_memories():
req.add_header("X-Namespace", ns)
if ENGRAM_API_KEY:
    req.add_header("X-API-Key", ENGRAM_API_KEY)
with urllib.request.urlopen(req, timeout=timeout) as r:
    ...

Defaults preserve the exact current behavior (localhost:8100, no auth header) â€” zero impact on existing installs. Aligns engram with how mem0/supermemory/honcho/hindsight are already configured.

Suggested follow-up scopes (optional, separate issues)

(b) Make the env-driven contract explicit. If maintainers want, a short docs section ("expected env names for bundled memory plugins") â€” or a tiny base-class helper exposing endpoint_from_env(name, default) / auth_header_from_env(name) â€” would give future plugins an obvious template and prevent another silent drift. Touches the same surface as #27314 (per-plugin runtime knobs).

Versions tested

Engram Core 0.3.0 (CLOUD_MODE=true, X-API-Key auth)
Hermes Agent recent main

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

hermes - 💡(How to fix) Fix engram memory plugin causes silent 30-min gateway hang on Engram 0.3.0 CLOUD_MODE — and reveals missing env-driven config baseline across bundled memory plugins

Recommended Tools

GitHub issue graph ai analysis

Error Message

Gateway log: no error, just retry timing

Root Cause

Fix Action

Fix / Workaround

Proposed patch â€” minimal, backwards compatible

Code Example

TL;DR

Reproduction

Related open issues â€” this fits an existing cluster

Who is affected by the engram instance specifically

Why the fix is more than a one-line tweak â€” engram is an outlier in this repo

Current plugin code (`plugins/memory/engram/init.py`)

Proposed patch â€” minimal, backwards compatible

Suggested follow-up scopes (optional, separate issues)

Versions tested

PR

Still need to ship something?

TRENDING

hermes - 💡(How to fix) Fix engram memory plugin causes silent 30-min gateway hang on Engram 0.3.0 CLOUD_MODE — and reveals missing env-driven config baseline across bundled memory plugins

Recommended Tools

GitHub issue graph ai analysis

Error Message

Gateway log: no error, just retry timing

Root Cause

Fix Action

Fix / Workaround

Proposed patch â€” minimal, backwards compatible

Code Example

TL;DR

Reproduction

Related open issues â€” this fits an existing cluster

Who is affected by the engram instance specifically

Why the fix is more than a one-line tweak â€” engram is an outlier in this repo

Current plugin code (plugins/memory/engram/__init__.py)

Proposed patch â€” minimal, backwards compatible

Suggested follow-up scopes (optional, separate issues)

Versions tested

PR

Still need to ship something?

RELATED_DISCOVERY

TRENDING

Current plugin code (`plugins/memory/engram/init.py`)