hermes - ✅(Solved) Fix Improve long-session continuity across compactions [1 pull requests, 1 participants]

smilesvua · 2026-05-10T16:40:59Z

[hermes] PR 23308: feat: improve long-session continuity across compactions - Repository: NousResearch/hermes-agent - Author: smilesvua - State: open | merged:… # PR #23308: feat: improve long-session continuity across compactions - Repository: NousResearch/hermes-agent - Author: smilesvua - State: open | merged: False - Link: https://github.com/NousResearch/hermes-agent/pull/23308 ## Description (problem / solution / changelog) ## What does this PR do? Improves long-session continuity across repeated Hermes context compactions by adding recoverable session lineage, safer compression persistence/validation, durable run-ledger artifacts, memory pre-compress checkpointing, and reusable workflow templates for long autonomous work. ## Related Issue / Plan / Control Doc Fixes #23307 - PRD / implementation plan: `/mnt/data/hermes/plans/2026-05-10_032152-memory-across-compactions.md` - Long-task control doc / resume capsule: `/mnt/data/hermes/plans/2026-05-10_033003-memory-across-compactions-execution.md` - Run ledger / state capsule handles: new `hermes runs` CLI and `agent/run_ledger.py` / `agent/run_ledger_reader.py`; branch validation did not mutate real run ledgers - Evidence or artifact manifest: `/mnt/data/hermes/plans/2026-05-10_033003-final-review-context.md` ## Type of Change - [x] 🐛 Bug fix (non-breaking change that fixes an issue) - [x] ✨ New feature (non-breaking change that adds functionality) - [ ] 🔒 Security fix - [x] 📝 Documentation update - [x] ✅ Tests (adding or improving test coverage) - [ ] ♻️ Refactor (no behavior change) - [ ] 🎯 New skill (bundled or hub) ## Changes Made - `hermes_state.py`: resume resolution now follows validated compression continuation tips and avoids arbitrary child-session jumps. - `scripts/backfill_sessions_from_json.py`: adds an explicit manual JSON transcript -> SQLite repair script with dry-run, idempotent skip behavior, force repair, include-active opt-in, malformed-file reporting, and tests. - `run_agent.py`: persists parent transcript before compression rotation and persists the child compressed summary immediately after continuation creation. - `agent/context_compressor.py`: validates compression summaries and fails closed on invalid/truncated/template-leaking summaries; accepts bounded/redacted memory checkpoint source material. - `agent/run_ledger.py`: adds durable append-only run events, artifact references, and resume capsules for long runs. - `agent/run_ledger_reader.py` + `hermes_cli/run_ledger_cli.py`: adds read-only `hermes runs list/events/capsule/recover` retrieval surfaces. - `plugins/memory/hindsight/__init__.py`: adds bounded/redacted Hindsight pre-compress checkpoint extraction and non-blocking retain enqueueing. - `.github/*` and `docs/templates/*`: add long-task PR/issue/control-doc/evidence manifest templates. - Tests: focused coverage for session resume, backfill repair, compression split persistence, summary validation, run ledger writing/reading, CLI behavior, Hindsight pre-compress checkpoints, and template validation. ## How to Test 1. Run the focused impacted suite: `scripts/run_tests.sh tests/hermes_state/test_resolve_resume_session_id.py tests/test_hermes_state.py tests/scripts/test_backfill_sessions_from_json.py tests/run_agent/test_compression_split_persistence.py tests/run_agent/test_compression_boundary_hook.py tests/run_agent/test_run_ledger_compression_capsule.py tests/run_agent/test_run_ledger_session_reset.py tests/run_agent/test_run_ledger_tool_events.py tests/run_agent/test_compress_focus_plugin_fallback.py tests/gateway/test_compress_command.py tests/agent/test_context_compressor.py tests/agent/test_context_compressor_summary_continuity.py tests/agent/test_run_ledger.py tests/agent/test_run_ledger_readonly.py tests/hermes_cli/test_run_ledger_cli.py tests/cli/test_cli_new_session.py tests/cli/test_branch_command.py tests/plugins/memory/test_hindsight_provider.py -q` 2. Run Ruff on touched code/tests. 3. Validate workflow templates: parse GitHub issue YAML, parse JSON schema/template, run Draft 2020-12 schema validation against the template. 4. Run `git diff --check origin/main...HEAD`. ## TDD / Review Evidence - Planned tests vs. acceptance criteria: documented slice-by-slice in `/mnt/data/hermes/plans/2026-05-10_033003-memory-across-compactions-execution.md` before each implementation slice. - RED evidence: each code slice recorded expected failures before implementation (resume tip, backfill script missing/edge cases, compression split persistence, invalid summary fallback, run ledger, retrieval CLI, Hindsight pre-compress bridge). - GREEN evidence: final focused impacted suite passed: 498 passed in 23.55s. - Broader validation / regression checks: Ruff passed; workflow template YAML/JSON/schema validation passed; `git diff --check origin/main...HEAD` passed; worktree clean. - Independent review / second opinion: OpenRouter second review was run on plans/diffs/focused contexts per slice and on a final consolidated context

hermes2026-05-10 16:40:59

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

NousResearch/hermes-agent#23307•Fetched 2026-05-11 03:30:03

View on GitHub

Comments

Participants

Timeline

Reactions

Author

smilesvua

Participants

smilesvua

Timeline (top)

labeled ×3cross-referenced ×1

RAW_BUFFERClick to expand / collapse

Problem or use case

Long Hermes sessions can cross multiple context compactions. Before this work, resilience relied too heavily on lossy summaries and fragile session persistence/search behavior, making it harder for humans or fresh agents to resume long autonomous tasks safely.

Proposed solution

Improve long-session continuity across several layers:

Resolve compressed sessions to their latest continuation tip.
Provide a manual JSON transcript to SQLite/session-search backfill path.
Persist both sides of compression splits immediately.
Reject invalid or truncated compression summaries non-destructively.
Add durable run-ledger events, artifact manifests, and resume capsules.
Add read-only hermes runs retrieval CLI.
Bridge memory pre-compress checkpoints into compressor source material and Hindsight retention.
Add PR/issue/control-doc/evidence templates for long autonomous work.

Acceptance criteria

Resuming a compressed session ID lands on the latest compression continuation tip, not an arbitrary child.
Historical JSON transcripts with missing SQLite messages can be repaired manually with dry-run/idempotent behavior.
Compression split persists parent transcript and child summary immediately enough for resume/search recovery.
Invalid, truncated, or template-leaking summaries fail closed without destructive transcript replacement.
Long runs emit durable run-ledger events, artifacts, and resume capsules with read-only retrieval commands.
Memory providers can contribute bounded/redacted pre-compress checkpoint context; failures remain non-fatal.
Long autonomous work has reusable GitHub/control-doc/evidence templates.

TDD / validation plan

Implementation followed slice-by-slice TDD. Each code slice defined goal-to-test mappings, observed RED failures, implemented the minimal GREEN behavior, and ran focused plus adjacent regression tests.

Final local validation:

Focused impacted test set: 498 passed.
Ruff on touched code/tests: passed.
Workflow template YAML/JSON/schema validation: passed.
git diff --check origin/main...HEAD: passed.
OpenRouter second review: pass on slice reviews plus final consolidated context.

Durable context / evidence

Execution log/control doc: /mnt/data/hermes/plans/2026-05-10_033003-memory-across-compactions-execution.md Primary plan: /mnt/data/hermes/plans/2026-05-10_032152-memory-across-compactions.md Final review context: /mnt/data/hermes/plans/2026-05-10_033003-final-review-context.md

Worker contract / stop conditions

Scope: session continuity, compression persistence/validation, durable run-ledger recovery surfaces, memory pre-compress bridge, and workflow templates.

Stop/ask if any change requires live credential mutation, destructive production DB repair, or enabling new retention behavior outside tested local paths.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#dependency conflict #environment setup #docker error #permission error #memory optimization

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

hermes - ✅(Solved) Fix Improve long-session continuity across compactions [1 pull requests, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Fix Action

Fixed

PR fix notes

PR #23308: feat: improve long-session continuity across compactions

Description (problem / solution / changelog)

What does this PR do?

Related Issue / Plan / Control Doc

Type of Change

Changes Made

How to Test

TDD / Review Evidence

Operational / Safety Impact

Reviewer / second-review notes triaged

Checklist

Code

Documentation & Housekeeping

Screenshots / Logs

Changed files

Problem or use case

Proposed solution

Acceptance criteria

TDD / validation plan

Durable context / evidence

Worker contract / stop conditions

Still need to ship something?

TRENDING

hermes - ✅(Solved) Fix Improve long-session continuity across compactions [1 pull requests, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Fix Action

Fixed

PR fix notes

PR #23308: feat: improve long-session continuity across compactions

Description (problem / solution / changelog)

What does this PR do?

Related Issue / Plan / Control Doc

Type of Change

Changes Made

How to Test

TDD / Review Evidence

Operational / Safety Impact

Reviewer / second-review notes triaged

Checklist

Code

Documentation & Housekeeping

Screenshots / Logs

Changed files

Problem or use case

Proposed solution

Acceptance criteria

TDD / validation plan

Durable context / evidence

Worker contract / stop conditions

Still need to ship something?

RELATED_DISCOVERY

TRENDING