hermes - 💡(How to fix) Fix Bug: Agent enters retry loop when execute_code output is truncated / empty

hermes2026-05-31 04:44:55

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

Error Message

empty due to an internal error, the agent interprets the partial/empty Tool layer: Return a structured error when output is truncated, making

Fix Action

Fix / Workaround

Problem

When execute_code tool output is truncated (50 KB cap hit) or returns empty due to an internal error, the agent interprets the partial/empty result as a transient failure and retries the same code in the next turn — leading to a loop of 2–4 identical execute_code calls before it gives up or the user interrupts.

Observed Behavior

Agent calls execute_code with a script that produces large stdout
Platform layer truncates output (50 KB cap) or tool returns empty result
Agent receives output that looks like "no output" or partial output
Agent re-issues the same execute_code call, reasoning the previous attempt "didn't complete"
Steps 2–4 repeat 2–4 times

Why This Is Hard to Fix in a Single Place

This is agent reasoning behavior, not a deterministic tool bug. The retry decision happens in the LLM turn, not in the tool itself. Possible mitigations at different layers: Tool layer: Return a structured error when output is truncated, making it unambiguous that the call did run but output was cut: {"status": "truncated", "bytes_captured": N, "exit_code": 0} Currently truncation is silent from the agent's perspective. Prompt layer: Instruct the agent that a missing/empty execute_code result means output was dropped, not that execution failed — not a retry signal. Gateway layer: Detect consecutive identical execute_code calls and inject a warning before the next LLM turn.

Reproduction

Reliably reproduced by running execute_code with a script that prints more than ~50 KB of stdout. The agent sees empty output and retries.

Notes

Not filing a PR — the correct fix depends on which layer(s) upstream wants to address. Filing this to document the phenomenon and start the discussion.

RAW_BUFFERClick to expand / collapse

Problem

Observed Behavior

Agent calls execute_code with a script that produces large stdout
Platform layer truncates output (50 KB cap) or tool returns empty result
Agent receives output that looks like "no output" or partial output
Agent re-issues the same execute_code call, reasoning the previous attempt "didn't complete"
Steps 2–4 repeat 2–4 times

Why This Is Hard to Fix in a Single Place

Reproduction

Reliably reproduced by running execute_code with a script that prints more than ~50 KB of stdout. The agent sees empty output and retries.

Notes

Not filing a PR — the correct fix depends on which layer(s) upstream wants to address. Filing this to document the phenomenon and start the discussion.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

hermes - 💡(How to fix) Fix Bug: Agent enters retry loop when execute_code output is truncated / empty

Recommended Tools

GitHub issue graph ai analysis

Error Message

Fix Action

Fix / Workaround

Problem

Observed Behavior

Why This Is Hard to Fix in a Single Place

Reproduction

Notes

Problem

Observed Behavior

Why This Is Hard to Fix in a Single Place

Reproduction

Notes

Still need to ship something?

TRENDING