hermes - 💡(How to fix) Fix Silent failure: stream timeout after partial text delivery shows no error to user

Root Cause

In agent/chat_completion_helpers.py, the interruptible_streaming_api_call function has this logic (around line 2074):

if result["error"] is not None:
    if deltas_were_sent["yes"]:
        # Some content was already delivered, so a retry would duplicate it
        if _partial_names:  # tool calls were in flight
            # ✅ GOOD: Shows warning "⚠ Stream stalled mid tool-call..."
            ...
        else:  # just text, no tool calls
            # ❌ BUG: Only logs a warning, then returns the stub silently
            logger.warning("Partial stream delivered before error...")
            # Stub response with the partial text is returned here;
            # the turn ends "normally"
            ...
    raise result["error"]  # Only reached if nothing was sent yet

The else branch (text-only partial delivery) produces no user-facing signal that the stream failed. The stub response has finish_reason="stop", so the conversation loop treats it as a successful turn.
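
To make the consequence concrete, here is a minimal sketch of why a loop that only inspects finish_reason cannot tell a truncated stub from a completed turn. StubResponse, build_partial_stub, and turn_succeeded are hypothetical stand-ins, not Hermes code; only the finish_reason="stop" behavior is taken from the report, and the real conversation loop may check more than this.

```python
from dataclasses import dataclass, field


@dataclass
class StubResponse:
    """Hypothetical stand-in for the response object seen by the loop."""
    content: str
    finish_reason: str = "stop"
    tool_calls: list = field(default_factory=list)


def build_partial_stub(partial_text: str) -> StubResponse:
    # Mirrors the reported behavior: the stub carries only the text that was
    # already streamed and is marked finish_reason="stop".
    return StubResponse(content=partial_text)


def turn_succeeded(response: StubResponse) -> bool:
    # Assumption: "stop" with no tool calls is treated as a finished turn.
    return response.finish_reason == "stop" and not response.tool_calls


complete = StubResponse(content="Understood! I'll update the layout now.")
truncated = build_partial_stub("Understood! I'll")

# Both responses look the same to the loop, so the timeout is invisible.
print(turn_succeeded(complete), turn_succeeded(truncated))
```

Under these assumptions, nothing downstream of the helper can recover the error, which is why the warning has to be emitted at the point where the stub is built.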

Description

When a streaming API call times out after some text has already been delivered to the user but before any tool calls are generated, the system silently returns a truncated stub response with no user-facing error message. The conversation turn ends as if it completed normally, but the model's intended actions (tool calls) were never executed.

What the User Sees

1. The model starts outputting text (e.g., "Understood! I'll.....").
2. The user sees this text appear in the terminal.
3. The stream times out (120s read timeout exceeded).
4. The turn ends silently: no error, no retry, no warning.
5. The timer stops, so the session appears "stuck".
6. The tool calls the model was about to make are never executed.
7. The user waits indefinitely, thinking the model is still working.

Expected Behavior

When a stream times out after partial text delivery (no tool calls in flight), the user should still see a warning message, similar to the existing tool-call-in-flight case:

⚠ Stream timed out after partial delivery. The response may be incomplete.
Ask me to retry if you want to continue.

Steps to Reproduce

1. Use a reasoning model (e.g., mimo-v2.5-pro) with a large context (40K+ tokens).
2. Send a request that requires tool calls (e.g., "help me redesign this webpage").
3. The model starts reasoning and begins outputting text.
4. The stream read timeout (120s by default) is exceeded before any tool calls are generated.
5. The turn ends silently with truncated text.
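
Because the steps above depend on provider latency, they are hard to trigger on demand. The following is a minimal sketch of the same condition using a fake stream instead of a real provider. fake_stream and consume are hypothetical helpers written for illustration and do not correspond to Hermes functions.

```python
from typing import Callable, Iterator, List


def fake_stream(chunks: List[str], fail_after: int) -> Iterator[str]:
    """Yield text deltas, then raise TimeoutError to imitate a read timeout."""
    for index, chunk in enumerate(chunks):
        if index == fail_after:
            raise TimeoutError("stream read timeout exceeded")
        yield chunk


def consume(stream: Iterator[str], on_delta: Callable[[str], None]) -> dict:
    """Forward deltas to the display callback and record what happened."""
    state = {"text": "", "deltas_sent": False, "error": None}
    try:
        for delta in stream:
            on_delta(delta)
            state["text"] += delta
            state["deltas_sent"] = True
    except TimeoutError as exc:
        state["error"] = exc
    return state


shown: List[str] = []
state = consume(
    fake_stream(["Understood! ", "I'll ", "start by"], fail_after=2),
    shown.append,
)
# state now holds partial text, deltas_sent=True, and a TimeoutError:
# the text-only partial-delivery case described in this report.
```

A test built this way could assert that the user-facing output contains a warning whenever deltas were sent and an error was recorded.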

Environment

Hermes Agent v0.14.0
Provider: xiaomi (mimo-v2.5-pro) / Z.AI (glm-5.1)
Default stream read timeout: 120s
Large context sessions (40K+ tokens)

Frequency

Observed 25+ times across 3 days (May 18, 24, 25) with both xiaomi and Z.AI providers. More frequent with reasoning models and large contexts.

Relevant Code

agent/chat_completion_helpers.py lines 2074-2138 — the silent stub path
agent/chat_completion_helpers.py lines 1378-1380 — HERMES_STREAM_READ_TIMEOUT default 120s
agent/conversation_loop.py lines 1140-1155 — spinner/callback cleanup after streaming
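
For readers who want to experiment with the timeout itself, the sketch below shows one common way a setting like this is read. Only the name HERMES_STREAM_READ_TIMEOUT and the 120s default come from this report; that it is read from the environment, and the parsing shown here, are assumptions rather than the code at lines 1378-1380.

```python
import os

DEFAULT_STREAM_READ_TIMEOUT = 120.0  # seconds, per the report


def stream_read_timeout(env=None) -> float:
    """Return the configured read timeout, falling back to the default."""
    env = os.environ if env is None else env
    raw = env.get("HERMES_STREAM_READ_TIMEOUT", "")
    try:
        value = float(raw)
    except ValueError:
        # Missing or non-numeric values fall back to the default.
        return DEFAULT_STREAM_READ_TIMEOUT
    # Reject zero, negative, and NaN values rather than disabling the timeout.
    return value if value > 0 else DEFAULT_STREAM_READ_TIMEOUT
```

A longer timeout would only make the failure less frequent; it would not make the text-only path any less silent when it does occur.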

Suggested Fix

In the else branch (lines 2118-2125), add a user-visible warning similar to the tool-call case:

else:
    # Text-only partial delivery — warn user that response may be incomplete
    _warn = (
        "\n\n⚠ Stream timed out during response. "
        "The response may be incomplete. "
        "Ask me to retry if you want to continue."
    )
    _partial_text = (_partial_text or "") + _warn
    try:
        agent._fire_stream_delta(_warn)
    except Exception:
        pass
    logger.warning(
        "Partial stream delivered before error; ...",
        len(_partial_text or ""), result["error"],
    )
