openclaw - ✅(Solved) Fix [Bug]: Agent loop does not terminate after final response when Queued messages exist in context — causes full task replay [1 pull requests, 5 comments, 3 participants]

yine · 2026-03-20T08:58:10Z

[openclaw] Environment - OpenClaw version : 2026.3.12 6472949 confirmed on session dated 2026-03-12 - Channel : POPO IM custom integration via aigateway-xxxxx-… ## Environment - **OpenClaw version**: 2026.3.12 (6472949) (confirmed on session dated 2026-03-12) - **Channel**: POPO IM (custom integration via `aigateway-xxxxx-com`) - **Model**: kimi-k2.5 - **OS**: Linux server （Debian 12） ## Description When an external message arrives while the agent is busy executing a task, OpenClaw queues it as `[Queued messages while agent was busy]` and inserts it into the session's `parentId` chain. After the current task completes and the assistant outputs a final summary (no `toolCall`), **the agent loop does not terminate**. Instead, OpenClaw continues invoking the model, which sees the unhandled Queued user message in context and proceeds to fully replay the previous task — generating 10+ consecutive assistant messages within ~1.5 seconds, all without any real tool execution. Restarting OpenClaw does **not** fix the issue. Deleting the session JSONL files and restarting does. ## Reproduction Steps 1. Configure an agent with an external IM integration (POPO, Telegram, etc.) 2. Send a message that triggers a long multi-step task (multiple `toolCall`/`toolResult` cycles) 3. **While the agent is executing**, send a new message from the external channel 4. Observe `[Queued messages while agent was busy]` inserted into the session 5. Wait for the current task to complete (assistant outputs final text-only summary) 6. Observe: the agent does **not** stop — it immediately continues generating new assistant messages ## Expected Behavior After the assistant outputs a final response with no `toolCall`, the agent loop should terminate. Queued messages should be handled as a new turn, not as continuation of the current loop. ## Actual Behavior The agent loop continues. The model is called again with the full context (which includes the Queued user message) and begins replaying the previously completed task — outputting 10+ consecutive assistant messages at ~150ms intervals, none of which execute any tools. ## Session Log Evidence Analyzed from JSONL session logs (431 lines, session `e999671b-...`): | Metric | Value | |--------|-------| | Total `toolCall` records | 145 | | Total `toolResult` records in replay sequences | **0** | | `[Queued messages while agent was busy]` user messages | **20** | | Consecutive assistant sequences (length > 1) | 20 groups | | Longest consecutive assistant sequence | **11 messages in 1.36 seconds** | The replayed content is byte-for-byte identical to earlier messages in the session: ``` Line 149 (original): "好的，我来更新配置：1. 每天提醒加入本日天气..." Line 182 (replay): "好的，我来更新配置：1. 每天提醒加入本日天气..." ← identical ``` Timing of the 11-message replay sequence: ``` 03:10:34.296Z → normal final summary (line 181) 03:10:34.502Z → replay begins (206ms later) 03:10:34.653Z → (151ms) 03:10:34.772Z → (119ms) ... 03:10:35.655Z → ends (11 messages, 1.36 seconds total) ``` Replay sequence length scales with context size — when context had ~50 messages, sequences were 2–3 messages long; by the time context reached 180 messages, sequences grew to 11. ## Root Cause (Hypothesis) The agent loop termination condition does not distinguish between: 1. A Queued message already present in historical context (inserted mid-task) 2. A new user message arriving after the current task completed After the assistant produces a text-only final response, the loop should stop. Instead it appears to scan the full session chain for any unanswered user message — finds the Queued message — and invokes the model again. ## Why Deleting Session Files Fixes It The corrupted context chain is persisted in the JSONL file. On restart, OpenClaw resumes from the same poisoned context. Deleting the file removes the chain entirely, so the next session starts clean. ## Workaround Delete session JSONL files and restart. (Restart alone is insufficient.) ## Related Issues - **#30604** — `Followup queue delivers same message multiple times when agent is busy`: upstream/related at the queue layer. PR #46170 was opened to fix it but closed by the author without merging. - **#35092** — `/new does not flush queued messages`: corroborates why session deletion is required for recovery. - **#50892** — Discord collect-mode duplicate delivery: superficially similar but different mechanism; confirmed **not the same issue**. The core problem described here — **agent loop not terminating after final response when Queued messages exist in context** — does not appear to have an existing tracking issue. # PR #51298: fix(agent-loop): terminate loop after final response when queue items predate run start (#50956) - Repository: openclaw/openclaw - Author: ajitpratap0 - State: closed | merged: False - Link: https://github.com/openclaw/openclaw/pull/51298 ## Description (problem / solution / changelog) ## Summary - **Problem:** When queued messages exist in session history, the agent loop r

openclaw2026-03-20 08:58:10

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

openclaw/openclaw#50956•Fetched 2026-04-08 01:06:18

View on GitHub

Comments

Participants

Timeline

Reactions

Author

Participants

Timeline (top)

commented ×5cross-referenced ×2labeled ×2referenced ×1

Environment

OpenClaw version: 2026.3.12 (6472949) (confirmed on session dated 2026-03-12)
Channel: POPO IM (custom integration via aigateway-xxxxx-com)
Model: kimi-k2.5
OS: Linux server （Debian 12）

Description

When an external message arrives while the agent is busy executing a task, OpenClaw queues it as [Queued messages while agent was busy] and inserts it into the session's parentId chain. After the current task completes and the assistant outputs a final summary (no toolCall), the agent loop does not terminate. Instead, OpenClaw continues invoking the model, which sees the unhandled Queued user message in context and proceeds to fully replay the previous task — generating 10+ consecutive assistant messages within ~1.5 seconds, all without any real tool execution.

Restarting OpenClaw does not fix the issue. Deleting the session JSONL files and restarting does.

Reproduction Steps

Configure an agent with an external IM integration (POPO, Telegram, etc.)
Send a message that triggers a long multi-step task (multiple toolCall/toolResult cycles)
While the agent is executing, send a new message from the external channel
Observe [Queued messages while agent was busy] inserted into the session
Wait for the current task to complete (assistant outputs final text-only summary)
Observe: the agent does not stop — it immediately continues generating new assistant messages

Expected Behavior

After the assistant outputs a final response with no toolCall, the agent loop should terminate. Queued messages should be handled as a new turn, not as continuation of the current loop.

Actual Behavior

The agent loop continues. The model is called again with the full context (which includes the Queued user message) and begins replaying the previously completed task — outputting 10+ consecutive assistant messages at ~150ms intervals, none of which execute any tools.

Session Log Evidence

Analyzed from JSONL session logs (431 lines, session e999671b-...):

Metric	Value
Total `toolCall` records	145
Total `toolResult` records in replay sequences	0
`[Queued messages while agent was busy]` user messages	20
Consecutive assistant sequences (length > 1)	20 groups
Longest consecutive assistant sequence	11 messages in 1.36 seconds

The replayed content is byte-for-byte identical to earlier messages in the session:

Line 149 (original): "好的，我来更新配置：1. 每天提醒加入本日天气..."
Line 182 (replay):   "好的，我来更新配置：1. 每天提醒加入本日天气..."  ← identical

Timing of the 11-message replay sequence:

03:10:34.296Z → normal final summary (line 181)
03:10:34.502Z → replay begins (206ms later)
03:10:34.653Z → (151ms)
03:10:34.772Z → (119ms)
...
03:10:35.655Z → ends (11 messages, 1.36 seconds total)

Replay sequence length scales with context size — when context had ~50 messages, sequences were 2–3 messages long; by the time context reached 180 messages, sequences grew to 11.

Root Cause (Hypothesis)

The agent loop termination condition does not distinguish between:

A Queued message already present in historical context (inserted mid-task)
A new user message arriving after the current task completed

After the assistant produces a text-only final response, the loop should stop. Instead it appears to scan the full session chain for any unanswered user message — finds the Queued message — and invokes the model again.

Why Deleting Session Files Fixes It

The corrupted context chain is persisted in the JSONL file. On restart, OpenClaw resumes from the same poisoned context. Deleting the file removes the chain entirely, so the next session starts clean.

Workaround

Delete session JSONL files and restart. (Restart alone is insufficient.)

Related Issues

#30604 — Followup queue delivers same message multiple times when agent is busy: upstream/related at the queue layer. PR #46170 was opened to fix it but closed by the author without merging.
#35092 — /new does not flush queued messages: corroborates why session deletion is required for recovery.
#50892 — Discord collect-mode duplicate delivery: superficially similar but different mechanism; confirmed not the same issue.

The core problem described here — agent loop not terminating after final response when Queued messages exist in context — does not appear to have an existing tracking issue.

Root Cause

Root Cause (Hypothesis)

Code Example

Line 149 (original): "好的，我来更新配置：1. 每天提醒加入本日天气..."
  Line 182 (replay):   "好的，我来更新配置：1. 每天提醒加入本日天气..."  ← identical

---

03:10:34.296Z → normal final summary (line 181)
  03:10:34.502Z → replay begins (206ms later)
  03:10:34.653Z → (151ms)
  03:10:34.772Z → (119ms)
  ...
  03:10:35.655Z → ends (11 messages, 1.36 seconds total)

---

RAW_BUFFERClick to expand / collapse

Bug type

Behavior bug (incorrect output/state without crash)

Summary

Environment

OpenClaw version: 2026.3.12 (6472949) (confirmed on session dated 2026-03-12)
Channel: POPO IM (custom integration via aigateway-xxxxx-com)
Model: kimi-k2.5
OS: Linux server （Debian 12）

Description

Restarting OpenClaw does not fix the issue. Deleting the session JSONL files and restarting does.

Reproduction Steps

Configure an agent with an external IM integration (POPO, Telegram, etc.)
Send a message that triggers a long multi-step task (multiple toolCall/toolResult cycles)
While the agent is executing, send a new message from the external channel
Observe [Queued messages while agent was busy] inserted into the session
Wait for the current task to complete (assistant outputs final text-only summary)
Observe: the agent does not stop — it immediately continues generating new assistant messages

Expected Behavior

After the assistant outputs a final response with no toolCall, the agent loop should terminate. Queued messages should be handled as a new turn, not as continuation of the current loop.

Actual Behavior

Session Log Evidence

Analyzed from JSONL session logs (431 lines, session e999671b-...):

Metric	Value
Total `toolCall` records	145
Total `toolResult` records in replay sequences	0
`[Queued messages while agent was busy]` user messages	20
Consecutive assistant sequences (length > 1)	20 groups
Longest consecutive assistant sequence	11 messages in 1.36 seconds

The replayed content is byte-for-byte identical to earlier messages in the session:

Line 149 (original): "好的，我来更新配置：1. 每天提醒加入本日天气..."
Line 182 (replay):   "好的，我来更新配置：1. 每天提醒加入本日天气..."  ← identical

Timing of the 11-message replay sequence:

03:10:34.296Z → normal final summary (line 181)
03:10:34.502Z → replay begins (206ms later)
03:10:34.653Z → (151ms)
03:10:34.772Z → (119ms)
...
03:10:35.655Z → ends (11 messages, 1.36 seconds total)

Replay sequence length scales with context size — when context had ~50 messages, sequences were 2–3 messages long; by the time context reached 180 messages, sequences grew to 11.

Root Cause (Hypothesis)

The agent loop termination condition does not distinguish between:

A Queued message already present in historical context (inserted mid-task)
A new user message arriving after the current task completed

Why Deleting Session Files Fixes It

Workaround

Delete session JSONL files and restart. (Restart alone is insufficient.)

Related Issues

#30604 — Followup queue delivers same message multiple times when agent is busy: upstream/related at the queue layer. PR #46170 was opened to fix it but closed by the author without merging.
#35092 — /new does not flush queued messages: corroborates why session deletion is required for recovery.
#50892 — Discord collect-mode duplicate delivery: superficially similar but different mechanism; confirmed not the same issue.

The core problem described here — agent loop not terminating after final response when Queued messages exist in context — does not appear to have an existing tracking issue.

Steps to reproduce

Configure an agent with an external IM integration (POPO, Telegram, etc.)
Send a message that triggers a long multi-step task (multiple toolCall/toolResult cycles)
While the agent is executing, send a new message from the external channel
Observe [Queued messages while agent was busy] inserted into the session
Wait for the current task to complete (assistant outputs final text-only summary)
Observe: the agent does not stop — it immediately continues generating new assistant messages

Expected behavior

After the assistant outputs a final response with no toolCall, the agent loop should terminate. Queued messages should be handled as a new turn, not as continuation of the current loop.

Actual behavior

OpenClaw version

2026.3.12 (6472949)

Operating system

Debin12

Install method

No response

Model

kimi-k2.5

Provider / routing chain

openclaw -> aigw.xxx.com -> kimi

Additional provider/model setup details

No response

Logs, screenshots, and evidence

Impact and severity

No response

Additional information

No response

extent analysis

Fix Plan

To address the issue of the agent loop not terminating after a final response when queued messages exist in the context, we need to modify the termination condition to distinguish between queued messages already present in the historical context and new user messages arriving after the current task completed.

Here are the steps to implement the fix:

Modify the Agent Loop Termination Condition:
- Check if there are any new user messages that arrived after the current task completed.
- If yes, do not terminate the loop but instead handle the new message as a new turn.
- If not, and the current task has completed with a final text-only summary, terminate the loop.
Implement a Mechanism to Track New User Messages:
- Introduce a flag or a timestamp to mark when the current task started and completed.
- When a new user message arrives, check if it arrived after the current task completed. If so, mark it as a new message to be handled in a new turn.
Update the Context Handling:
- When handling a new user message, ensure that the context is updated correctly to reflect the new turn.
- Remove or ignore any queued messages that were part of the previous task's context to prevent replaying the previous task.

Example code snippet in Python to illustrate the modified termination condition and new message handling:

def check_termination_condition(current_task_completed, new_user_message_arrived):
    if current_task_completed and not new_user_message_arrived:
        # Terminate the loop if the task is completed and no new user message has arrived
        return True
    elif new_user_message_arrived:
        # Handle the new user message as a new turn
        handle_new_user_message()
        return False
    else:
        # Continue the loop if the task is not completed or a new user message has arrived
        return False

def handle_new_user_message():
    # Update the context to reflect the new turn
    update_context()
    # Remove or ignore any queued messages from the previous task's context
    clear_queued_messages()

# Example usage
current_task_completed = True
new_user_message_arrived = False

if check_termination_condition(current_task_completed, new_user_message_arrived):
    # Terminate the agent loop
    terminate_agent_loop()
else:
    # Continue the agent loop
    continue_agent_loop()

Verification

To verify that the fix worked, follow these steps:

Reproduce the Issue: Follow the reproduction steps provided in the issue description to reproduce the problem.
Apply the Fix: Implement the modified termination condition and new message handling mechanism as described in the fix plan.
Test the Fix: Repeat the reproduction steps after applying the fix to ensure that the agent loop terminates correctly after a final response when queued messages exist in the context.
Verify the Behavior: Observe the agent's

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

FAQ

Expected behavior

After the assistant outputs a final response with no toolCall, the agent loop should terminate. Queued messages should be handled as a new turn, not as continuation of the current loop.

#api #ssr #installation #tensor shape #autograd error #agent execution #callback error #memory management #API rate limit #retriever error

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

openclaw - ✅(Solved) Fix [Bug]: Agent loop does not terminate after final response when Queued messages exist in context — causes full task replay [1 pull requests, 5 comments, 3 participants]

Recommended Tools

GitHub issue graph ai analysis

Environment

Description

Reproduction Steps

Expected Behavior

Actual Behavior

Session Log Evidence

Root Cause (Hypothesis)

Why Deleting Session Files Fixes It

Workaround

Related Issues

Root Cause

Root Cause (Hypothesis)

Fix Action

Fix / Workaround

Workaround

PR fix notes

PR #51298: fix(agent-loop): terminate loop after final response when queue items predate run start (#50956)

Description (problem / solution / changelog)

Summary

Change Type (select all)

Scope (select all touched areas)

Linked Issue/PR

User-visible / Behavior Changes

Security Impact (required)

Repro + Verification

Environment

Steps

Expected

Actual (before fix)

Evidence

Human Verification (required)

Review Conversations

Compatibility / Migration

Failure Recovery (if this breaks)

Risks and Mitigations

Changed files

Code Example

Bug type

Summary

Environment

Description

Reproduction Steps

Expected Behavior

Actual Behavior

Session Log Evidence

Root Cause (Hypothesis)

Why Deleting Session Files Fixes It

Workaround

Related Issues

Steps to reproduce

Expected behavior

Actual behavior

OpenClaw version

Operating system

Install method

Model

Provider / routing chain

Additional provider/model setup details

Logs, screenshots, and evidence

Impact and severity

Additional information

extent analysis

Fix Plan

Verification

FAQ

Expected behavior

Still need to ship something?

RELATED_DISCOVERY

TRENDING