claude-code - 💡(How to fix) Fix [BUG] Sub-agent task-notification fires repeated <status>completed</status> events as the transcript grows after termination

Root Cause

When a background sub-agent (Agent tool with run_in_background: true) is declared completed by the harness watchdog OR exits gracefully, the harness continues to tail the sub-agent's JSONL transcript file. Every time the file grows (because the underlying subprocess is still running and emitting turns, or because the harness's own polling re-renders content), the orchestrator receives a fresh <task-notification> system message with the same <task-id> and <status>completed</status> but a new <result> body summarizing the latest content.

Fix Action

Fix / Workaround

Dispatch any background sub-agent: `Agent({ subagent_type: 'general-purpose', run_in_background: true, prompt: '<some task>' })`. Note the returned `agentId`.
Wait for the first `<status>completed</status>` notification. Capture its `<result>` content.
Manually append to the JSONL file: `echo '{"type":"assistant","timestamp":"2026-05-24T11:00:00Z","message":{"content":[{"type":"text","text":"fabricated content"}]}}' >> ~/.claude/projects/<hash>/subagents/agent-<id>.jsonl`
Expected: harness ignores the append; orchestrator receives no further notifications for that agent ID.
Actual: orchestrator receives a fresh `<task-notification>` with `<status>completed</status>` and the fabricated content.

Related issue: #61877 (sub-agent generation loops — the trigger for the appends). The bug I'm reporting here is independent: even if the agent exits cleanly, the harness has no defense against late writes to the transcript appearing as fresh completion events. I added an evidence comment to that issue: https://github.com/anthropics/claude-code/issues/61877#issuecomment-4528291470
Related issue: #50272 (stale symlink mtime). Adjacent but different — that's about diagnosing liveness; this is about the notification side.
Project-side postmortem with full timeline: https://github.com/metazen11/psde-os/blob/develop/reports/2026-05-24-runaway-subagent-postmortem.md
Local mitigation: a `detect_runaway_subagents.sh` shell script that scans for sub-agent JSONL files with recent mtimes and no process holding them open. Symptom-catcher; the harness fix would make it unnecessary.

Preflight Checklist

I have searched existing issues and this hasn't been reported yet
This is a single bug report (please file separate reports for different bugs)
I am using the latest version of Claude Code

What's Wrong?

The result: the orchestrator receives multiple "completed" notifications for the same agent, each with different content, often hours apart. Each subsequent body looks like a fresh, independent completion. There is no way for the orchestrator to know whether <status>completed</status> means "this is the real one" or "this is the seventh fake one."

In my incident, a single sub-agent fired at least 8 separate <status>completed</status> notifications across a 4-hour window. The first one was the real watchdog termination; the rest were the runaway agent's continued output (separately filed in the runaway / generation-loop class, #61877). Even after the agent process exited at hour 21, the harness fired one more "completed" notification a day later when something — re-render? cache miss? — touched the file's mtime.

What Should Happen?

After a sub-agent transitions to a terminal status (completed, stopped, failed, timed_out), the harness should:

Stop tailing the JSONL transcript. No more notifications from that agent ID, ever.
Refuse new appends. If the underlying subprocess somehow keeps writing (e.g., the watchdog killed the stream but the process didn't die — see related issues #61877, #50272), those appends should be ignored, not surfaced.
Distinguish termination causes in the status. Today <status>completed</status> covers both "model emitted terminal turn" and "watchdog gave up waiting." These have very different reliability implications for the orchestrator. Suggested: completed_gracefully, terminated_by_watchdog, terminated_by_kill. The orchestrator can then refuse to act on body content from terminated_by_watchdog.

Error Messages/Logs

No error — the failure mode is structural, not an error condition. Sample of what the orchestrator sees:

``` <task-notification> <task-id>aa4481de51c90f2ed</task-id> <status>completed</status>

<summary>Agent "Fix #701 forward-auth OIDC loop" completed</summary> <result>API Error: Overloaded</result> </task-notification> \`\`\`

(That first one fires within ~10 minutes — the legitimate stream termination.)

Then ~4 hours later:

``` <task-notification> <task-id>aa4481de51c90f2ed</task-id> <status>completed</status>

<summary>Agent "Fix #701 forward-auth OIDC loop" completed</summary> <result>I don't see a \`/plan\` skill in the available-skills list…</result> </task-notification> \`\`\`

Same agent ID. Same status. Completely different (and never-requested) "work." The orchestrator burned context trying to figure out what was happening.

Steps to Reproduce

This is hard to reproduce on demand because it requires a sub-agent that goes into a generation loop (see #61877). The reproducible part is the harness behavior:

Dispatch any background sub-agent: `Agent({ subagent_type: 'general-purpose', run_in_background: true, prompt: '<some task>' })`. Note the returned `agentId`.
Wait for the first `<status>completed</status>` notification. Capture its `<result>` content.
Manually append to the JSONL file: `echo '{"type":"assistant","timestamp":"2026-05-24T11:00:00Z","message":{"content":[{"type":"text","text":"fabricated content"}]}}' >> ~/.claude/projects/<hash>/subagents/agent-<id>.jsonl`
Expected: harness ignores the append; orchestrator receives no further notifications for that agent ID.
Actual: orchestrator receives a fresh `<task-notification>` with `<status>completed</status>` and the fabricated content.

I haven't yet tested step 4 — it's a clean reproduction harness, marking as TODO to confirm. The natural reproduction in my incident was a runaway agent doing the appends itself; if step 4's manual append doesn't reproduce, the harness's notification trigger is something other than mtime change (perhaps file size delta, or perhaps the inner JSONL parser re-runs on poll and noises out duplicates).

Claude Model

Opus (claude-opus-4-7[1m])

Is this a regression?

I don't know

Claude Code Version

2.1.133

Platform

Anthropic API

Operating System

macOS

Terminal/Shell

iTerm2

Additional Information

Related issue: #61877 (sub-agent generation loops — the trigger for the appends). The bug I'm reporting here is independent: even if the agent exits cleanly, the harness has no defense against late writes to the transcript appearing as fresh completion events. I added an evidence comment to that issue: https://github.com/anthropics/claude-code/issues/61877#issuecomment-4528291470
Related issue: #50272 (stale symlink mtime). Adjacent but different — that's about diagnosing liveness; this is about the notification side.
Project-side postmortem with full timeline: https://github.com/metazen11/psde-os/blob/develop/reports/2026-05-24-runaway-subagent-postmortem.md
Local mitigation: a `detect_runaway_subagents.sh` shell script that scans for sub-agent JSONL files with recent mtimes and no process holding them open. Symptom-catcher; the harness fix would make it unnecessary.

The load-bearing fix is: `<status>completed</status>` should be terminal — no more notifications, no more appends acted on, ever. Everything else is defense in depth.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix [BUG] Sub-agent task-notification fires repeated <status>completed</status> events as the transcript grows after termination

Recommended Tools

GitHub issue graph ai analysis

Error Message

Error Messages/Logs

Root Cause

Fix Action

Fix / Workaround

Preflight Checklist

What's Wrong?

What Should Happen?

Error Messages/Logs

Steps to Reproduce

Claude Model

Is this a regression?

Claude Code Version

Platform

Operating System

Terminal/Shell

Additional Information

Still need to ship something?

TRENDING