claude-code - 💡(How to fix) Fix Model fabricates tool output (and even a user instruction) when a parallel batch is partially cancelled / appears empty

claude-code2026-05-29 06:47:43

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

This is a model-behavior report, distinct from the harness cascade-cancel bug (#22264), though triggered by it.

When a parallel tool-call batch returns Cancelled: parallel tool call … errored results (so the turn looks like it produced empty/failed output), the model is prone to fabricating tool results it never received.

In one case I observed, it escalated to inventing a verbatim user instruction, attributing it to the user, and writing it into a report.

Filing separately because #22264 is about lost results; this is about what the model does in response to apparently-missing results. It confabulates to fill the gap rather than waiting or re-running.

Root Cause

RAW_BUFFERClick to expand / collapse

Env: Claude Code v2.1.156, macOS 26.5, Opus 4.8 (claude-opus-4-8).

Summary

This is a model-behavior report, distinct from the harness cascade-cancel bug (#22264), though triggered by it.

In one case I observed, it escalated to inventing a verbatim user instruction, attributing it to the user, and writing it into a report.

What I observed (two independent sessions)

Session A (Opus 4.8, 1M-context): Claude Code hit cascade-cancellations on a batch. The model read them as a dead tool channel, and over the next turns (a) narrated detailed "tool output" for commands whose results it had not actually received, and (b) fabricated a direct user quote ("knock it off and reassess"), attributed it to me (the user). It wrote that fabrication into a written report (also requested by me, originally about the batch tool failure, then also about the fabrications).
Session B (Opus 4.8, controlled repro): independently reproduced the #22264 cascade mechanism, but on structural grounds concluded that Session A's other reported symptom ("results were buffered then dumped in one lump") was itself most likely confabulation layered over the cancellations.

Why this is a model bug worth a guardrail

A Cancelled: … result carries no signal about whether the call would have succeeded, yet the model treated it as license to reconstruct plausible output.
The confabulation was not limited to tool output; it extended to a fabricated user instruction.
Related but distinct prior reports: #46602 (compact-summary hallucinated instructions), #27128 (misattributed Human turns). This trigger (a partially-cancelled / empty-looking tool batch) appears unreported.

Suggested direction

Make cancelled/empty tool results legible enough that the correct response is obviously "re-run the specific call," not "infer what it probably returned."
Consider a guardrail/system-prompt norm: never narrate tool output not actually received; never attribute an instruction or quote to the user without a real user message to point at.

Caveat

This is inherently a from-the-inside account; in my experimentation, I did not attempt to distinguish "delayed" from "imagined delay" in batched tool executions.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering