claude-code - 💡(How to fix) Fix Opus 4.8 (1M context) — frequent "tool_use malformed" client errors despite tools executing successfully

claude-code2026-05-29 15:33:15

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

Root Cause

Each failure costs one round-trip + a context-polluting "retry" prompt, and in a long session it accumulates. Worse, because the tool did execute, a careless user might double-execute side-effectful operations (e.g. re-create a PR, double-commit) if they don't notice the empty turn and just re-issue the request.

Fix Action

Workaround

Switch to Opus 4.7 (1M context) via /model.

RAW_BUFFERClick to expand / collapse

Environment

Model: claude-opus-4-8-1m (switched to claude-opus-4-7-1m as workaround mid-session — see below)
Claude Code version: 2.1.92
Session type: forked from a prior session (this is a variable I could not isolate solo — see Repro)
OS: Windows 11 + Git Bash
Workflow: long CI-debugging session with many tool calls (Bash, Edit, Read, WebFetch, subagents)

Symptom

Roughly 5 out of ~15 tool calls on Opus 4.8 fail with:

Your tool call was malformed and could not be parsed. Please retry.

The assistant turn appears empty to the user, and a UserPromptSubmit-style retry prompt arrives next turn. However, the tool actually executed on the server side — verified independently each time:

Edit: subsequent git diff confirms the file was modified exactly as intended
Bash (gh pr create): the PR was actually created on GitHub (visible at the returned URL)
Subagent (Agent tool): subagent ran to completion and the structured result was retained in the next assistant turn's context

So the tool call reached the server, ran, and produced correct output — only the client-side parse / surfacing of the assistant message that contained the tool call failed.

Repro / variable isolation

After ~5 occurrences across an hour on Opus 4.8, I switched the same session to Opus 4.7 (1M) via /model. 0 occurrences across 6+ subsequent tool calls in the same session, same workflow, same kinds of tools (Bash, Read, Edit, gh CLI). Strong signal that the regression is model-side rather than environmental.

The one variable I could not isolate solo: the session was forked from a prior conversation. I don't know whether a fresh 4.8 session (no forked history) reproduces — would appreciate help confirming. Three possible outcomes:

Fresh 4.8 also reproduces → model-level regression in 4.8
Only forked 4.8 reproduces → interaction between 4.8 and replayed conversation history
Neither → close

Hypothesis

Plausibly XML close-tag drift in long / nested <parameter> content — e.g. the model emitting </invoke> where </parameter> was expected, causing the client-side parser to drop or mis-attribute the tool_use block. There is a known mode where long args with mixed content trip this; 4.8 appears to have raised the failure rate noticeably.

Observed triggering tool calls in this session:

Edit with multi-line new_string containing mixed CJK comments + YAML code
Bash with HEREDOC commit messages (multi-line, CJK + ASCII)
WebFetch with a longer prompt arg
(one occurrence after a system-reminder injection arrived between the user message and my tool call — could be coincidence, flagging in case)

Workaround

Switch to Opus 4.7 (1M context) via /model.

Impact

Happy to provide

Session transcript excerpts (contains code from a private repo; can redact)
Specific timestamps / turn boundaries on request

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix Opus 4.8 (1M context) — frequent "tool_use malformed" client errors despite tools executing successfully

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Workaround

Environment

Symptom

Repro / variable isolation

Hypothesis

Workaround

Impact

Happy to provide

Still need to ship something?

TRENDING