claude-code - 💡(How to fix) Fix Cowork mode: hallucinated MCP tool name causes silent SDK deadlock until manual restart [1 comments, 2 participants]

JesperLive · 2026-04-27T23:48:10Z

[claude-code] In Claude Desktop Cowork mode, when the model emits a tool use block with a tool name that does not exist in the loaded MCP tool schema concrete… In Claude Desktop Cowork mode, when the model emits a `tool_use` block with a tool name that does not exist in the loaded MCP tool schema (concrete example below: `mcp__Desktop_Commander__lines`, when the Desktop Commander MCP has no `lines` tool), the Agent SDK holds the tool-use frame open indefinitely instead of returning a tool-not-found error. The session appears "thinking" forever with no in-band recovery path. The user must kill the host process to break the deadlock. ## Summary In Claude Desktop Cowork mode, when the model emits a `tool_use` block with a tool name that does not exist in the loaded MCP tool schema (concrete example below: `mcp__Desktop_Commander__lines`, when the Desktop Commander MCP has no `lines` tool), the Agent SDK holds the tool-use frame open indefinitely instead of returning a tool-not-found error. The session appears "thinking" forever with no in-band recovery path. The user must kill the host process to break the deadlock. ## Environment - Claude Desktop (Cowork mode) on Windows 11 - Model: `claude-opus-4-7` (advertised in env) - Claude Code binary: `2.1.111` and `2.1.119` - Desktop Commander MCP: v0.2.39 - Encountered 2026-04-27, session id `local_04615dc0-2f23-4ac1-a07a-a4906c8dab84`, CLI session `5410d741-ed5f-4162-9da2-2c5502618492` ## Reproduction The model issues a `tool_use` like: ``` { "name": "mcp__Desktop_Commander__lines", "input": {"file": "...path to a .md file...", "start": "880", "end": "1015"} } ``` The Desktop Commander MCP has no `lines` tool. The model conflated the name with `cowork_util.py lines`, an unrelated Python subcommand the user invokes via `start_process`. This kind of confusion is the well-known LLM "hallucinated tool name" failure mode and will happen periodically with any non-trivial multi-MCP setup. ## Observed behavior - No `tool_result` event written to the session audit log for that `tool_use`. - No error event surfaced to the model or the UI. - UI shows "thinking" state indefinitely. - Session sat in tool-waiting state for **63 minutes 31 seconds** (measured: 21:43:26 UTC tool emit -> 22:46:57 UTC user "Stuck?" message). - Audit log entry 90 has the `tool_use`; no matching `tool_result` entry exists anywhere later in the file. - User typed "Stuck?" three times, then a Todo prompt, then explicitly asked the model to continue. None of these broke the deadlock because the SDK was waiting on a tool result that would never come. - Recovery required closing and restarting Claude Desktop entirely. After restart, the SDK reloaded the transcript and the session resumed normally. ## Expected behavior The MCP server (or the SDK layer above it) should immediately return a `tool_use_error` / `tool_not_found` result when the tool name does not exist in the loaded schema, allowing the model to recover and route around it. A bounded timeout on tool-use frames would also work as a defensive backstop. ## Impact Any time a model hallucinates an MCP tool name -- a known LLM failure mode, especially when adjacent legitimate names are similar -- the session is bricked until the user manually kills the host process. With long contexts and multi-MCP setups, this is not a rare edge case. The user-visible failure mode is also confusing because the UI just shows "thinking" with no indication that the SDK is stuck waiting on a tool that can never resolve. ## Evidence available - `audit.jsonl` entry 90 (full JSON of the unresolved `tool_use` block) at the local-agent-mode-sessions session path. - `main.log` shows the gap and the subsequent Claude Desktop restart at 00:05:43 local. - No `tool_result` entry for the offending `tool_use_id` exists anywhere in the audit log. Happy to share the redacted audit JSON if useful. ## Suggested fix priorities 1. **SDK**: validate every `tool_use` name against the loaded schema before forwarding to the MCP server. On mismatch, synthesize a `tool_use_error` result and return it to the model so it can self-correct in the same turn. 2. **SDK**: enforce a configurable timeout on every tool-use frame (60s default), surfacing a recoverable error on expiry. 3. **UI**: when the SDK has been waiting on a tool result for >30s, surface a visual indicator distinguishing "tool running" from "tool stuck", and offer a "cancel waiting tool call" affordance.

claude-code2026-04-27 23:48:10

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

anthropics/claude-code#54108•Fetched 2026-04-28 06:39:03

View on GitHub

Comments

Participants

Timeline

Reactions

Author

JesperLive

Participants

github-actions[bot]

JesperLive

Timeline (top)

labeled ×4commented ×1

In Claude Desktop Cowork mode, when the model emits a tool_use block with a tool name that does not exist in the loaded MCP tool schema (concrete example below: mcp__Desktop_Commander__lines, when the Desktop Commander MCP has no lines tool), the Agent SDK holds the tool-use frame open indefinitely instead of returning a tool-not-found error. The session appears "thinking" forever with no in-band recovery path. The user must kill the host process to break the deadlock.

Error Message

No tool_result event written to the session audit log for that tool_use.
No error event surfaced to the model or the UI.
UI shows "thinking" state indefinitely.
Session sat in tool-waiting state for 63 minutes 31 seconds (measured: 21:43:26 UTC tool emit -> 22:46:57 UTC user "Stuck?" message).
Audit log entry 90 has the tool_use; no matching tool_result entry exists anywhere later in the file.
User typed "Stuck?" three times, then a Todo prompt, then explicitly asked the model to continue. None of these broke the deadlock because the SDK was waiting on a tool result that would never come.
Recovery required closing and restarting Claude Desktop entirely. After restart, the SDK reloaded the transcript and the session resumed normally.

Root Cause

No tool_result event written to the session audit log for that tool_use.
No error event surfaced to the model or the UI.
UI shows "thinking" state indefinitely.
Session sat in tool-waiting state for 63 minutes 31 seconds (measured: 21:43:26 UTC tool emit -> 22:46:57 UTC user "Stuck?" message).
Audit log entry 90 has the tool_use; no matching tool_result entry exists anywhere later in the file.
User typed "Stuck?" three times, then a Todo prompt, then explicitly asked the model to continue. None of these broke the deadlock because the SDK was waiting on a tool result that would never come.
Recovery required closing and restarting Claude Desktop entirely. After restart, the SDK reloaded the transcript and the session resumed normally.

Code Example

{
  "name": "mcp__Desktop_Commander__lines",
  "input": {"file": "...path to a .md file...", "start": "880", "end": "1015"}
}

RAW_BUFFERClick to expand / collapse

Summary

Environment

Claude Desktop (Cowork mode) on Windows 11
Model: claude-opus-4-7 (advertised in env)
Claude Code binary: 2.1.111 and 2.1.119
Desktop Commander MCP: v0.2.39
Encountered 2026-04-27, session id local_04615dc0-2f23-4ac1-a07a-a4906c8dab84, CLI session 5410d741-ed5f-4162-9da2-2c5502618492

Reproduction

The model issues a tool_use like:

{
  "name": "mcp__Desktop_Commander__lines",
  "input": {"file": "...path to a .md file...", "start": "880", "end": "1015"}
}

The Desktop Commander MCP has no lines tool. The model conflated the name with cowork_util.py lines, an unrelated Python subcommand the user invokes via start_process. This kind of confusion is the well-known LLM "hallucinated tool name" failure mode and will happen periodically with any non-trivial multi-MCP setup.

Observed behavior

No tool_result event written to the session audit log for that tool_use.
No error event surfaced to the model or the UI.
UI shows "thinking" state indefinitely.
Session sat in tool-waiting state for 63 minutes 31 seconds (measured: 21:43:26 UTC tool emit -> 22:46:57 UTC user "Stuck?" message).
Audit log entry 90 has the tool_use; no matching tool_result entry exists anywhere later in the file.
User typed "Stuck?" three times, then a Todo prompt, then explicitly asked the model to continue. None of these broke the deadlock because the SDK was waiting on a tool result that would never come.
Recovery required closing and restarting Claude Desktop entirely. After restart, the SDK reloaded the transcript and the session resumed normally.

Expected behavior

The MCP server (or the SDK layer above it) should immediately return a tool_use_error / tool_not_found result when the tool name does not exist in the loaded schema, allowing the model to recover and route around it. A bounded timeout on tool-use frames would also work as a defensive backstop.

Impact

Any time a model hallucinates an MCP tool name -- a known LLM failure mode, especially when adjacent legitimate names are similar -- the session is bricked until the user manually kills the host process. With long contexts and multi-MCP setups, this is not a rare edge case. The user-visible failure mode is also confusing because the UI just shows "thinking" with no indication that the SDK is stuck waiting on a tool that can never resolve.

Evidence available

audit.jsonl entry 90 (full JSON of the unresolved tool_use block) at the local-agent-mode-sessions session path.
main.log shows the gap and the subsequent Claude Desktop restart at 00:05:43 local.
No tool_result entry for the offending tool_use_id exists anywhere in the audit log.

Happy to share the redacted audit JSON if useful.

Suggested fix priorities

SDK: validate every tool_use name against the loaded schema before forwarding to the MCP server. On mismatch, synthesize a tool_use_error result and return it to the model so it can self-correct in the same turn.
SDK: enforce a configurable timeout on every tool-use frame (60s default), surfacing a recoverable error on expiry.
UI: when the SDK has been waiting on a tool result for >30s, surface a visual indicator distinguishing "tool running" from "tool stuck", and offer a "cancel waiting tool call" affordance.

extent analysis

TL;DR

The most likely fix is to validate every tool_use name against the loaded schema in the SDK before forwarding to the MCP server and return a tool_use_error result on mismatch.

Guidance

Validate tool_use names against the loaded schema to prevent forwarding non-existent tool names to the MCP server.
Implement a configurable timeout on every tool-use frame to surface a recoverable error on expiry.
Consider surfacing a visual indicator in the UI when the SDK has been waiting on a tool result for an extended period, offering a "cancel waiting tool call" affordance.
Review the audit.jsonl and main.log files to understand the sequence of events leading to the deadlock.

Example

No code snippet is provided as the issue does not contain sufficient information to generate a specific code example.

Notes

The suggested fix priorities provided in the issue are a good starting point, but the exact implementation details may vary depending on the specific requirements and constraints of the system.

Recommendation

Apply the suggested fix priorities, starting with validating every tool_use name against the loaded schema in the SDK, as this addresses the root cause of the issue and provides a clear path for the model to recover from hallucinated tool names.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

FAQ

Expected behavior

#vector store #embedding generation #cache error #pipeline error #runtime error

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix Cowork mode: hallucinated MCP tool name causes silent SDK deadlock until manual restart [1 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Code Example

Summary

Environment

Reproduction

Observed behavior

Expected behavior

Impact

Evidence available

Suggested fix priorities

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

FAQ

Expected behavior

Still need to ship something?

TRENDING

claude-code - 💡(How to fix) Fix Cowork mode: hallucinated MCP tool name causes silent SDK deadlock until manual restart [1 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Code Example

Summary

Environment

Reproduction

Observed behavior

Expected behavior

Impact

Evidence available

Suggested fix priorities

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

FAQ

Expected behavior

Still need to ship something?

RELATED_DISCOVERY

TRENDING