claude-code - 💡(How to fix) Fix Consider supporting Cua Driver for window-scoped/background computer use [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
anthropics/claude-code#55496Fetched 2026-05-03 04:51:50
View on GitHub
Comments
0
Participants
1
Timeline
3
Reactions
0
Author
Participants
Timeline (top)
labeled ×3

Root Cause

Hi Claude Code team - I work on Cua and cua-driver, an MIT-licensed native macOS automation driver for agents. I am opening this issue because I think it could be useful for Claude Code's computer-use story, especially for window-scoped or background UI automation.

Code Example

claude mcp add --transport stdio cua-driver -- cua-driver mcp

---

claude mcp add --transport stdio cua-computer-use -- cua-driver mcp --claude-code-computer-use-compat
RAW_BUFFERClick to expand / collapse

Hi Claude Code team - I work on Cua and cua-driver, an MIT-licensed native macOS automation driver for agents. I am opening this issue because I think it could be useful for Claude Code's computer-use story, especially for window-scoped or background UI automation.

Cua Driver is designed around controlling specific application windows instead of the whole desktop. The model/client can list apps and windows, target a pid plus window_id, take a screenshot of that window only, and send clicks/keys in that same window-local coordinate frame. That avoids exposing unrelated desktop content and makes it much easier to keep the coordinate system stable.

Why this may be useful for Claude Code:

  • Window-only screenshots instead of full-desktop screenshots.
  • App/window targeting via pid and window_id.
  • Pixel clicks that are tied to the returned window screenshot coordinate frame.
  • Accessibility-backed controls where useful, with vision-only fallback where needed.
  • Background-ish workflows that do not require constantly bringing unrelated windows to the front.
  • MIT license, so Claude Code could either call Cua Driver directly, depend on it, or reuse only the parts that are useful.

Cua Driver already works with Claude Code today as an MCP server. Standard registration looks like:

claude mcp add --transport stdio cua-driver -- cua-driver mcp

We also added a Claude Code compatibility mode for users who want Claude Code's vision/computer-use-style flow to ground on Cua Driver screenshots:

claude mcp add --transport stdio cua-computer-use -- cua-driver mcp --claude-code-computer-use-compat

That compatibility mode keeps the normal Cua Driver MCP tools and changes only the screenshot tool. The compatibility screenshot requires pid and window_id, captures that target window only, and returns the window-local pixel coordinate frame.

The reason for that compatibility mode is an integration observation: Claude Code appears to treat an MCP screenshot tool named like mcp__cua-computer-use__screenshot as an image-grounding cue for computer-use-style calls. In contrast, calling the Cua Driver CLI directly, or registering only the generic cua-driver MCP server, does not appear to trigger the same vision/computer-use-style behavior. I may be misunderstanding the intended integration contract here, but if this behavior is intentional, it would be helpful to document a supported way for third-party MCP servers to opt into this flow.

The relevant Cua Driver docs are here:

What I am asking for:

  1. Consider using Cua Driver, or parts of it, as a native macOS backend for Claude Code computer-use-style workflows.
  2. If Claude Code has a preferred/supported MCP shape for computer-use-style grounding, please document it so external MCP servers can integrate without relying on naming conventions.
  3. If the current computer-use naming behavior is accidental or reserved, it would be useful to know what third-party tools should do instead.

I am happy to adjust Cua Driver's MCP compatibility layer if Claude Code has a preferred interface contract. The main goal is to make window-scoped, privacy-conscious local computer use work reliably without forcing full-desktop screenshots or fragile coordinate assumptions.

extent analysis

TL;DR

Consider documenting a supported MCP shape for computer-use-style grounding to enable seamless integration with third-party MCP servers like Cua Driver.

Guidance

  • Review the current integration contract between Claude Code and MCP servers to determine if the computer-use naming behavior is intentional or accidental.
  • If intentional, document the preferred MCP shape for computer-use-style grounding to allow external MCP servers to integrate without relying on naming conventions.
  • If accidental, provide guidance on the recommended approach for third-party tools to opt into the computer-use-style flow.
  • Explore the possibility of using Cua Driver or its components as a native macOS backend for Claude Code computer-use-style workflows.

Example

No code snippet is provided as the issue is focused on integration and documentation rather than code implementation.

Notes

The solution depends on the intentions behind the current computer-use naming behavior in Claude Code, which is unclear from the issue description.

Recommendation

Apply workaround: Document the preferred MCP shape for computer-use-style grounding to enable seamless integration with third-party MCP servers like Cua Driver. This will provide clarity and a standardized approach for external tools to integrate with Claude Code.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

claude-code - 💡(How to fix) Fix Consider supporting Cua Driver for window-scoped/background computer use [1 participants]