hermes - 💡(How to fix) Fix cua-driver ignores auxiliary.vision config, uses main session model for image analysis [1 pull requests]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…

Error Message

🔌 Provider: openrouter Model: tencent/hy3-preview 📝 Error: HTTP 404: No endpoints found that support image input

Fix Action

Fixed

Code Example

🔌 Provider: openrouter  Model: tencent/hy3-preview
📝 Error: HTTP 404: No endpoints found that support image input

---

model:
  default: tencent/hy3-preview
  provider: openrouter
auxiliary:
  vision:
    provider: openrouter
    model: google/gemini-2.5-flash

---

2026-05-11 ... ⚠️ API call failed (attempt1/3): NotFoundError [HTTP 404]
   🔌 Provider: openrouter  Model: tencent/hy3-preview
   📝 Error: HTTP 404: No endpoints found that support image input
RAW_BUFFERClick to expand / collapse

Bug Description

When using computer_use tool (backed by cua-driver), the image analysis for SOM (Set-of-Mark) mode ignores the auxiliary.vision configuration and instead uses the main session model. This causes 404 errors when the main model doesn't support image input.

Steps to Reproduce

  1. Configure Hermes Agent with:
    • Main model: tencent/hy3-preview (no image support on OpenRouter)
    • auxiliary.vision.provider: openrouter
    • auxiliary.vision.model: google/gemini-2.5-flash
  2. Enable computer_use toolset
  3. Call computer_use with action='capture', mode='som'

Expected Behavior

cua-driver should route the image analysis request to the model specified in auxiliary.vision (google/gemini-2.5-flash).

Actual Behavior

cua-driver attempts to use the main session model (tencent/hy3-preview) for image analysis, resulting in:

🔌 Provider: openrouter  Model: tencent/hy3-preview
📝 Error: HTTP 404: No endpoints found that support image input

Configuration

model:
  default: tencent/hy3-preview
  provider: openrouter
auxiliary:
  vision:
    provider: openrouter
    model: google/gemini-2.5-flash

Log Evidence

2026-05-11 ... ⚠️ API call failed (attempt1/3): NotFoundError [HTTP 404]
   🔌 Provider: openrouter  Model: tencent/hy3-preview
   📝 Error: HTTP 404: No endpoints found that support image input

Suggested Fix

Update cua-driver / computer_use tool to check for auxiliary.vision configuration and use that model/provider for image analysis tasks instead of the main session model.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

hermes - 💡(How to fix) Fix cua-driver ignores auxiliary.vision config, uses main session model for image analysis [1 pull requests]