openclaw - 💡(How to fix) Fix [Feature] Add support for the video input type

Summary

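Add support for the video input type in the model configuration, so that large models with video understanding (such as Volcengine Doubao, GPT-4V, etc.) can be called.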

Problem to solve

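The OpenClaw model configuration currently supports only the "text" and "image" input types. When the video type is configured, the service fails to start and reports this error: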

models.providers.xxx.models.0.input.2: Invalid input (allowed: "text", "image")

As a result, multimodal large models with video understanding from Volcengine, OpenAI, and other providers cannot be used.

Proposed solution

Add support for the "video" input type to the input field of the model configuration.

Allowed values:

  • text (already supported)
  • image (already supported)
  • video (new)
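
The proposed change amounts to widening an allow-list. The following is a minimal sketch of that idea, not OpenClaw's actual validation code, which is not shown in this issue. The config shape is inferred only from the error path `models.providers.xxx.models.0.input.2`; the function name, the constant names, and the `id` key are hypothetical.

```python
from typing import Any

# Hypothetical allow-lists; OpenClaw's real schema definition is not shown in the issue.
CURRENT_INPUT_TYPES = ("text", "image")
PROPOSED_INPUT_TYPES = ("text", "image", "video")


def validate_model_inputs(config: dict[str, Any], allowed: tuple[str, ...]) -> list[str]:
    """Return one error per disallowed input value, formatted like the reported message."""
    errors = []
    providers = config.get("models", {}).get("providers", {})
    for provider_name, provider in providers.items():
        for model_index, model in enumerate(provider.get("models", [])):
            for input_index, value in enumerate(model.get("input", [])):
                if value not in allowed:
                    allowed_text = ", ".join(f'"{item}"' for item in allowed)
                    errors.append(
                        f"models.providers.{provider_name}.models.{model_index}"
                        f".input.{input_index}: Invalid input (allowed: {allowed_text})"
                    )
    return errors


# Assumed config shape; the "id" key is illustrative only.
example_config = {
    "models": {
        "providers": {
            "xxx": {
                "models": [
                    {
                        "id": "doubao-seed-2-0-pro-260215",
                        "input": ["text", "image", "video"],
                    }
                ]
            }
        }
    }
}

if __name__ == "__main__":
    # With the current allow-list the third input entry is rejected.
    print(validate_model_inputs(example_config, CURRENT_INPUT_TYPES))
    # With "video" added, the same config passes.
    print(validate_model_inputs(example_config, PROPOSED_INPUT_TYPES))
```

Under these assumptions, the only difference between the failing and passing cases is the extra `"video"` entry in the allow-list. Sending video to a provider would need further work beyond this validation step.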

According to the Volcengine documentation (https://www.volcengine.com/docs/82379/1362931), video can be passed in through the Files API (up to 512 MB) or as Base64-encoded data (up to 50 MB).
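
The two size limits quoted above suggest a simple rule for picking how to send a video. The sketch below is a hypothetical helper, not part of OpenClaw or the Volcengine SDK. It assumes 1 MB means 1024 × 1024 bytes and that the limits apply to the raw file size; neither is confirmed in this issue. Preferring Base64 for small files is also only an illustrative choice.

```python
# Limits as quoted in the issue; the byte interpretation is an assumption.
BASE64_MAX_BYTES = 50 * 1024 * 1024
FILES_API_MAX_BYTES = 512 * 1024 * 1024


def choose_video_transport(size_bytes: int) -> str:
    """Pick a transport label for a video of the given size.

    Returns "base64" for files within the Base64 limit, "files_api" for
    larger files within the Files API limit, and raises ValueError otherwise.
    """
    if size_bytes < 0:
        raise ValueError("size_bytes must be non-negative")
    if size_bytes <= BASE64_MAX_BYTES:
        return "base64"
    if size_bytes <= FILES_API_MAX_BYTES:
        return "files_api"
    raise ValueError("video exceeds the 512 MB Files API limit quoted in the issue")


if __name__ == "__main__":
    for size_mb in (10, 200):
        print(size_mb, "MB ->", choose_video_transport(size_mb * 1024 * 1024))
```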

Alternatives considered

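  1. Write a script that calls the Volcengine API directly: requires extra development and is not integrated with OpenClaw.
  2. Wait for a future OpenClaw release: the timeline is uncertain.
  3. Extract image frames and use the image input: a poor experience that loses temporal information.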

Impact

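Affected: users who need video understanding
Severity: medium (blocks the feature)
Frequency: every time video understanding is used
Consequence: video understanding models cannot be configured in OpenClaw, so external scripts must be used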

Evidence/examples

No response

Additional information

OpenClaw version: 2026.3.24
Volcengine models: doubao-seed-2-0-pro-260215, doubao-seed-2-0-lite-260215

Related documentation:
