openclaw - 💡(How to fix) Fix Feature Request: Native local Whisper integration for audio transcription [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
openclaw/openclaw#59192Fetched 2026-04-08 02:27:43
View on GitHub
Comments
0
Participants
1
Timeline
1
Reactions
0
Timeline (top)
subscribed ×1

Add native support for local OpenAI Whisper audio transcription, similar to the existing Groq integration, but without requiring external API calls.

Root Cause

Add native support for local OpenAI Whisper audio transcription, similar to the existing Groq integration, but without requiring external API calls.

RAW_BUFFERClick to expand / collapse

Feature Request

Summary

Add native support for local OpenAI Whisper audio transcription, similar to the existing Groq integration, but without requiring external API calls.

Current Situation

  • Groq audio transcription works but requires paid API access
  • Local Whisper CLI works via tool but requires approval for every voice message
  • No way to allowlist Whisper commands due to variable file paths in arguments

Desired Behavior

Enable automatic audio transcription for incoming voice messages using locally installed Whisper, without requiring exec approvals.

Proposed Configuration

Use Case

Users who want privacy-friendly, cost-free voice message transcription on their self-hosted OpenClaw setup (e.g., Raspberry Pi).

Technical Notes

  • Whisper CLI is already installable via
  • Requires ffmpeg for audio format conversion
  • Similar integration pattern as Groq provider, but calling local binary instead of API

Benefits

  • ✅ No external API costs
  • ✅ Privacy (audio stays local)
  • ✅ Works offline
  • ✅ No exec approval prompts for voice messages

Related


Thank you for considering this feature!

extent analysis

TL;DR

Implement a local OpenAI Whisper audio transcription integration, similar to the existing Groq integration, to enable automatic transcription without requiring external API calls or exec approvals.

Guidance

  • Investigate modifying the existing Groq audio transcription code to call the local Whisper binary instead of making an API call, leveraging the similar integration pattern.
  • Ensure that the Whisper CLI and ffmpeg are properly installed and configured on the system, as they are required for audio format conversion and transcription.
  • Explore implementing allowlisting for Whisper commands to prevent variable file paths in arguments from causing issues, potentially by using a fixed directory for temporary audio files.
  • Review the local Whisper skill implementation for insights into integrating Whisper with the OpenClaw setup.

Example

No explicit code example can be provided without more context, but the Groq audio transcription documentation and local Whisper skill code can serve as references for implementing the local Whisper integration.

Notes

The success of this integration depends on the compatibility of the Whisper CLI with the OpenClaw setup and the ability to properly handle audio file paths and conversions.

Recommendation

Apply a workaround by modifying the existing Groq integration to use the local Whisper binary, as this approach leverages existing code and minimizes the introduction of new dependencies or external API calls.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING