openclaw - 💡(How to fix) Fix [Bug]: Telegram voice note transcription fails with Deepgram — audio arrives as raw <media:audio> without transcript [1 participants]

GitHub stats
openclaw/openclaw#59864 · Fetched 2026-04-08 02:39:38
Comments: 0 · Participants: 1 · Timeline events: 3 (subscribed ×2, renamed ×1) · Reactions: 2


Summary

Telegram voice notes are no longer transcribed via the configured Deepgram provider. Deepgram transcription appears to be completely broken — it never runs.

When transcription does appear to work, it is because OpenClaw silently falls back to the local Whisper CLI (via bash). This fallback only works for agents with unrestricted exec permissions. Agents with restricted exec (e.g. exec.security: "allowlist") get zero transcription because they cannot run the Whisper fallback.

This affects all agents on the same OpenClaw instance across multiple Telegram bot accounts.

Voice transcription still works fine on Discord.

Environment

  • OpenClaw version: 2026.4.1
  • OS: macOS 15.5 (Darwin 24.6.0, arm64, Mac Mini M4)
  • Node.js: v22.22.2
  • Install method: npm global
  • Telegram accounts: 4 bot accounts, 2 actively tested
  • Transcription provider: Deepgram nova-3 (configured explicitly)

Configuration

{
  "tools": {
    "media": {
      "audio": {
        "enabled": true,
        "language": "it",
        "providerOptions": {
          "deepgram": {
            "detect_language": false,
            "punctuate": true,
            "smart_format": true
          }
        },
        "models": [
          {
            "provider": "deepgram",
            "model": "nova-3"
          }
        ]
      }
    }
  }
}

DEEPGRAM_API_KEY is set in ~/.openclaw/.env and was working previously.
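
Before digging into the gateway, it can help to rule out the key itself. The following is a minimal sketch, assuming ~/.openclaw/.env uses plain KEY=value lines (optionally prefixed with "export"); the helper name read_env_value is hypothetical and is not part of OpenClaw.

```python
from pathlib import Path
from typing import Optional


def read_env_value(env_text: str, name: str) -> Optional[str]:
    """Return the first value assigned to `name` in dotenv-style text, or None."""
    for raw_line in env_text.splitlines():
        line = raw_line.strip()
        # Skip blank lines and comments.
        if not line or line.startswith("#"):
            continue
        # Tolerate an optional leading "export ".
        if line.startswith("export "):
            line = line[len("export "):].lstrip()
        key, sep, value = line.partition("=")
        if sep and key.strip() == name:
            # Drop surrounding whitespace and simple quoting.
            return value.strip().strip("'\"")
    return None


if __name__ == "__main__":
    env_path = Path.home() / ".openclaw" / ".env"
    if not env_path.exists():
        print(f"{env_path} not found")
    else:
        key = read_env_value(env_path.read_text(), "DEEPGRAM_API_KEY")
        # Never print the key itself, only whether it is present.
        print("DEEPGRAM_API_KEY is set" if key else "DEEPGRAM_API_KEY is missing or empty")
```

This only confirms that the file contains a non-empty value; it says nothing about whether the gateway process actually loads that file.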

Steps to reproduce

  1. Configure OpenClaw with Telegram bot + Deepgram audio transcription (as above)
  2. Send a voice note to the bot via Telegram
  3. Observe that Deepgram is never called — the voice note arrives as raw <media:audio> without Transcript:
  4. If the agent has full exec permissions, OpenClaw may silently fall back to Whisper CLI, masking the Deepgram failure
  5. If the agent has restricted exec (allowlist), no transcription occurs at all
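
To separate "Deepgram is broken" from "OpenClaw never calls Deepgram", the same voice note can be sent to Deepgram directly, outside OpenClaw. The sketch below assumes the key is exported as DEEPGRAM_API_KEY and that the voice note has been saved locally as an .ogg file; it uses Deepgram's pre-recorded endpoint (POST /v1/listen) with the model, language, and options from the configuration above. The function names are illustrative only.

```python
import json
import os
import sys
import urllib.parse
import urllib.request

DEEPGRAM_LISTEN_URL = "https://api.deepgram.com/v1/listen"


def build_listen_url(model: str, language: str, options: dict) -> str:
    """Build the request URL, rendering booleans as lowercase true/false."""
    params = {"model": model, "language": language}
    for name, value in options.items():
        params[name] = str(value).lower() if isinstance(value, bool) else str(value)
    return DEEPGRAM_LISTEN_URL + "?" + urllib.parse.urlencode(params)


def extract_transcript(response: dict) -> str:
    """Pull the first alternative's transcript out of a Deepgram response body."""
    channels = response.get("results", {}).get("channels", [])
    if not channels or not channels[0].get("alternatives"):
        return ""
    return channels[0]["alternatives"][0].get("transcript", "")


def transcribe(path: str, api_key: str) -> str:
    """Upload one audio file to Deepgram and return its transcript."""
    url = build_listen_url("nova-3", "it", {"punctuate": True, "smart_format": True})
    with open(path, "rb") as audio_file:
        audio = audio_file.read()
    request = urllib.request.Request(
        url,
        data=audio,
        method="POST",
        headers={"Authorization": f"Token {api_key}", "Content-Type": "audio/ogg"},
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return extract_transcript(json.load(response))


if __name__ == "__main__":
    api_key = os.environ.get("DEEPGRAM_API_KEY")
    if len(sys.argv) == 2 and api_key:
        print(transcribe(sys.argv[1], api_key))
    else:
        print("usage: DEEPGRAM_API_KEY=... python check_deepgram.py voice.ogg")
```

If this returns a transcript while the bot still delivers raw <media:audio>, the key and model are fine and the failure sits in the Telegram path or plugin registration.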

Expected behavior

Voice notes should be transcribed via the configured Deepgram provider before being delivered to the agent.

Actual behavior

  • Deepgram is never called — no Deepgram API requests are made
  • OpenClaw silently falls back to Whisper CLI for agents with full exec permissions
  • Agents with exec.security: "allowlist" (restricted) get no transcription at all — the Whisper fallback fails silently because bash is not in the allowlist
  • No errors are logged when Deepgram transcription is skipped

Logs

Gateway error log shows a plugin ID mismatch warning (likely cosmetic but potentially related):

plugins.entries.deepgram: plugin deepgram: plugin id mismatch (manifest uses "deepgram", entry hints "deepgram-provider")

No Deepgram-related errors or API call attempts are logged. The transcription step for Deepgram appears to be silently skipped entirely.
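
The mismatch warning can be picked out of a gateway log mechanically. This is a small sketch that assumes the warning always has the exact shape of the line quoted above; the log file location is not stated in this report, so it is taken as a command-line argument.

```python
import re
import sys

# Matches the warning format quoted in this report.
MISMATCH_RE = re.compile(
    r'plugin id mismatch \(manifest uses "(?P<manifest>[^"]+)", '
    r'entry hints "(?P<hint>[^"]+)"\)'
)


def find_id_mismatches(log_text: str) -> list:
    """Return (manifest_id, entry_hint) pairs for every mismatch warning found."""
    pairs = []
    for line in log_text.splitlines():
        match = MISMATCH_RE.search(line)
        if match:
            pairs.append((match.group("manifest"), match.group("hint")))
    return pairs


if __name__ == "__main__":
    if len(sys.argv) == 2:
        with open(sys.argv[1], encoding="utf-8", errors="replace") as log_file:
            for manifest_id, hint in find_id_mismatches(log_file.read()):
                print(f'manifest id "{manifest_id}" vs entry hint "{hint}"')
    else:
        print("usage: python find_mismatch.py <gateway-log-file>")
```

Listing every pair makes it easier to see whether Deepgram is the only plugin affected or whether other plugins log the same warning while still working.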

Possibly related

  • The 2026.4.1 changelog mentions: "Telegram/audio: transcode Telegram voice-note .ogg attachments before the local whisper-cli auto fallback runs, and keep mention-preflight transcription enabled in auto mode when tools.media.audio is unset"
  • Issue #7899 (applyMediaUnderstanding not called)
  • Issue #17101 (Telegram voice messages not transcribed)
  • The Deepgram plugin ID mismatch may be causing the plugin to not be recognized as a valid transcription provider despite loading successfully

Impact

  • Critical for restricted agents: agents with exec allowlist have no transcription at all — voice notes are completely unusable
  • Misleading for unrestricted agents: transcription appears to work intermittently but is actually using a different provider (Whisper) than configured (Deepgram)
  • Voice-heavy users locked out of voice interaction

Extent analysis

TL;DR

Investigate and resolve the plugin ID mismatch warning for the Deepgram plugin, as it may be causing the transcription step to be silently skipped.

Guidance

  • Verify that the DEEPGRAM_API_KEY is correctly set and accessible in the ~/.openclaw/.env file.
  • Check the OpenClaw configuration to ensure that the Deepgram provider is correctly specified and enabled.
  • Investigate the plugin ID mismatch warning and determine if it is related to the issue, potentially by checking the plugin's manifest and entry hints.
  • Test the transcription with a different provider, such as the local Whisper CLI, to see if the issue is specific to Deepgram.
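
The configuration check in the guidance above can be scripted. The sketch below only inspects the tools.media.audio fragment shown in this report; the function name and the path to the configuration file are assumptions, and passing this check does not prove that OpenClaw resolves the provider at runtime.

```python
import json
import sys


def audio_config_problems(config: dict, provider: str = "deepgram") -> list:
    """Return human-readable problems found in the tools.media.audio section."""
    audio = config.get("tools", {}).get("media", {}).get("audio")
    if audio is None:
        return ["tools.media.audio is not set"]
    problems = []
    if audio.get("enabled") is not True:
        problems.append("tools.media.audio.enabled is not true")
    models = audio.get("models", [])
    if not any(model.get("provider") == provider for model in models):
        problems.append(f'no entry in tools.media.audio.models uses provider "{provider}"')
    return problems


if __name__ == "__main__":
    if len(sys.argv) == 2:
        with open(sys.argv[1], encoding="utf-8") as config_file:
            found = audio_config_problems(json.load(config_file))
        print("\n".join(found) if found else "audio section looks as expected")
    else:
        print("usage: python check_audio_config.py <path-to-config.json>")
```

For the configuration quoted in this report the function returns an empty list, which is consistent with the reporter's view that the configuration itself is not the problem.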

Example

No code change is proposed here, since the issue appears to stem from configuration and plugin registration rather than from application code.

Notes

The issue may be related to a change in the 2026.4.1 version of OpenClaw, and checking the changelog and related issues (#7899 and #17101) may provide more insight. Additionally, the fact that it works fine on Discord suggests that the issue may be specific to the Telegram integration.

Recommendation

As a temporary workaround, use the local Whisper CLI as the transcription provider until the Deepgram plugin issue is resolved. This only helps agents with full exec permissions: agents with restricted exec (exec.security: "allowlist") cannot run the Whisper fallback and will still get no transcription until the root cause is addressed.
