openclaw - 💡(How to fix) Fix [Feature]: ElevenLabs Realtime Voice Provider for Talk

openclaw2026-05-25 10:11:01

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

Code Example

{
  talk: {
    realtime: {
      provider: "elevenlabs",
      providers: {
        elevenlabs: {
          apiKey: "${ELEVENLABS_API_KEY}",
          model: "eleven_flash_v2_5",  // or "eleven_v3"
          voice: "N2lVS1w4EtoT3dr4eOWO", // Callum - Husky Trickster
          latencyTier: 4,  // 0-4, maps to optimize_streaming_latency
          // Conversational AI specific:
          agentId: "<optional-convai-agent-id>", // for custom agents
        },
      },
      mode: "realtime",
      transport: "webrtc", // or "provider-websocket"
      brain: "agent-consult",
    },
  },
}

RAW_BUFFERClick to expand / collapse

Feature Request: ElevenLabs Realtime Voice Provider Is your feature request related to a problem? Please describe. Currently, talk.realtime.provider only supports openai and google. Users who prefer ElevenLabs (better voice quality, lower latency with Flash v2.5, specific voice identities) cannot use native realtime voice conversations without routing through OpenAI or Google backends. Describe the solution you'd like Add elevenlabs as a supported talk.realtime.provider, leveraging ElevenLabs' existing realtime capabilities: Streaming TTS: Eleven Flash v2.5 (sub-300ms latency) via WebSocket streaming Realtime STT: Scribe v2 Realtime (already bundled for Voice Call plugin) Conversational AI: ElevenLabs Conversational AI API (WebSocket-based, supports turn-taking, interruption detection) Describe alternatives you've considered stt-tts mode: Works with ElevenLabs TTS but lacks the sub-300ms latency and smooth turn-taking of true realtime Voice Call plugin: Only for telephony use cases, not browser Talk Additional context ElevenLabs APIs that could be integrated: Text-to-Speech Streaming API (/v1/text-to-speech/{voice_id}/stream with optimize_streaming_latency=4) Conversational AI API — full-duplex WebSocket conversation with built-in VAD, interruption handling, and agent configuration Scribe v2 Realtime — already partially integrated for Voice Call streaming STT Configuration example (desired):

{
  talk: {
    realtime: {
      provider: "elevenlabs",
      providers: {
        elevenlabs: {
          apiKey: "${ELEVENLABS_API_KEY}",
          model: "eleven_flash_v2_5",  // or "eleven_v3"
          voice: "N2lVS1w4EtoT3dr4eOWO", // Callum - Husky Trickster
          latencyTier: 4,  // 0-4, maps to optimize_streaming_latency
          // Conversational AI specific:
          agentId: "<optional-convai-agent-id>", // for custom agents
        },
      },
      mode: "realtime",
      transport: "webrtc", // or "provider-websocket"
      brain: "agent-consult",
    },
  },
}

Use case: User has ElevenLabs API key configured for TTS and wants the same voice identity + low latency in browser Talk realtime mode, without needing OpenAI API access. Target Repository: https://github.com/openclaw/openclaw/issues Labels: `enhancement`, `voice`, `talk`, `elevenlabs` Related: ElevenLabs Conversational AI API, Streaming TTS API, Scribe v2 Realtime

Submitted by: Nas

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

openclaw - 💡(How to fix) Fix [Feature]: ElevenLabs Realtime Voice Provider for Talk

Recommended Tools

GitHub issue graph ai analysis

Code Example

Still need to ship something?

TRENDING