openclaw - 💡(How to fix) Fix [Feature]: Add MiniMax Speech as TTS Provider [1 comments, 2 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
openclaw/openclaw#59734Fetched 2026-04-08 02:41:14
View on GitHub
Comments
1
Participants
2
Timeline
7
Reactions
0
Timeline (top)
subscribed ×2closed ×1commented ×1labeled ×1

Add MiniMax Speech-02 as a fourth TTS provider option alongside ElevenLabs, OpenAI, and Edge.

Root Cause

Add MiniMax Speech-02 as a fourth TTS provider option alongside ElevenLabs, OpenAI, and Edge.

RAW_BUFFERClick to expand / collapse

Summary

Add MiniMax Speech-02 as a fourth TTS provider option alongside ElevenLabs, OpenAI, and Edge.

Problem to solve

OpenClaw currently only supports three TTS providers: ElevenLabs, OpenAI, and Edge. Many users who already use MiniMax for LLM capabilities (via OAuth or API) want to use MiniMax Speech-02 for text-to-speech, but it's not available. The existing TTS options either require paid accounts (ElevenLabs, OpenAI) or have limited voice quality (Edge). MiniMax Speech-02 offers significantly better Chinese TTS quality (WER 2.25% vs ElevenLabs 16%), lower cost (1/4 of ElevenLabs), and already integrates with many platforms like LiveKit, Vapi, and Pipecat.

Proposed solution

Add "minimax" as a TtsProvider option in the OpenClaw config schema:

  1. Add to TtsProvider type: "minimax"
  2. Implement MiniMax TTS adapter using MiniMax Text-to-Speech API (speech-02, speech-02-hd models)
  3. Support configuration via messages.tts.provider: "minimax" with model options (speech-02, speech-02-hd, speech-02.6)
  4. Support voice selection, speed control, and emotion control parameters
  5. Leverage existing MiniMax OAuth integration for authentication

Configuration example: { "messages": { "tts": { "provider": "minimax", "minimax": { "model": "speech-02-hd", "voiceId": "male-qn-qingse", "speed": 1.0, "emotion": "chat" } } } }

Alternatives considered

  1. Using ElevenLabs or OpenAI TTS - requires separate paid accounts, higher cost, worse Chinese voice quality
  2. Using Edge TTS - free but limited voice options and lower quality
  3. Community-developed skills (minimax-tts) - not officially supported, may have stability issues
  4. Using MiniMax API directly via custom scripts - requires manual setup, not integrated into OpenClaw's TTS pipeline

Impact

  • Affected users: OpenClaw users who use MiniMax for LLM and want unified TTS; Chinese-speaking users who need high-quality Chinese TTS
  • Severity: Medium-High - blocks adoption of MiniMax as complete AI platform solution
  • Frequency: Constant - every user who wants MiniMax TTS
  • Consequences: Users must maintain separate TTS subscriptions (ElevenLabs/OpenAI) alongside their MiniMax subscription, increasing costs and complexity

Evidence/examples

Additional information

OpenClaw already has MiniMax OAuth integration, which means users are already authenticated with MiniMax. This makes adding MiniMax TTS a natural extension that would require minimal additional authentication setup. Many community members have requested this feature (evidenced by multiple closed issues/PRs), showing clear user demand.

extent analysis

TL;DR

Add MiniMax Speech-02 as a TTS provider option in OpenClaw by implementing a MiniMax TTS adapter and updating the config schema.

Guidance

  • Update the TtsProvider type to include "minimax" as an option.
  • Implement a MiniMax TTS adapter using the MiniMax Text-to-Speech API, supporting models such as "speech-02" and "speech-02-hd".
  • Add configuration options for MiniMax TTS, including model selection, voice selection, speed control, and emotion control.
  • Leverage the existing MiniMax OAuth integration for authentication.

Example

{
  "messages": {
    "tts": {
      "provider": "minimax",
      "minimax": {
        "model": "speech-02-hd",
        "voiceId": "male-qn-qingse",
        "speed": 1.0,
        "emotion": "chat"
      }
    }
  }
}

Notes

The implementation should consider the existing MiniMax OAuth integration and the user demand for high-quality Chinese TTS.

Recommendation

Apply the proposed solution by adding MiniMax Speech-02 as a TTS provider option, as it offers better Chinese TTS quality, lower cost, and integrates well with existing platforms.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING