litellm - 💡(How to fix) Fix [Feature]: Support gpt-realtime-whisper model (OpenAI Realtime Transcription)

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…

Error Message

It results in an unsupported model error or falls through without proper routing.

Fix Action

Fix / Workaround

Currently, the only workaround is to call the OpenAI API directly, bypassing LiteLLM entirely, which defeats the purpose of using LiteLLM as a unified proxy/gateway.

Code Example

import litellm

response = litellm.speech_to_text(
    model="openai/gpt-realtime-whisper",
    ...
)
RAW_BUFFERClick to expand / collapse

Feature Request

Is your feature request related to a problem? Please describe.

LiteLLM does not currently support the gpt-realtime-whisper model released by OpenAI.

This model is a dedicated real-time speech-to-text model that streams transcription results via the /v1/realtime WebSocket API using the new /v1/realtime/transcription_sessions endpoint. It is distinct from the standard gpt-realtime-* conversation models in that it focuses solely on transcription rather than bidirectional audio conversation.

When attempting to use this model via LiteLLM:

import litellm

response = litellm.speech_to_text(
    model="openai/gpt-realtime-whisper",
    ...
)

It results in an unsupported model error or falls through without proper routing.

Describe the solution you'd like

  1. Model registration: Add gpt-realtime-whisper to LiteLLM's OpenAI model list/registry (e.g., in model_prices_and_context_window.json or equivalent).
  2. Endpoint routing: Route requests for gpt-realtime-whisper to the /v1/realtime/transcription_sessions WebSocket endpoint, not the standard /v1/realtime sessions endpoint.
  3. Pricing: The model is priced at $0.017 / minute (audio input), not per token. Ensure cost tracking reflects this.

Model details

PropertyValue
Model IDgpt-realtime-whisper
ProviderOpenAI
API Endpointwss://api.openai.com/v1/realtime (with transcription_sessions)
Pricing$0.017 / minute (audio input)
Use caseReal-time streaming transcription (STT only)

Describe alternatives you've considered

Currently, the only workaround is to call the OpenAI API directly, bypassing LiteLLM entirely, which defeats the purpose of using LiteLLM as a unified proxy/gateway.

Additional context

  • OpenAI Realtime Transcription API docs
  • Related existing support: gpt-realtime-mini, gpt-4o-realtime-preview (conversation models) are already supported, but the transcription-specific variant is missing.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING