litellm - 💡(How to fix) Fix [Feature]: Support gpt-realtime-whisper model (OpenAI Realtime Transcription)

StepCodex · 2026-05-22T01:35:56Z

[litellm] Feature Request Is your feature request related to a problem? Please describe. LiteLLM does not currently support the gpt-realtime-whisper model rele… ## Fix / Workaround Currently, the only workaround is to call the OpenAI API directly, bypassing LiteLLM entirely, which defeats the purpose of using LiteLLM as a unified proxy/gateway. ## Feature Request ### Is your feature request related to a problem? Please describe. LiteLLM does not currently support the `gpt-realtime-whisper` model released by OpenAI. This model is a dedicated real-time speech-to-text model that streams transcription results via the `/v1/realtime` WebSocket API using the new `/v1/realtime/transcription_sessions` endpoint. It is distinct from the standard `gpt-realtime-*` conversation models in that it focuses solely on transcription rather than bidirectional audio conversation. When attempting to use this model via LiteLLM: ```python import litellm response = litellm.speech_to_text( model="openai/gpt-realtime-whisper", ... ) ``` It results in an unsupported model error or falls through without proper routing. ### Describe the solution you'd like 1. **Model registration**: Add `gpt-realtime-whisper` to LiteLLM's OpenAI model list/registry (e.g., in `model_prices_and_context_window.json` or equivalent). 2. **Endpoint routing**: Route requests for `gpt-realtime-whisper` to the `/v1/realtime/transcription_sessions` WebSocket endpoint, not the standard `/v1/realtime` sessions endpoint. 3. **Pricing**: The model is priced at **$0.017 / minute** (audio input), not per token. Ensure cost tracking reflects this. ### Model details | Property | Value | |----------|-------| | Model ID | `gpt-realtime-whisper` | | Provider | OpenAI | | API Endpoint | `wss://api.openai.com/v1/realtime` (with `transcription_sessions`) | | Pricing | $0.017 / minute (audio input) | | Use case | Real-time streaming transcription (STT only) | ### Describe alternatives you've considered Currently, the only workaround is to call the OpenAI API directly, bypassing LiteLLM entirely, which defeats the purpose of using LiteLLM as a unified proxy/gateway. ### Additional context - [OpenAI Realtime Transcription API docs](https://platform.openai.com/docs/guides/realtime-transcription) - Related existing support: `gpt-realtime-mini`, `gpt-4o-realtime-preview` (conversation models) are already supported, but the transcription-specific variant is missing.

litellm2026-05-22 01:35:56

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

Error Message

It results in an unsupported model error or falls through without proper routing.

Fix Action

Fix / Workaround

Currently, the only workaround is to call the OpenAI API directly, bypassing LiteLLM entirely, which defeats the purpose of using LiteLLM as a unified proxy/gateway.

Code Example

import litellm

response = litellm.speech_to_text(
    model="openai/gpt-realtime-whisper",
    ...
)

RAW_BUFFERClick to expand / collapse

Feature Request

Is your feature request related to a problem? Please describe.

LiteLLM does not currently support the gpt-realtime-whisper model released by OpenAI.

This model is a dedicated real-time speech-to-text model that streams transcription results via the /v1/realtime WebSocket API using the new /v1/realtime/transcription_sessions endpoint. It is distinct from the standard gpt-realtime-* conversation models in that it focuses solely on transcription rather than bidirectional audio conversation.

When attempting to use this model via LiteLLM:

import litellm

response = litellm.speech_to_text(
    model="openai/gpt-realtime-whisper",
    ...
)

It results in an unsupported model error or falls through without proper routing.

Describe the solution you'd like

Model registration: Add gpt-realtime-whisper to LiteLLM's OpenAI model list/registry (e.g., in model_prices_and_context_window.json or equivalent).
Endpoint routing: Route requests for gpt-realtime-whisper to the /v1/realtime/transcription_sessions WebSocket endpoint, not the standard /v1/realtime sessions endpoint.
Pricing: The model is priced at $0.017 / minute (audio input), not per token. Ensure cost tracking reflects this.

Model details

Property	Value
Model ID	`gpt-realtime-whisper`
Provider	OpenAI
API Endpoint	`wss://api.openai.com/v1/realtime` (with `transcription_sessions`)
Pricing	$0.017 / minute (audio input)
Use case	Real-time streaming transcription (STT only)

Describe alternatives you've considered

Currently, the only workaround is to call the OpenAI API directly, bypassing LiteLLM entirely, which defeats the purpose of using LiteLLM as a unified proxy/gateway.

Additional context

OpenAI Realtime Transcription API docs
Related existing support: gpt-realtime-mini, gpt-4o-realtime-preview (conversation models) are already supported, but the transcription-specific variant is missing.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering