Source: https://docs.venice.ai/api-reference/

We need to update both model_prices_and_context_window.json and model_prices_and_context_window_backup.json to reflect the new model.

Please add Venice models support

Daniel-OS01 · 2026-03-20T17:56:51Z


https://docs.venice.ai/models/overview

Model list table

Model Name	Model ID	Context	Attributes	Size	Input ($/M)	Output ($/M)	Cache/Notes
MiniMax M2.7	minimax-m27	198K	Anonymized	L	$0.38	$1.50	$0.07 cache
Venice Uncensored 1.1	e2ee-venice-uncensored-24b-p	32K	E2EE, Private, Beta, Uncensored	L	$0.25	$1.15
Gemma 3 27B	e2ee-gemma-3-27b-p	40K	E2EE, Private, Beta	L	$0.14	$0.50
GLM 4.7	e2ee-glm-4-7-p	128K	E2EE, Private, Beta	L	$1.10	$4.15
GLM 4.7 Flash	e2ee-glm-4-7-flash-p	198K	E2EE, Private, Beta	L	$0.13	$0.55
GPT OSS 20B	e2ee-gpt-oss-20b-p	128K	E2EE, Private, Beta	L	$0.05	$0.19
GPT OSS 120B	e2ee-gpt-oss-120b-p	128K	E2EE, Private, Beta	L	$0.13	$0.65
Qwen 2.5 7B	e2ee-qwen-2-5-7b-p	32K	E2EE, Private, Beta	L	$0.05	$0.13
Qwen3 30B A3B	e2ee-qwen3-30b-a3b-p	256K	E2EE, Private, Beta	L	$0.19	$0.69
Qwen3 VL 30B A3B	e2ee-qwen3-vl-30b-a3b-p	128K	E2EE, Private, Beta	L	$0.25	$0.90
GLM 5	e2ee-glm-5	198K	E2EE, Private, Beta	L	$1.10	$4.15
Qwen3.5 122B A10B	e2ee-qwen3-5-122b-a10b	128K	E2EE, Private, Beta	L	$0.50	$4.00
Grok 4.20 Beta	grok-4-20-beta	2.0M	Anonymized, Beta	L	$2.50	$7.50	$0.25 cache (Higher tier >200K: $5.00/$15.00)
Grok 4.20 Multi-Agent Beta	grok-4-20-multi-agent-beta	2.0M	Anonymized, Beta	L	$2.50	$7.50	$0.25 cache (Higher tier >200K: $5.00/$15.00)
Qwen 3.5 9B	qwen3-5-9b	256K	Private	L	$0.05	$0.15
GPT-5.4	openai-gpt-54	1.0M	Anonymized, Beta	L	$3.13	$18.80	$0.31 cache
GPT-5.4 Pro	openai-gpt-54-pro	1.0M	Anonymized, Beta	L	$37.50	$225.00	Higher tier >272K: $75.00/$337.50
GPT-4o	openai-gpt-4o-2024-11-20	128K	Anonymized	L	$3.13	$12.50
GPT-4o Mini	openai-gpt-4o-mini-2024-07-18	128K	Anonymized	L	$0.19	$0.75	$0.09 cache
Qwen 3.5 35B A3B	qwen3-5-35b-a3b	256K	Private, Beta	L	$0.31	$1.25	$0.16 cache
GPT-5.3 Codex	openai-gpt-53-codex	400K	Anonymized, Beta	L	$2.19	$17.50	$0.22 cache
Venice Role Play Uncensored	venice-uncensored-role-play	128K	Private, Uncensored	L	$0.50	$2.00
Gemini 3.1 Pro Preview	gemini-3-1-pro-preview	1.0M	Anonymized	L	$2.50	$15.00	$0.50 cache (Higher tier >200K: $5.00/$22.50)
Claude Sonnet 4.6	claude-sonnet-4-6	1.0M	Anonymized, Beta	L	$3.60	$18.00	$0.36/$4.50 cache
MiniMax M2.5	minimax-m25	198K	Private	L	$0.34	$1.19	$0.04 cache
GLM 5	zai-org-glm-5	198K	Private	L	$1.00	$3.20	$0.20 cache
Claude Opus 4.6	claude-opus-4-6	1.0M	Anonymized, Beta	L	$6.00	$30.00	$0.60/$7.50 cache
GLM 4.7 Flash Heretic	olafangensan-glm-4.7-flash-heretic	200K	Private	L	$0.14	$0.80
GLM 4.7 Flash	zai-org-glm-4.7-flash	128K	Private	L	$0.13	$0.50
Kimi K2.5	kimi-k2-5	256K	Private	L	$0.56	$3.50	$0.11 cache
Qwen 3 Coder 480B Turbo	qwen3-coder-480b-a35b-instruct-turbo	256K	Private, Beta	L	$0.35	$1.50	$0.04 cache
NVIDIA Nemotron 3 Nano 30B	nvidia-nemotron-3-nano-30b-a3b	128K	Private, Beta	L	$0.07	$0.30
Qwen3 VL 235B	qwen3-vl-235b-a22b	256K	Private	L	$0.25	$1.50
Mistral Small 3.2 24B Instruct	mistral-small-3-2-24b-instruct	256K	Private	L	$0.09	$0.25
GLM 4.7	zai-org-glm-4.7	198K	Private	L	$0.55	$2.65	$0.11 cache
Gemini 3 Flash Preview	gemini-3-flash-preview	256K	Anonymized	L	$0.70	$3.75	$0.07 cache
GPT-5.2	openai-gpt-52	256K	Anonymized	L	$2.19	$17.50	$0.22 cache
Kimi K2 Thinking	kimi-k2-thinking	256K	Private	L	$0.75	$3.20	$0.38 cache
Claude Opus 4.5	claude-opus-4-5	198K	Anonymized	L	$6.00	$30.00	$0.60/$7.50 cache
DeepSeek V3.2	deepseek-v3.2	160K	Private	L	$0.33	$0.48	$0.16 cache
Gemini 3 Pro Preview	gemini-3-pro-preview	198K	Anonymized	L	$2.50	$15.00	$0.63 cache
Grok 4.1 Fast	grok-41-fast	1.0M	Anonymized	L	$0.25	$0.63	$0.06 cache
MiniMax M2.1	minimax-m21	198K	Private	L	$0.35	$1.50	$0.04 cache
Grok Code Fast 1	grok-code-fast-1	256K	Anonymized	L	$0.25	$1.87	$0.03 cache
OpenAI GPT OSS 120B	openai-gpt-oss-120b	128K	Private	L	$0.07	$0.30
Google Gemma 3 27B Instruct	google-gemma-3-27b-it	198K	Private	M	$0.12	$0.20
Hermes 3 Llama 3.1 405b	hermes-3-llama-3.1-405b	128K	Private	L	$1.10	$3.00
Venice Small	qwen3-4b	32K	Private, Deprecated	XS	$0.05	$0.15
Qwen 3 235B A22B Thinking 2507	qwen3-235b-a22b-thinking-2507	128K	Private	L	$0.45	$3.50
Qwen 3 235B A22B Instruct 2507	qwen3-235b-a22b-instruct-2507	128K	Private	L	$0.15	$0.75
Qwen 3 Next 80b	qwen3-next-80b	256K	Private	M	$0.35	$1.90
Qwen 3 Coder 480b	qwen3-coder-480b-a35b-instruct	256K	Private	L	$0.75	$3.00
Llama 3.3 70B	llama-3.3-70b	128K	Private	M	$0.70	$2.80
Venice Uncensored 1.1	venice-uncensored	32K	Private, Uncensored	S	$0.20	$0.90
Venice Medium	mistral-31-24b	128K	Private, Deprecated	S	$0.50	$2.00
Claude Sonnet 4.5	claude-sonnet-4-5	198K	Anonymized	L	$3.75	$18.75	$0.38/$4.69 cache
GPT-5.2 Codex	openai-gpt-52-codex	256K	Anonymized	L	$2.19	$17.50	$0.22 cache
Llama 3.2 3B	llama-3.2-3b	128K	Private	XS	$0.15	$0.60
GLM 4.6	zai-org-glm-4.6	198K	Private	L	$0.85	$2.75	$0.30 cache
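The table lists context in K/M tokens and prices in dollars per million tokens, while LiteLLM's pricing files store token counts as integers and costs per token. A minimal sketch of that conversion for one tab-separated row follows. The function names are hypothetical, the "venice" provider prefix is taken from the "venice/grok-code-fast-1" key named in this page's title, and treating "K" as 1,000 rather than 1,024 is an assumption.

```python
from decimal import Decimal

# Suffix multipliers for the Context column (assumed decimal, not 1024-based).
_SUFFIX = {"K": 1_000, "M": 1_000_000}


def parse_context(text: str) -> int:
    """Convert a context string such as '198K' or '2.0M' to a token count."""
    text = text.strip()
    return int(Decimal(text[:-1]) * _SUFFIX[text[-1].upper()])


def per_token(price_per_million: str) -> float:
    """Convert '$0.38' (USD per million tokens) to USD per token."""
    dollars = Decimal(price_per_million.strip().lstrip("$"))
    return float(dollars / Decimal(1_000_000))


def row_to_entry(row: str, provider: str = "venice"):
    """Turn one tab-separated table row into a (key, entry) pair."""
    cols = row.rstrip("\n").split("\t")
    # Only the first seven columns are used; the Cache/Notes column is ignored here.
    _name, model_id, context, _attrs, _size, inp, out = cols[:7]
    entry = {
        "max_input_tokens": parse_context(context),
        "input_cost_per_token": per_token(inp),
        "output_cost_per_token": per_token(out),
        "litellm_provider": provider,  # assumed provider name
        "mode": "chat",                # assumed mode for every row
    }
    return f"{provider}/{model_id}", entry
```

The Cache/Notes column (cache prices and higher-tier prices) is left out of this sketch because its format varies from row to row; those values would need to be handled separately.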

extent analysis

Fix Plan

To update the model_prices_and_context_window.json and model_prices_and_context_window_backup.json files to reflect the new Venice models, follow these steps:

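Add new model entries: Insert one entry per model into both files. Each file is a single JSON object keyed by model name, and costs are stored per token rather than per million tokens, so $0.25/M becomes 2.5e-07. For example, for Venice Uncensored 1.1 (the "venice" provider name and the "chat" mode are assumptions here, and 32K is written as 32000):

"venice/e2ee-venice-uncensored-24b-p": {
  "max_input_tokens": 32000,
  "input_cost_per_token": 2.5e-07,
  "output_cost_per_token": 1.15e-06,
  "litellm_provider": "venice",
  "mode": "chat"
}

Add similar entries for the other models in the table, replacing the values as necessary. The Attributes and Size columns do not appear to have a counterpart in these files and can be left out.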

Update existing model entries (if necessary): Review the existing models and update their prices, contexts, or attributes if they have changed.
Skip deprecated models (if necessary): The table marks Venice Small (qwen3-4b) and Venice Medium (mistral-31-24b) as Deprecated; consider leaving them out of the files.
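The steps above can be sketched as a small merge helper that is run once per file, so both files receive the same entries. This is a hedged sketch, not LiteLLM tooling: `add_entries` is a hypothetical name, and the 4-space indentation is an assumption about how the files are formatted.

```python
import json
from pathlib import Path


def add_entries(path: Path, new_entries: dict, overwrite: bool = False) -> list:
    """Merge new_entries into the JSON file at path; return the keys that were written."""
    data = json.loads(path.read_text(encoding="utf-8"))
    written = []
    for key, entry in new_entries.items():
        if key in data and not overwrite:
            continue  # keep existing entries unless an update was requested
        data[key] = entry
        written.append(key)
    # Assumed formatting: 4-space indent and a trailing newline.
    path.write_text(json.dumps(data, indent=4) + "\n", encoding="utf-8")
    return written
```

Calling it for model_prices_and_context_window.json and then for model_prices_and_context_window_backup.json with the same `new_entries` dictionary keeps the two files aligned. Passing `overwrite=True` covers the "update existing model entries" step.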

Verification

To verify that the fix worked:

Check file contents: Open the updated model_prices_and_context_window.json and model_prices_and_context_window_backup.json files and verify that the new model entries have been added correctly.
Test model usage: Use the updated models in your application and verify that they are working as expected, with the correct prices and attributes applied.
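The file check can also be done programmatically. The sketch below assumes both files have already been loaded into dictionaries with `json.load`, and that new keys start with "venice/". The function name and the list of required fields are illustrative choices, not a LiteLLM requirement.

```python
# Fields this sketch expects on every new entry (an illustrative minimum).
REQUIRED = ("input_cost_per_token", "output_cost_per_token", "litellm_provider", "mode")


def check_entries(primary: dict, backup: dict, prefix: str = "venice/") -> list:
    """Return a list of problems found; an empty list means the check passed."""
    problems = []
    for key in sorted(k for k in primary if k.startswith(prefix)):
        entry = primary[key]
        missing = [field for field in REQUIRED if field not in entry]
        if missing:
            problems.append(f"{key}: missing {', '.join(missing)}")
        if backup.get(key) != entry:
            problems.append(f"{key}: backup file differs or lacks this entry")
    return problems
```

An empty result only shows that the two files agree and that the basic fields are present; it does not confirm that the prices match the Venice documentation.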

Extra Tips

Make sure to back up the original files before making any changes.
Consider automating the process of updating the model files using a script or a CI/CD pipeline.
Keep the model files in sync with the latest documentation and pricing information from the model providers.
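As a small illustration of the first tip, a copy of each file can be made before it is edited. The helper name and the ".orig" suffix are hypothetical; in a Git checkout the same effect is available from version control.

```python
import shutil
from pathlib import Path


def backup_file(path: Path) -> Path:
    """Copy path to a sibling file with '.orig' appended and return the copy's path."""
    copy = path.with_name(path.name + ".orig")
    shutil.copy2(path, copy)  # copy2 also preserves timestamps where possible
    return copy
```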

litellm - 💡(How to fix) Fix Add Venice models support "venice/grok-code-fast-1" in "model_prices_and_context_window.json" [1 participants]