litellm - 💡(How to fix) Fix Bug: vLLM pass-through endpoint ignores api_key from DB-stored model deployments (401 error) [4 comments, 2 participants]

Q: Expected behavior

The `api_key` from the deployment configuration should be forwarded to the upstream vLLM server as the `x-api-key` header.

litellm2026-03-12 11:56:00

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

BerriAI/litellm#23443•Fetched 2026-04-08 00:44:18

View on GitHub

Comments

Participants

Timeline

Reactions

Author

darin-srp

Participants

darin-srp

jannikstdl

Timeline (top)

mentioned ×4subscribed ×4commented ×1

Error Message

When a vLLM model is configured via the database (e.g., through the UI or management API) with a per-deployment api_key, the vLLM pass-through endpoint (/vllm/{endpoint}) does not send the API key to the upstream vLLM server, resulting in a 401 Unauthorized error.

Root Cause

In litellm/llms/vllm/common_utils.py, VLLMModelInfo.get_api_key() unconditionally returns None, discarding the api_key passed in from the router deployment:

# litellm/llms/vllm/common_utils.py:56-57
@staticmethod
def get_api_key(api_key: Optional[str] = None) -> Optional[str]:
    return None  # <-- always returns None, ignoring the input

This is called in the pass-through flow at litellm/passthrough/main.py:259:

provider_api_key = provider_config.get_api_key(api_key)  # returns None

auth_headers = provider_config.validate_environment(
    ...,
    api_key=provider_api_key,  # None → no x-api-key header set
)

Even though the Router correctly extracts api_key from the deployment's litellm_params and passes it through allm_passthrough_route → llm_passthrough_route, get_api_key() throws it away.

Code Example

model_name: my-vllm-model
   litellm_params:
     model: hosted_vllm/my-model
     api_base: https://my-vllm-server.com
     api_key: sk-my-vllm-key

---

curl -X POST 'http://localhost:4000/vllm/v1/chat/completions' \
     -H 'Authorization: Bearer <litellm-virtual-key>' \
     -H 'Content-Type: application/json' \
     -d '{"model": "my-vllm-model", "messages": [{"role": "user", "content": "hello"}]}'

---

# litellm/llms/vllm/common_utils.py:56-57
@staticmethod
def get_api_key(api_key: Optional[str] = None) -> Optional[str]:
    return None  # <-- always returns None, ignoring the input

---

provider_api_key = provider_config.get_api_key(api_key)  # returns None

auth_headers = provider_config.validate_environment(
    ...,
    api_key=provider_api_key,  # None → no x-api-key header set
)

---

# litellm/llms/vllm/common_utils.py
@staticmethod
def get_api_key(api_key: Optional[str] = None) -> Optional[str]:
    return api_key or get_secret_str("VLLM_API_KEY")

RAW_BUFFERClick to expand / collapse

What happened?

Models configured in YAML with api_key in litellm_params are also affected by the same code path.

Steps to reproduce

Add a vLLM model via DB/UI with an api_key:

model_name: my-vllm-model
litellm_params:
  model: hosted_vllm/my-model
  api_base: https://my-vllm-server.com
  api_key: sk-my-vllm-key

Send a pass-through request:

curl -X POST 'http://localhost:4000/vllm/v1/chat/completions' \
  -H 'Authorization: Bearer <litellm-virtual-key>' \
  -H 'Content-Type: application/json' \
  -d '{"model": "my-vllm-model", "messages": [{"role": "user", "content": "hello"}]}'

The upstream vLLM server returns 401 because no x-api-key header is sent.

Root cause

In litellm/llms/vllm/common_utils.py, VLLMModelInfo.get_api_key() unconditionally returns None, discarding the api_key passed in from the router deployment:

# litellm/llms/vllm/common_utils.py:56-57
@staticmethod
def get_api_key(api_key: Optional[str] = None) -> Optional[str]:
    return None  # <-- always returns None, ignoring the input

This is called in the pass-through flow at litellm/passthrough/main.py:259:

provider_api_key = provider_config.get_api_key(api_key)  # returns None

auth_headers = provider_config.validate_environment(
    ...,
    api_key=provider_api_key,  # None → no x-api-key header set
)

Even though the Router correctly extracts api_key from the deployment's litellm_params and passes it through allm_passthrough_route → llm_passthrough_route, get_api_key() throws it away.

Expected behavior

The api_key from the deployment configuration should be forwarded to the upstream vLLM server as the x-api-key header.

Suggested fix

# litellm/llms/vllm/common_utils.py
@staticmethod
def get_api_key(api_key: Optional[str] = None) -> Optional[str]:
    return api_key or get_secret_str("VLLM_API_KEY")

This preserves backward compatibility (falls back to env var) while correctly using per-deployment keys.

LiteLLM version

Latest main branch (commit e37efc4218)

Relevant code path

litellm/proxy/pass_through_endpoints/llm_passthrough_endpoints.py:328 — vllm_proxy_route calls router
litellm/router.py:3716-3730 — Router spreads deployment["litellm_params"] (including api_key) into kwargs
litellm/passthrough/main.py:259 — calls provider_config.get_api_key(api_key)
litellm/llms/vllm/common_utils.py:56-57 — get_api_key() returns None ← bug is here
litellm/llms/vllm/common_utils.py:42-43 — validate_environment() skips setting header since api_key is None

extent analysis

Fix Plan

To fix the issue, update the get_api_key method in litellm/llms/vllm/common_utils.py to return the provided api_key instead of always returning None.

Update the get_api_key method as follows:

@staticmethod
def get_api_key(api_key: Optional[str] = None) -> Optional[str]:
    return api_key or get_secret_str("VLLM_API_KEY")

This change ensures that the api_key from the deployment configuration is forwarded to the upstream vLLM server as the x-api-key header.

Verification

To verify the fix, follow these steps:

Update the common_utils.py file with the new get_api_key method.
Restart the LiteLLM service.
Send a pass-through request using the curl command provided in the issue description.
Check the upstream vLLM server logs to ensure that the x-api-key header is being sent with the correct api_key value.

Extra Tips

Make sure to test the fix with different deployment configurations to ensure that the api_key is being correctly forwarded in all cases.
Consider adding additional logging or monitoring to detect any future issues with the api_key forwarding.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

FAQ

Expected behavior

The api_key from the deployment configuration should be forwarded to the upstream vLLM server as the x-api-key header.

#api #ssr #installation #tensor shape #autograd error #callback error #memory management #API rate limit #retriever error

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

litellm - 💡(How to fix) Fix Bug: vLLM pass-through endpoint ignores api_key from DB-stored model deployments (401 error) [4 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Code Example

What happened?

Steps to reproduce

Root cause

Expected behavior

Suggested fix

LiteLLM version

Relevant code path

extent analysis

Fix Plan

Verification

Extra Tips

FAQ

Expected behavior

Still need to ship something?

TRENDING

litellm - 💡(How to fix) Fix Bug: vLLM pass-through endpoint ignores api_key from DB-stored model deployments (401 error) [4 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Code Example

What happened?

Steps to reproduce

Root cause

Expected behavior

Suggested fix

LiteLLM version

Relevant code path

extent analysis

Fix Plan

Verification

Extra Tips

FAQ

Expected behavior

Still need to ship something?

RELATED_DISCOVERY

TRENDING