vllm - 💡(How to fix) Fix [Usage]: Using RIXL Connector on AMD GPU [2 comments, 3 participants]

ssyrc · 2026-03-24T00:28:51Z

[vllm] Your current environment I have successfully used the Mooncake connector on AMD GPUs, and now I'm trying to use the Nixl connector for PD disaggregation… ### Your current environment I have successfully used the Mooncake connector on AMD GPUs, and now I'm trying to use the Nixl connector for PD disaggregation. I am in an air-gapped environment, so building Docker images is quite difficult. 1. Is there any prebuilt Docker image based on this Dockerfile? https://github.com/vllm-project/vllm/blob/main/docker/Dockerfile.rocm 2. The documentation states: For ROCm platform, the base ROCm docker file includes RIXL and ucx already. https://docs.vllm.ai/en/stable/features/nixl_connector_usage/#prerequisites But I tested vllm/vllm-openai-rocm:latest, but rixl is not installed (ModuleNotFoundError). Could anyone clarify whether the official Docker images include RIXL or not? 3. Is NixlConnector on ROCm intended to support communication with NVIDIA GPUs (cross-vendor), or only AMD-to-AMD?

vllm2026-03-24 00:28:51

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

vllm-project/vllm#37941•Fetched 2026-04-08 01:22:31

View on GitHub

Comments

Participants

Timeline

Reactions

Author

Participants

Timeline (top)

mentioned ×3subscribed ×3commented ×2labeled ×2

RAW_BUFFERClick to expand / collapse

Your current environment

I have successfully used the Mooncake connector on AMD GPUs, and now I'm trying to use the Nixl connector for PD disaggregation. I am in an air-gapped environment, so building Docker images is quite difficult.

Is there any prebuilt Docker image based on this Dockerfile? https://github.com/vllm-project/vllm/blob/main/docker/Dockerfile.rocm
The documentation states: For ROCm platform, the base ROCm docker file includes RIXL and ucx already. https://docs.vllm.ai/en/stable/features/nixl_connector_usage/#prerequisites But I tested vllm/vllm-openai-rocm:latest, but rixl is not installed (ModuleNotFoundError). Could anyone clarify whether the official Docker images include RIXL or not?
Is NixlConnector on ROCm intended to support communication with NVIDIA GPUs (cross-vendor), or only AMD-to-AMD?

extent analysis

Fix Plan

To resolve the issue with the Nixl connector on ROCm, we need to ensure that the RIXL library is installed in the Docker image.

Steps to Fix

Option 1: Use a custom Dockerfile
1. Create a new Dockerfile based on the official vllm/vllm-openai-rocm:latest image.
2. Install RIXL using pip: RUN pip install rixl
3. Build a new Docker image using the custom Dockerfile.
Option 2: Modify the existing Docker image
1. Pull the official vllm/vllm-openai-rocm:latest image.
2. Run a container from the image and install RIXL using pip: docker exec -it <container_id> pip install rixl
3. Commit the changes to create a new image.

Example Code

# Custom Dockerfile
FROM vllm/vllm-openai-rocm:latest
RUN pip install rixl

Verification

Run a container from the new image and verify that RIXL is installed: docker run -it <new_image_id> python -c "import rixl"

Extra Tips

For cross-vendor communication between AMD and NVIDIA GPUs, you may need to use a different connector or library that supports both vendors.
Ensure that the Docker image is built and tested in an environment similar to the production environment to avoid compatibility issues.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#latency issue #model loading #dependency error #configuration error #environment variable

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

vllm - 💡(How to fix) Fix [Usage]: Using RIXL Connector on AMD GPU [2 comments, 3 participants]

Recommended Tools

GitHub issue graph ai analysis

Your current environment

extent analysis

Fix Plan

Steps to Fix

Example Code

Verification

Extra Tips

Still need to ship something?

TRENDING

vllm - 💡(How to fix) Fix [Usage]: Using RIXL Connector on AMD GPU [2 comments, 3 participants]

Recommended Tools

GitHub issue graph ai analysis

Your current environment

extent analysis

Fix Plan

Steps to Fix

Example Code

Verification

Extra Tips

Still need to ship something?

RELATED_DISCOVERY

TRENDING