transformers - 💡(How to fix) Fix [Windows] RTX 5070 Ti (Blackwell sm_120) - setup and deployment notes [1 comments, 2 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
vllm-project/vllm#41614Fetched 2026-05-05 05:44:41
View on GitHub
Comments
1
Participants
2
Timeline
1
Reactions
0
Timeline (top)
commented ×1

Fix Action

Solution

$env:TORCH_CUDA_ARCH_LIST = "12.0"
$env:CUDA_VISIBLE_DEVICES = "0"
$env:CUDA_MODULE_LOADING = "LAZY"

Code Example

$env:TORCH_CUDA_ARCH_LIST = "12.0"
$env:CUDA_VISIBLE_DEVICES = "0"
$env:CUDA_MODULE_LOADING = "LAZY"
RAW_BUFFERClick to expand / collapse

Environment

  • GPU: NVIDIA GeForce RTX 5070 Ti Laptop GPU (Blackwell, compute capability 12.0)
  • Driver: 595.79 (CUDA 13.2)
  • OS: Windows 11
  • Python: 3.14
  • transformers: [latest]

Problem

transformers on RTX 5070 Ti requires workarounds:

  • TORCH_CUDA_ARCH_LIST=12.0 required for Blackwell
  • Model loading may fail or fall back to CPU without explicit arch
  • CUDA_VISIBLE_DEVICES=-1 system env var blocks GPU detection

Solution

$env:TORCH_CUDA_ARCH_LIST = "12.0"
$env:CUDA_VISIBLE_DEVICES = "0"
$env:CUDA_MODULE_LOADING = "LAZY"

Question

Would it be helpful to add RTX 50 series notes to the GPU documentation? I can contribute a PR with Windows troubleshooting.

extent analysis

TL;DR

Set TORCH_CUDA_ARCH_LIST=12.0 and apply the registry fix to utilize the NVIDIA GeForce RTX 5070 Ti Laptop GPU for deploying vLLM.

Guidance

  • Verify that the TORCH_CUDA_ARCH_LIST environment variable is set correctly to 12.0 to support the Blackwell architecture.
  • Apply the registry fix to set the GPU preference to the NVIDIA GPU, as the default setting routes Python to the Intel iGPU.
  • Consider contributing a PR to update the supported architectures list in vLLM to include Blackwell (sm_120) for better support of RTX 5070 Ti/5080/5090 laptops.
  • Ensure that the CUDA Toolkit and PyTorch cu130 PATH priorities are correctly configured to avoid DLL conflicts.

Example

export TORCH_CUDA_ARCH_LIST=12.0
export CUDA_VISIBLE_DEVICES=0
export CUDA_MODULE_LOADING=LAZY

And for Windows laptops:

import winreg
key = winreg.OpenKey(winreg.HKEY_CURRENT_USER,
    r'Software\Microsoft\DirectX\UserGpuPreferences',
    0, winreg.KEY_SET_VALUE)
winreg.SetValueEx(key, r'C:\path\to\python.exe', 0, winreg.REG_SZ, 'GpuPreference=2;')

Notes

The provided solution is specific to the NVIDIA GeForce RTX 5070 Ti Laptop GPU and may not apply to other GPU models or architectures.

Recommendation

Apply the workaround by setting TORCH_CUDA_ARCH_LIST=12.0 and applying the registry fix, as there is no indication of a fixed version that includes support for the Blackwell architecture.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING