vllm - [Bug]: [ROCm CI] HuggingFace dataset loading fails with "Feature type 'List' not found"

Multiple ROCm CI suites are failing during HuggingFace dataset loading with:

ValueError: Feature type 'List' not found

Affected suites observed:

  • mi250_1: Multi-Modal Accuracy Eval (Small Models)
  • mi355_1: Examples

The failure occurs while loading:

load_dataset("philschmid/mt-bench")

inside:

vllm/benchmarks/datasets/datasets.py

Your current environment

collect_env.py could not complete on the local machine due to a local CUDA driver/runtime mismatch during torch.cuda.init().

Relevant local test environment:

  • Ubuntu Linux
  • Python 3.12.3
  • HuggingFace datasets successfully loads philschmid/mt-bench
  • List exists in datasets.features._FEATURE_TYPES

🐛 Describe the bug

Investigation

The failure appears environment-specific rather than dataset-specific.

I verified in a clean Linux environment that:

from datasets import load_dataset
ds = load_dataset("philschmid/mt-bench")

loads successfully using a modern HuggingFace datasets installation.

Additionally:

from datasets.features import features
print("List" in features._FEATURE_TYPES)

returns True.

However, the ROCm CI environment fails with:

ValueError: Feature type 'List' not found

during dataset loading.
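
To compare the CI image against a working machine, the two checks above can be combined into one script and run in both places. This is a minimal sketch: `describe_datasets_env` is a hypothetical helper name, and `_FEATURE_TYPES` is a private registry that may move between `datasets` releases, so the lookup is wrapped defensively.

```python
import importlib
from importlib import metadata


def describe_datasets_env():
    """Collect version and feature-registry facts about the installed `datasets`.

    Hypothetical diagnostic helper; not part of vLLM or of `datasets`.
    """
    info = {"installed": False, "version": None, "has_list_feature": None}
    try:
        info["version"] = metadata.version("datasets")
    except metadata.PackageNotFoundError:
        # Package is absent from this environment; nothing more to inspect.
        return info
    info["installed"] = True
    try:
        features = importlib.import_module("datasets.features.features")
        # Same membership check as in the report; the registry is private,
        # so fall back to an empty mapping if the attribute is missing.
        registry = getattr(features, "_FEATURE_TYPES", {})
        info["has_list_feature"] = "List" in registry
    except Exception as exc:  # import problems are themselves a useful signal
        info["import_error"] = repr(exc)
    return info


if __name__ == "__main__":
    print(describe_datasets_env())
```

If the CI output shows `has_list_feature` as `False` while the local output shows `True`, that would support the environment-specific reading of the failure.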

Likely cause

This points to the ROCm CI environment itself, potentially one or more of:

  • outdated/inconsistent datasets package
  • stale HuggingFace dataset cache
  • dependency mismatch
  • CI image drift between ROCm and upstream environments

The issue does not appear to be related to ROCm kernel execution or model runtime initialization.
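
The first two hypotheses can be checked with a few lines of standard-library Python. The sketch below is illustrative only: the helper names are hypothetical, and the `4.0.0` threshold is an assumption (my understanding is that `List` was added as a feature type in the 4.x line of `datasets`) that should be confirmed against the `datasets` release notes before relying on it. `HF_HOME` and `HF_DATASETS_CACHE` are the environment variables HuggingFace libraries read to locate their caches.

```python
import os
import re


def parse_version(text):
    """Turn '4.0.0' or '3.6.0.dev0' into a 3-tuple of leading integers."""
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if not match:
            break  # stop at the first non-numeric component such as 'dev0'
        parts.append(int(match.group()))
    parts = parts[:3]
    # Pad so that '4.0' and '4.0.0' compare as equal.
    return tuple(parts + [0] * (3 - len(parts)))


def is_older_than(installed, minimum):
    """Return True if `installed` sorts strictly before `minimum`."""
    return parse_version(installed) < parse_version(minimum)


def cache_locations(environ=os.environ):
    """Report cache-related environment variables (None when unset)."""
    return {name: environ.get(name) for name in ("HF_HOME", "HF_DATASETS_CACHE")}


# ASSUMPTION: threshold chosen for illustration; verify before use.
ASSUMED_MIN_DATASETS = "4.0.0"
```

For example, `is_older_than("3.6.0", ASSUMED_MIN_DATASETS)` is `True`, which would flag an outdated package. For the stale-cache hypothesis, calling `load_dataset("philschmid/mt-bench", download_mode="force_redownload")` bypasses previously cached files and can rule the cache in or out.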

Additional context

Engine and model initialization succeeds before the failure:

  • graph capture completes
  • engine warmup succeeds
  • failure occurs specifically during load_dataset(...)
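
Because the failure is isolated to the dataset-loading call, one way to make future CI logs easier to read is to wrap that call so the error carries the installed `datasets` version. This is a sketch under stated assumptions, not how `vllm/benchmarks/datasets/datasets.py` is written: `load_with_diagnostics` is a hypothetical helper, and the loader is passed in as an argument so the wrapper can be exercised without network access.

```python
def load_with_diagnostics(loader, name, datasets_version):
    """Call loader(name); re-raise feature-type errors with version context.

    `loader` is injected (for example `datasets.load_dataset`).
    `datasets_version` is the version string of the installed package.
    """
    try:
        return loader(name)
    except ValueError as exc:
        if "Feature type" not in str(exc):
            raise  # unrelated ValueError: leave it untouched
        raise RuntimeError(
            f"Loading {name!r} failed with datasets=={datasets_version}: {exc}. "
            "The dataset metadata may use a feature type this version "
            "does not register."
        ) from exc
```

In real use the call might look like `load_with_diagnostics(load_dataset, "philschmid/mt-bench", datasets.__version__)`, which keeps the original traceback via exception chaining while adding the version to the top-level message.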
