vllm - 💡(How to fix) Fix [CI Failure]: mi300_1: DeepSeek V2-Lite Prefetch Offload Accuracy (H100-MI300) [3 comments, 2 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
vllm-project/vllm#41579Fetched 2026-05-04 04:58:44
View on GitHub
Comments
3
Participants
2
Timeline
17
Reactions
0
Timeline (top)
mentioned ×4subscribed ×4commented ×3added_to_project_v2 ×2

Error Message

  • python3 - Traceback (most recent call last): File "<stdin>", line 3, in <module> AssertionError: deepseek-ai/DeepSeek-V2-Lite prefetch_offload accuracy 0.015

Root Cause

  • Flaky test
  • Can reproduce locally
  • Caused by external libraries (e.g. bug in transformers)

Code Example

+ python3 -
Traceback (most recent call last):
  File "<stdin>", line 3, in <module>
AssertionError: deepseek-ai/DeepSeek-V2-Lite prefetch_offload accuracy 0.015
RAW_BUFFERClick to expand / collapse

Name of failing test

(command rocm-smi || true) && export VLLM_TEST_GROUP_NAME=mi300_1-deepseek-v2-lite-prefetch-offload-accuracy-h100-mi300 && export VLLM_ALLOW_DEPRECATED_BEAM_SEARCH=1 && cd /vllm-workspace && bash .buildkite/scripts/scheduled_integration_test/deepseek_v2_lite_prefetch_offload.sh 0.25 200 8030

Basic information

  • Flaky test
  • Can reproduce locally
  • Caused by external libraries (e.g. bug in transformers)

🧪 Describe the failing test

+ python3 -
Traceback (most recent call last):
  File "<stdin>", line 3, in <module>
AssertionError: deepseek-ai/DeepSeek-V2-Lite prefetch_offload accuracy 0.015

📝 History of failing test

  • Current streak start: 2026-05-02
  • First failure in 60d window: 2026-04-21
  • Last successful nightly: 2026-05-01
  • Break frequency (60d, pass↔fail flips): 2
  • Latest nightly date: 2026-05-03
  • Latest build(s): amd-ci #8177
  • Latest hardware status: mi300_1=fail

extent analysis

TL;DR

The failing test can be mitigated by investigating the deepseek_v2_lite_prefetch_offload.sh script and the AssertionError related to the deepseek-ai/DeepSeek-V2-Lite model's prefetch offload accuracy.

Guidance

  • Review the deepseek_v2_lite_prefetch_offload.sh script to understand the test workflow and identify potential issues.
  • Investigate the AssertionError and the expected accuracy value of 0.015 to determine if it's a valid threshold.
  • Check the model's configuration and training data to ensure they are correct and up-to-date.
  • Run the test with additional logging or debugging to gather more information about the failure.

Example

No code snippet is provided as the issue lacks specific code details.

Notes

The issue seems to be related to a specific test case and model configuration. The provided information is limited, and further investigation is needed to determine the root cause.

Recommendation

Apply workaround: Investigate and adjust the deepseek_v2_lite_prefetch_offload.sh script and model configuration to resolve the AssertionError and improve test reliability.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING