vllm - 💡(How to fix) Fix [vLLM IR] E2E lowering tests [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
vllm-project/vllm#39358Fetched 2026-04-09 07:51:39
View on GitHub
Comments
0
Participants
1
Timeline
4
Reactions
0
Participants
Timeline (top)
added_to_project_v2 ×1issue_type_added ×1labeled ×1project_v2_item_status_changed ×1
RAW_BUFFERClick to expand / collapse

Current lowering tests are very one-off: just invoke the rms_norm op and manually look at the lowering dictionary in the pass. As more ops are added, we should make sure this testing is robust: per-op, unit tests (with fake ops), and E2E tests on real models.

We should add the following tests:

  • per-op lowering tests, making sure all (supported) op implementations are lowered correctly (part of #38782). These should make sure all implementation and supports_args functions are properly executable by Dynamo. We should also check that supports_args does not specialize on the batch size by invoking it on an unbacked integer.
  • lowering unit tests, using fake ops & implementations. Crucially it should stress-test implementation selection by using fake ops with complex supports_args.
  • E2E correctness tests, comparing the lowering pipeline with ir_enable_torch_wrap=False where implementations get traced through with Dynamo directly, and comparing with no lowering where IR ops remain in the Inductor-produced artifact. This should compare outputs and confirm the correct number of ops are lowered into chosen implementations.

extent analysis

TL;DR

Add per-op lowering tests, unit tests with fake ops, and E2E tests to ensure robust testing of the lowering process.

Guidance

  • Create per-op lowering tests to verify that all supported op implementations are lowered correctly, including checking supports_args functions.
  • Develop unit tests using fake ops and implementations to stress-test implementation selection, focusing on complex supports_args scenarios.
  • Design E2E correctness tests to compare the lowering pipeline with and without tracing implementations through Dynamo, validating output correctness and op counting.
  • Ensure that supports_args does not specialize on batch size by testing with unbacked integers.

Notes

The proposed tests aim to improve the robustness of the lowering process testing, but the exact implementation details are not provided in the issue.

Recommendation

Apply workaround by adding the proposed tests to ensure robust testing of the lowering process, as this will help identify and fix potential issues with op implementation lowering and implementation selection.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING