vllm - [vLLM IR] E2E lowering tests [1 participant]

ProExpertProg · 2026-04-08T22:37:03Z

Current lowering tests are very one-off: just invoke the `rms_norm` op and manually look at the lowering dictionary in the pass. As more ops are added, we should make sure this testing is robust: per-op, unit tests (with fake ops), and E2E tests on real models.

We should add the following tests:

- per-op lowering tests, making sure all (supported) op implementations are lowered correctly (part of #38782). These should make sure all implementation and `supports_args` functions are properly executable by Dynamo. We should also check that `supports_args` does not specialize on the batch size by invoking it on an unbacked integer.
- lowering unit tests, using fake ops & implementations. Crucially it should stress-test implementation selection by using fake ops with complex `supports_args`.
- E2E correctness tests, comparing the lowering pipeline with `ir_enable_torch_wrap=False` where implementations get traced through with Dynamo directly, and comparing with no lowering where IR ops remain in the Inductor-produced artifact. This should compare outputs and confirm the correct number of ops are lowered into chosen implementations.
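The fake-op unit tests described above could look roughly like the following. This is a minimal pure-Python sketch, not vLLM's actual IR API: `FakeImpl`, `FakeOp`, `select`, and the priority-ordered selection rule are all hypothetical names and assumptions made for illustration. The point is only that fake implementations with overlapping, non-trivial `supports_args` predicates make selection bugs visible.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class FakeImpl:
    """Hypothetical stand-in for one implementation of an IR op."""
    name: str
    supports_args: Callable[[int, str], bool]


@dataclass
class FakeOp:
    """Hypothetical stand-in for an IR op with implementations in priority order."""
    name: str
    impls: list[FakeImpl] = field(default_factory=list)

    def select(self, hidden_size: int, dtype: str) -> str:
        # Assumption: the first implementation whose predicate accepts the args wins.
        for impl in self.impls:
            if impl.supports_args(hidden_size, dtype):
                return impl.name
        raise LookupError(f"no implementation of {self.name} supports these args")


# Overlapping predicates on purpose, so that ordering and fallthrough are exercised.
fake_norm = FakeOp(
    "fake_norm",
    [
        FakeImpl("fused", lambda h, d: d in ("fp16", "bf16") and h % 8 == 0 and h <= 8192),
        FakeImpl("vectorized", lambda h, d: h % 4 == 0),
        FakeImpl("native", lambda h, d: True),
    ],
)

# (hidden_size, dtype) -> implementation expected to be chosen
CASES = {
    (4096, "fp16"): "fused",
    (4096, "fp32"): "vectorized",   # wrong dtype for "fused"
    (16384, "bf16"): "vectorized",  # too large for "fused"
    (4099, "fp16"): "native",       # not divisible by 4 or 8
}

for args, expected in CASES.items():
    assert fake_norm.select(*args) == expected
```

In a real test suite the table of cases would more naturally become a parametrized test, and the fake ops would be registered through whatever mechanism the IR actually provides.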

vllm-project/vllm#39358 · Fetched 2026-04-09 07:51:39

extent analysis

TL;DR

Add per-op lowering tests, unit tests with fake ops, and E2E tests to ensure robust testing of the lowering process.

Guidance

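- Create per-op lowering tests that verify every supported op implementation is lowered correctly and that each implementation and its `supports_args` function can be executed by Dynamo.
- Develop lowering unit tests with fake ops and implementations that stress-test implementation selection, using fake ops with complex `supports_args`.
- Design E2E correctness tests that compare the lowering pipeline against two baselines: `ir_enable_torch_wrap=False`, where implementations are traced through by Dynamo directly, and no lowering, where IR ops remain in the Inductor-produced artifact. These tests should compare outputs and confirm that the expected number of ops is lowered into the chosen implementations.
- Ensure that `supports_args` does not specialize on the batch size by invoking it with an unbacked integer.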
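The batch-size check in the last item can be illustrated without PyTorch. The sketch below is a pure-Python analogy, not the real mechanism: `OpaqueBatchSize` is a hypothetical stand-in for an unbacked symbolic integer that raises as soon as code compares it or converts it to a concrete value. The actual test would pass a real unbacked integer and rely on the compiler's own guard errors. All names here are assumptions.

```python
class SpecializationError(Exception):
    """Raised when code tries to learn the concrete batch size."""


class OpaqueBatchSize:
    """Hypothetical stand-in for an unbacked integer: any inspection fails."""

    def _fail(self, *_args):
        raise SpecializationError("supports_args inspected the batch size")

    # Conversions and comparisons would all force a concrete value.
    __bool__ = __int__ = __index__ = _fail
    __eq__ = __ne__ = __lt__ = __le__ = __gt__ = __ge__ = _fail


def specializes_on_batch_size(supports_args, hidden_size):
    """Return True if calling supports_args forces a decision on the batch size."""
    try:
        supports_args(OpaqueBatchSize(), hidden_size)
    except SpecializationError:
        return True
    return False


def good_supports_args(batch_size, hidden_size):
    # Depends only on a static dimension, so it never touches batch_size.
    return hidden_size % 8 == 0


def bad_supports_args(batch_size, hidden_size):
    # Branches on the batch size, which would specialize the compiled graph.
    return batch_size <= 16 and hidden_size % 8 == 0


assert not specializes_on_batch_size(good_supports_args, 4096)
assert specializes_on_batch_size(bad_supports_args, 4096)
```

A per-op test would run a check of this kind over every registered `supports_args` function, so that a newly added implementation cannot silently introduce a batch-size guard.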

Notes

The proposed tests aim to improve the robustness of the lowering process testing, but the exact implementation details are not provided in the issue.

Recommendation

Add the proposed tests. The issue is a request for test coverage rather than a bug with a workaround, and the tests will help catch problems with op lowering and implementation selection as more ops are added.
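The shape of the E2E check (compare outputs, then count lowered ops) can be sketched on a toy graph. Everything below is an assumption made for illustration: the "graph" is a plain list of node names, `LOWERINGS` is a hypothetical lowering table, and the two `rms_norm` functions are simplified (no learned weight). A real test would instead run a model under the different compilation settings and inspect the Inductor-produced artifact.

```python
import math


def rms_norm_reference(xs, eps=1e-6):
    """Toy reference semantics of the IR op."""
    rms = math.sqrt(sum(v * v for v in xs) / len(xs) + eps)
    return [v / rms for v in xs]


def rms_norm_impl(xs, eps=1e-6):
    """A differently written implementation that should match the reference."""
    inv = (sum(v * v for v in xs) / len(xs) + eps) ** -0.5
    return [v * inv for v in xs]


def relu(xs):
    return [max(v, 0.0) for v in xs]


# Toy "graph": an ordered list of node targets; IR ops carry an "ir." prefix.
GRAPH = ["ir.rms_norm", "relu", "ir.rms_norm"]
KERNELS = {"ir.rms_norm": rms_norm_reference, "impl.rms_norm": rms_norm_impl, "relu": relu}
LOWERINGS = {"ir.rms_norm": "impl.rms_norm"}  # hypothetical lowering table


def lower(graph):
    """Replace IR ops with their chosen implementation and count the replacements."""
    lowered = [LOWERINGS.get(target, target) for target in graph]
    count = sum(1 for target in graph if target in LOWERINGS)
    return lowered, count


def run(graph, xs):
    for target in graph:
        xs = KERNELS[target](xs)
    return xs


def check_e2e(graph, xs):
    lowered, count = lower(graph)
    # Every IR op must be lowered, and none may survive in the lowered graph.
    assert count == sum(t.startswith("ir.") for t in graph)
    assert not any(t.startswith("ir.") for t in lowered)
    # Lowered and unlowered graphs must agree numerically.
    ref, out = run(graph, xs), run(lowered, xs)
    assert all(math.isclose(a, b, rel_tol=1e-9, abs_tol=1e-12) for a, b in zip(ref, out))
    return count


assert check_e2e(GRAPH, [1.0, -2.0, 3.0]) == 2
```

Counting lowered ops alongside the numerical comparison matters because outputs can still match when lowering silently does nothing.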
