pytorch - ✅(Solved) Fix DISABLED test_passed_swizzle_arrays_cuda (__main__.TestFP8MatmulCUDA) [2 pull requests, 1 comments, 2 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
pytorch/pytorch#180073Fetched 2026-04-11 06:08:29
View on GitHub
Comments
1
Participants
2
Timeline
30
Reactions
0
Author
Timeline (top)
mentioned ×11subscribed ×11labeled ×7commented ×1

Root Cause

This test was disabled because it is failing on main branch (recent examples).

PR fix notes

PR #179782: [ROCm] Enable ROCm swizzle check and update scaled_mm swizzle tests

Description (problem / solution / changelog)

On ROCm, require MX scaled_mm swizzle inputs to provide one value for both A and B and enforce that both are NO_SWIZZLE. Update test_passed_swizzle_arrays to use ROCm-specific expectations and add coverage for the explicit NO_SWIZZLE value check. For nvfp4, swizzle check is skipped but eventually fails with error NVFP4 scaling not supported on ROCM. miscellaneous fix: fix swizzle validation error messages to use correct singular/plural value wording.

Fixes https://github.com/pytorch/pytorch/issues/180073 cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @jataylo @hongxiayang @naromero77amd @pragupta @jerrymannil @xinyazhang

Changed files

  • aten/src/ATen/native/cuda/ScaledBlas.cpp (modified, +35/-7)
  • test/test_scaled_matmul_cuda.py (modified, +33/-8)

PR #180691: [ROCm] Enable ROCm swizzle check and update scaled_mm swizzle tests

Description (problem / solution / changelog)

On ROCm, require MX scaled_mm swizzle inputs to provide one value for both A and B and enforce that both are NO_SWIZZLE. Update test_passed_swizzle_arrays to use ROCm-specific expectations and add coverage for the explicit NO_SWIZZLE value check. For nvfp4, swizzle check is skipped but eventually fails with error NVFP4 scaling not supported on ROCM. miscellaneous fix: fix swizzle validation error messages to use correct singular/plural value wording.

Fixes https://github.com/pytorch/pytorch/issues/180073 cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @jataylo @hongxiayang @naromero77amd @pragupta @jerrymannil @xinyazhang

Changed files

  • aten/src/ATen/native/cuda/ScaledBlas.cpp (modified, +35/-7)
  • test/test_scaled_matmul_cuda.py (modified, +33/-8)
RAW_BUFFERClick to expand / collapse

Platforms: rocm

This test was disabled because it is failing on main branch (recent examples).

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @jataylo @hongxiayang @naromero77amd @jerrymannil @xinyazhang @mruberry

extent analysis

TL;DR

The test test_passed_swizzle_arrays_cuda in test_scaled_matmul_cuda.py is failing on the main branch and needs to be investigated and fixed.

Guidance

  • Review the recent failure examples on torch-ci.com to understand the failure patterns and potential causes.
  • Check the test case test_passed_swizzle_arrays_cuda in test_scaled_matmul_cuda.py for any platform-specific issues related to rocm.
  • Investigate potential issues with the swizzle arrays functionality on CUDA.
  • Consider re-enabling the test and running it locally to gather more information about the failure.

Notes

The issue seems to be specific to the rocm platform, so any fixes or workarounds may need to be targeted at this platform.

Recommendation

Apply workaround: Temporarily disable the failing test until a proper fix can be implemented, to prevent blocking other changes on the main branch.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

pytorch - ✅(Solved) Fix DISABLED test_passed_swizzle_arrays_cuda (__main__.TestFP8MatmulCUDA) [2 pull requests, 1 comments, 2 participants]