pytorch - ✅(Solved) Fix DISABLED test_scaled_mm_deepseek_error_messages_bfloat16_lhs_block_1_rhs_block_128_M_256_N_256_K_256_cuda (__main__.TestFP8MatmulCUDA) [2 pull requests, 1 comments, 2 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
pytorch/pytorch#180076Fetched 2026-04-11 06:08:24
View on GitHub
Comments
1
Participants
2
Timeline
29
Reactions
0
Author
Timeline (top)
mentioned ×11subscribed ×11labeled ×6commented ×1

Root Cause

This test was disabled because it is failing on main branch (recent examples).

PR fix notes

PR #180384: [ROCm] Update scaled_mm DeepSeek error message

Description (problem / solution / changelog)

Fixes https://github.com/pytorch/pytorch/issues/180074. Fixes https://github.com/pytorch/pytorch/issues/180075. Fixes https://github.com/pytorch/pytorch/issues/180076. Fixes https://github.com/pytorch/pytorch/issues/180077. Fixes https://github.com/pytorch/pytorch/issues/180078. Fixes https://github.com/pytorch/pytorch/issues/179954.

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @jataylo @hongxiayang @naromero77amd @pragupta @jerrymannil @xinyazhang

Changed files

  • test/test_scaled_matmul_cuda.py (modified, +2/-2)

PR #180690: [ROCm] Update scaled_mm DeepSeek error message

Description (problem / solution / changelog)

Fixes https://github.com/pytorch/pytorch/issues/180074. Fixes https://github.com/pytorch/pytorch/issues/180075. Fixes https://github.com/pytorch/pytorch/issues/180076. Fixes https://github.com/pytorch/pytorch/issues/180077. Fixes https://github.com/pytorch/pytorch/issues/180078. Fixes https://github.com/pytorch/pytorch/issues/179954.

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @jataylo @hongxiayang @naromero77amd @pragupta @jerrymannil @xinyazhang

Changed files

  • test/test_scaled_matmul_cuda.py (modified, +2/-2)
RAW_BUFFERClick to expand / collapse

Platforms: rocm

This test was disabled because it is failing on main branch (recent examples).

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @jataylo @hongxiayang @naromero77amd @jerrymannil @xinyazhang @mruberry

extent analysis

TL;DR

  • Re-enable and re-run the disabled test test_scaled_matmul_cuda.py to assess the current failure status on the main branch.

Guidance

  • Review the recent failure examples on torch-ci.com to understand the error messages and potential causes.
  • Investigate the test_scaled_mm_deepseek_error_messages_bfloat16_lhs_block_1_rhs_block_128_M_256_N_256_K_256_cuda test case for specific failure conditions.
  • Check for any recent changes in the main branch that might be contributing to the test failure.
  • Consider reaching out to the mentioned individuals (e.g., @jeffdaily, @sunway513) for additional insight or expertise on the test and its dependencies.

Notes

  • The issue lacks detailed technical information about the failure, so a thorough investigation of the test case and recent changes is necessary.
  • Collaboration with the mentioned individuals may be crucial in resolving the issue due to their potential familiarity with the test and its components.

Recommendation

  • Apply workaround: Re-enable the test and gather more information about the failure to inform a more targeted fix.
  • Reason: The current information is insufficient to propose a direct fix, so gathering more data is a necessary step.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

pytorch - ✅(Solved) Fix DISABLED test_scaled_mm_deepseek_error_messages_bfloat16_lhs_block_1_rhs_block_128_M_256_N_256_K_256_cuda (__main__.TestFP8MatmulCUDA) [2 pull requests, 1 comments, 2 participants]