pytorch - 💡(How to fix) Fix AArch64 Unit Test Failure - Multiple failures in SDPAPatternRewriterCpuTests [1 participants]

pytorch2026-03-12 10:51:51

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

pytorch/pytorch#177244•Fetched 2026-04-08 00:42:57

View on GitHub

Comments

Participants

Timeline

144

Reactions

Author

robert-hardwick

Participants

robert-hardwick

Timeline (top)

mentioned ×60subscribed ×60referenced ×15labeled ×8

Error Message

Traceback (most recent call last): File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_fused_attention.py", line 366, in _test_sdpa_rewriter_5 self._check_common(sfdp_pattern_5_v1, contains=False) File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_fused_attention.py", line 114, in _check_common self.assertEqual(result1, result2, atol=atol, rtol=rtol) File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual return super().assertEqual(x, y, *args, **kwargs) File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4365, in assertEqual raise error_metas.pop()[0].to_error( # type: ignore[index] AssertionError: Tensor-likes are not close!

Mismatched elements: 2259 / 4096 (55.2%) Greatest absolute difference: 0.7239781618118286 at index (3, 1, 13, 1) (up to 0.001 allowed) Greatest relative difference: 412.53863525390625 at index (1, 1, 7, 16) (up to 0.2 allowed)

To execute this test, run the following from the base repo dir: python test/inductor/test_fused_attention.py SDPAPatternRewriterCpuTests.test_sdpa_rewriter_5_cpu

Code Example

Traceback (most recent call last):
  File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_fused_attention.py", line 366, in _test_sdpa_rewriter_5
    self._check_common(sfdp_pattern_5_v1, contains=False)
  File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_fused_attention.py", line 114, in _check_common
    self.assertEqual(result1, result2, atol=atol, rtol=rtol)
  File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual
    return super().assertEqual(x, y, *args, **kwargs)
  File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4365, in assertEqual
    raise error_metas.pop()[0].to_error(  # type: ignore[index]
AssertionError: Tensor-likes are not close!

Mismatched elements: 2259 / 4096 (55.2%)
Greatest absolute difference: 0.7239781618118286 at index (3, 1, 13, 1) (up to 0.001 allowed)
Greatest relative difference: 412.53863525390625 at index (1, 1, 7, 16) (up to 0.2 allowed)

To execute this test, run the following from the base repo dir:
    python test/inductor/test_fused_attention.py SDPAPatternRewriterCpuTests.test_sdpa_rewriter_5_cpu

RAW_BUFFERClick to expand / collapse

🐛 Describe the bug

Creating this issue in order to mark tests as XFAIL

There are actually 3 similarly failing tests

SDPAPatternRewriterCpuDynamicTests - test_sdpa_rewriter_14_cpu SDPAPatternRewriterCpuTests - test_sdpa_rewriter_5_cpu SDPAPatternRewriterCpuTests - test_sdpa_rewriter_14_cpu SDPAPatternRewriterCpuDynamicTests - test_sdpa_rewriter_5_cpu

Example Traceback

Traceback (most recent call last):
  File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_fused_attention.py", line 366, in _test_sdpa_rewriter_5
    self._check_common(sfdp_pattern_5_v1, contains=False)
  File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_fused_attention.py", line 114, in _check_common
    self.assertEqual(result1, result2, atol=atol, rtol=rtol)
  File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual
    return super().assertEqual(x, y, *args, **kwargs)
  File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4365, in assertEqual
    raise error_metas.pop()[0].to_error(  # type: ignore[index]
AssertionError: Tensor-likes are not close!

Mismatched elements: 2259 / 4096 (55.2%)
Greatest absolute difference: 0.7239781618118286 at index (3, 1, 13, 1) (up to 0.001 allowed)
Greatest relative difference: 412.53863525390625 at index (1, 1, 7, 16) (up to 0.2 allowed)

To execute this test, run the following from the base repo dir:
    python test/inductor/test_fused_attention.py SDPAPatternRewriterCpuTests.test_sdpa_rewriter_5_cpu

Affects Neoverse-V1

Versions

Commit - https://github.com/pytorch/pytorch/commit/08b6f48d871affbc7abe9277020aed882fdf110a

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @jerryzh168 @aditew01 @mruberry @snadampal @milpuz01 @nikhil-arm @fadara01 @nWEIdia @chauhang @penguinwu @voznesenskym @EikanWang @Guobing-Chen @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @kadeng @muchulee8 @amjames @aakhundov @coconutruben @jataylo @drisspg @liangel-02 @howardzhang-cv

extent analysis

Fix Plan

To fix the failing tests, we need to update the tolerance values in the assertEqual method. The current values are too strict, causing the tests to fail due to minor differences in the results.

Update the atol and rtol values in the assertEqual method to more reasonable values, such as atol=1e-3 and rtol=1e-2.
Alternatively, consider using a more robust comparison method, such as torch.allclose with custom tolerance values.

Example code:

self.assertEqual(result1, result2, atol=1e-3, rtol=1e-2)

Or:

self.assertTrue(torch.allclose(result1, result2, atol=1e-3, rtol=1e-2))

Verification

To verify that the fix worked, re-run the failing tests with the updated tolerance values. If the tests pass, it indicates that the fix was successful.

Extra Tips

When updating tolerance values, make sure to consider the specific requirements of the test and the expected results.
Using more robust comparison methods, such as torch.allclose, can help reduce the likelihood of false positives due to minor differences in results.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #ssr #installation #tensor shape #autograd error #model loading #dependency error #configuration error #environment variable #network issue

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

pytorch - 💡(How to fix) Fix AArch64 Unit Test Failure - Multiple failures in SDPAPatternRewriterCpuTests [1 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Code Example

🐛 Describe the bug

Versions

extent analysis

Fix Plan

Verification

Extra Tips

Still need to ship something?

TRENDING

pytorch - 💡(How to fix) Fix AArch64 Unit Test Failure - Multiple failures in SDPAPatternRewriterCpuTests [1 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Code Example

🐛 Describe the bug

Versions

extent analysis

Fix Plan

Verification

Extra Tips

Still need to ship something?

RELATED_DISCOVERY

TRENDING