pytorch - ✅(Solved) Fix AArch64 Unit Test Failure - test_aot_inductor_arrayref - RuntimeError: unknown architecure [1 pull requests, 1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
pytorch/pytorch#177254Fetched 2026-04-08 00:42:40
View on GitHub
Comments
0
Participants
1
Timeline
48
Reactions
0
Participants
Timeline (top)
referenced ×15mentioned ×14subscribed ×14labeled ×3

Error Message

Traceback (most recent call last): File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_aot_inductor.py", line 1744, in test_quanatized_int8_linear self.check_model(Model(self.device), example_inputs) File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_aot_inductor_utils.py", line 249, in check_model expected = ref_model(*ref_inputs) File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl return self._call_impl(*args, **kwargs) File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl return forward_call(*args, **kwargs) File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_aot_inductor.py", line 1729, in forward return torch.ops._quantized.wrapped_quantized_linear( File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 1275, in call return self._op(*args, **kwargs) RuntimeError: unknown architecure

To execute this test, run the following from the base repo dir: python test/inductor/test_aot_inductor_arrayref.py AOTInductorTestABICompatibleCpuWithStackAllocation.test_quanatized_int8_linear_cpu_with_stack_allocation

Fix Action

Fixed

PR fix notes

PR #177584: Add AArch64 xfails for inductor, nn, jit, and linalg tests

Description (problem / solution / changelog)

Stack from ghstack (oldest at bottom):

  • -> #177584

This PR marks all known unit test failures for AArch64 as xfail or skip with a small code comment referencing the github issues. The test files affected are also added to the linux-aarch64 unit test suite.

Once this PR has been merged we should be able to run ALL unit tests on all AArch64 cpus without any reported failures ( this will be a follow up PR ).

Related PRs #177243, #177244, #177245, #177247, #177249, #177250, #177251, #177254, #177255, #177258, #177264, #170787, #146483, #177327

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @jataylo @fadara01 @aditew01 @nikhil-arm @milpuz01

Changed files

  • .ci/pytorch/test.sh (modified, +3/-0)
  • test/inductor/test_aot_inductor.py (modified, +7/-0)
  • test/inductor/test_cpu_repro.py (modified, +8/-0)
  • test/inductor/test_cpu_select_algorithm.py (modified, +8/-0)
  • test/inductor/test_fused_attention.py (modified, +14/-3)
  • test/inductor/test_torchinductor.py (modified, +5/-1)
  • test/inductor/test_torchinductor_opinfo.py (modified, +6/-2)
  • test/jit/test_freezing.py (modified, +4/-0)
  • test/nn/test_convolution.py (modified, +6/-0)
  • test/test_jit.py (modified, +4/-2)
  • test/test_jit_autocast.py (modified, +13/-1)
  • test/test_nn.py (modified, +4/-1)
  • torch/testing/_internal/common_methods_invocations.py (modified, +11/-0)
  • torch/testing/_internal/opinfo/definitions/linalg.py (modified, +21/-0)

Code Example

Traceback (most recent call last):
  File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_aot_inductor.py", line 1744, in test_quanatized_int8_linear
    self.check_model(Model(self.device), example_inputs)
  File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_aot_inductor_utils.py", line 249, in check_model
    expected = ref_model(*ref_inputs)
  File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
    return forward_call(*args, **kwargs)
  File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_aot_inductor.py", line 1729, in forward
    return torch.ops._quantized.wrapped_quantized_linear(
  File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 1275, in __call__
    return self._op(*args, **kwargs)
RuntimeError: unknown architecure

To execute this test, run the following from the base repo dir:
    python test/inductor/test_aot_inductor_arrayref.py AOTInductorTestABICompatibleCpuWithStackAllocation.test_quanatized_int8_linear_cpu_with_stack_allocation
RAW_BUFFERClick to expand / collapse

🐛 Describe the bug

Test

test/inductor/test_aot_inductor_arrayref.py AOTInductorTestABICompatibleCpuWithStackAllocation.test_quanatized_int8_linear_cpu_with_stack_allocation

Stracktrace

Traceback (most recent call last):
  File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_aot_inductor.py", line 1744, in test_quanatized_int8_linear
    self.check_model(Model(self.device), example_inputs)
  File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_aot_inductor_utils.py", line 249, in check_model
    expected = ref_model(*ref_inputs)
  File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
    return forward_call(*args, **kwargs)
  File "/builds/software-machine-learning-infra-frameworks-workspaces-robhar02/pytorch/test/inductor/test_aot_inductor.py", line 1729, in forward
    return torch.ops._quantized.wrapped_quantized_linear(
  File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 1275, in __call__
    return self._op(*args, **kwargs)
RuntimeError: unknown architecure

To execute this test, run the following from the base repo dir:
    python test/inductor/test_aot_inductor_arrayref.py AOTInductorTestABICompatibleCpuWithStackAllocation.test_quanatized_int8_linear_cpu_with_stack_allocation

Affects all AArch64

Versions

Commit - https://github.com/pytorch/pytorch/commit/08b6f48d871affbc7abe9277020aed882fdf110a

cc @jerryzh168 @jianyuh @raghuramank100 @jamesr66a @vkuzo @jgong5 @Xia-Weiwen @leslie-fang-intel @snadampal @milpuz01 @aditew01 @nikhil-arm @fadara01 @nWEIdia

extent analysis

Fix Plan

The fix involves updating the torch.ops._quantized.wrapped_quantized_linear function to support AArch64 architecture.

  • Update the torch.ops module to include AArch64 architecture support.
  • Modify the wrapped_quantized_linear function to handle the new architecture.

Example code:

import torch

# Update torch.ops to support AArch64
torch.ops._quantized.register_module("wrapped_quantized_linear", 
                                     lambda *args, **kwargs: 
                                     torch.ops._quantized._wrapped_quantized_linear_aa(*args, **kwargs))

# Define the new _wrapped_quantized_linear_aa function
def _wrapped_quantized_linear_aa(input, weight, bias, scale, zero_point, dtype):
    # Implementation for AArch64 architecture
    # ...
    pass

Verification

To verify the fix, run the test case again:

python test/inductor/test_aot_inductor_arrayref.py AOTInductorTestABICompatibleCpuWithStackAllocation.test_quanatized_int8_linear_cpu_with_stack_allocation

If the test passes, the fix is successful.

Extra Tips

  • Ensure that the torch.ops module is updated correctly to support the new architecture.
  • Test the fix on different platforms to ensure compatibility.
  • Consider adding additional logging or error handling to the wrapped_quantized_linear function to handle any potential issues.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING