PR #173663: [RISCV] disable cuda-bingings on riscv64 CI

fernchen · 2026-04-21T12:42:39Z

[pytorch] PR 173663: RISCV disable cuda-bingings on riscv64 CI - Repository: pytorch/pytorch - Author: yuzibo - State: closed | merged: False - Link: https://g… # PR #173663: [RISCV] disable cuda-bingings on riscv64 CI - Repository: pytorch/pytorch - Author: yuzibo - State: closed | merged: False - Link: https://github.com/pytorch/pytorch/pull/173663 ## Description (problem / solution / changelog) cuda-bindings is currently gated on riscv64 because there is no supported CUDA toolchain or installable Python package available for this architecture. Attempting to resolve cuda-bindings on riscv64 results in install-time failures, even when CUDA is explicitly disabled at build time. This change disables cuda-bindings on riscv64 for now, and it can be re-enabled once official CUDA support or a working packaging path becomes available. ## Changed files - `.ci/docker/requirements-ci.txt` (modified, +1/-1) --- # PR #178778: [CI] Restrict mkl installation to x86 systems only - Repository: pytorch/pytorch - Author: yuzibo - State: open | merged: False - Link: https://github.com/pytorch/pytorch/pull/178778 ## Description (problem / solution / changelog) For mkl [2024.02](https://pypi.org/project/mkl/2024.2.0/#files), there is no riscv64 wheel support, so disable it at this stage. Otherwise it will block CI on riscv64. ## Changed files - `.ci/docker/requirements-ci.txt` (modified, +2/-2) ## Fix / Workaround * [ ] Fix RISC-V architecture-specific bugs and upstream patches * [ ] Implement dispatch mechanism (select optimal kernel at runtime) ## Background Following the prior RFC (https://github.com/pytorch/pytorch/issues/171659), we have concluded multiple rounds of discussions and initial experiments. This effort has been jointly driven by **XuanTie Team**(from Alibaba Damo) and **Ruyi Community**(from Institute of Software, Chinese Academy of Sciences), with active participation and valuable inputs from multiple community partners @CodersAcademy006 @yuzibo @zhanghb97 @malfet @ezyang . Through these collective efforts, we have now arrived at a set of concrete and actionable goals for enabling PyTorch on the RISC-V ecosystem. This issue serves as a central tracking item to monitor the concrete implementation tasks towards enabling full RISC-V support in PyTorch. ## Tracking ### Phase 1: CI Infrastructure and Validation **Goal:** Establish a robust CI pipeline for RISC-V builds and testing #### 1.1 Build Environment & Wheel Construction * [ ] Set up RISC-V cross-compilation and native compilation environment (RuyiRepo PyPI [https://github.com/RuyiRepo/ruyirepo](https://github.com/RuyiRepo/ruyirepo?tab=readme-ov-file)) * [ ] Build and validate PyTorch wheel package for RISC-V (riscv64) * [https://github.com/pytorch/pytorch/pull/173663](https://github.com/pytorch/pytorch/pull/173663) * [https://github.com/pytorch/pytorch/pull/178778](https://github.com/pytorch/pytorch/pull/178778) #### 1.2 CI Pipeline Integration * [ ] Jenkins CI build for RISC-V PyTorch - https://community-ci.openruyi.cn/rvci/job/PytorchCi/job/11-pytorch-ci-wip/ * [ ] GitHub Actions CI build for RISC-V PyTorch * [ ] Automated monitoring agent flow for CI pipeline #### 1.3 Test Suite Validation & Bug Fixing * [ ] Build and maintain RISC-V test blocklist - https://github.com/RuyiAI-Stack/pytorch/pull/1 * [ ] Fix RISC-V architecture-specific bugs and upstream patches * [ ] Generate and maintain `test_times.json` for RISC-V CI sharding ### Phase 2: High-Performance Micro-Kernel Library (uKernel) **Goal:** Deliver optimized RISC-V implementations for core ATen operators, analogous to KleidiAI for ARM #### 2.1 Library Infrastructure * [ ] Add uKernel library dependency and build system integration into PyTorch * [ ] Implement runtime CPU feature detection for RISC-V (VLEN/RLEN) * [ ] Implement dispatch mechanism (select optimal kernel at runtime) #### 2.2 Core Compute Kernels (GEMM / Matmul) * [ ] Implement FP32 GEMM kernel with RVV * [ ] Implement FP16 GEMM kernel with RVV * [ ] Implement BF16 GEMM kernel with RVV * [ ] Implement FP8 GEMM kernel with RVV * [ ] Implement INT8 GEMM kernel with RVV (quantized) * [ ] Implement INT4 GEMM kernel with RVV (quantized) * [ ] Implement RVFP4 GEMM kernel with RVV * [ ] Implement GEMM with RVM (Matrix Extension) * [ ] Implement batched GEMM * [ ] Implement GEMV (matrix-vector) optimized kernels #### 2.3 Convolution Kernels * [ ] Implement im2col + GEMM based Conv2d with RVV/RVM * [ ] Implement direct Conv2d kernel with RVV/RVM * [ ] Implement depthwise Conv2d with RVV/RVM * [ ] Implement Conv1d with RVV/RVM #### 2.4 Element-wise & Activation Kernels * [ ] Implement vectorized ReLU / ReLU6 / LeakyReLU * [ ] Implement vectorized GELU / SiLU / Swish * [ ] Implement vectorized Sigmoid / Tanh * [ ] Implement vectorized element-wise Add / Mul / Sub / Div * [ ] Implement vectorized Softmax * [ ] Implement vectorized type cast kernels #### 2.5 Normalization & Reduction Kernels * [ ] Implement LayerNorm * [ ] Implement RMSNorm * [ ]

Repository: pytorch/pytorch
Author: yuzibo
State: closed | merged: False
Link: https://github.com/pytorch/pytorch/pull/173663

Description (problem / solution / changelog)

cuda-bindings is currently gated on riscv64 because there is no supported CUDA toolchain or installable Python package available for this architecture.

Attempting to resolve cuda-bindings on riscv64 results in install-time failures, even when CUDA is explicitly disabled at build time.

This change disables cuda-bindings on riscv64 for now, and it can be re-enabled once official CUDA support or a working packaging path becomes available.

Changed files

.ci/docker/requirements-ci.txt (modified, +1/-1)

PR #178778: [CI] Restrict mkl installation to x86 systems only

Repository: pytorch/pytorch
Author: yuzibo
State: open | merged: False
Link: https://github.com/pytorch/pytorch/pull/178778

Description (problem / solution / changelog)

For mkl 2024.02, there is no riscv64 wheel support, so disable it at this stage. Otherwise it will block CI on riscv64.

Changed files

.ci/docker/requirements-ci.txt (modified, +2/-2)

TL;DR

To enable PyTorch on the RISC-V ecosystem, focus on establishing a robust CI pipeline, implementing optimized RISC-V operators, and extending torch.compile backend support.

Guidance

Establish CI Infrastructure: Set up RISC-V cross-compilation and native compilation environments, and integrate Jenkins CI and GitHub Actions CI builds for RISC-V PyTorch.
Implement Optimized Operators: Focus on implementing optimized RISC-V implementations for core ATen operators, including GEMM, convolution, and element-wise operations.
Extend torch.compile Backend: Evaluate and validate existing torch.compile backends on RVV hardware, and extend PyTorch CPU vector ISA abstractions to incorporate RVV semantics.
Validate and Benchmark: Verify compilation and execution correctness of torch.compile() across representative operators, models, and workloads on RISC-V, and benchmark compiled execution against eager execution.

Example

No specific code snippet is provided, as the issue focuses on high-level implementation tasks and goals.

Notes

The provided issue lacks specific technical details and code snippets, making it challenging to provide a detailed solution. However, the guidance points above should help in establishing a robust CI pipeline, implementing optimized operators, and extending torch.compile backend support.

Recommendation

Apply the workaround by focusing on the implementation tasks outlined in the issue, particularly establishing a robust CI pipeline and implementing optimized RISC-V operators. This approach will help enable PyTorch on the RISC-V ecosystem.

pytorch - ✅(Solved) Fix [Tracking] RISC-V PyTorch enablement [2 pull requests, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Fix Action

Fix / Workaround

PR fix notes

PR #173663: [RISCV] disable cuda-bingings on riscv64 CI

Description (problem / solution / changelog)

Changed files

PR #178778: [CI] Restrict mkl installation to x86 systems only

Description (problem / solution / changelog)

Changed files

Background

Tracking

Phase 1: CI Infrastructure and Validation

1.1 Build Environment & Wheel Construction

1.2 CI Pipeline Integration

1.3 Test Suite Validation & Bug Fixing

Phase 2: High-Performance Micro-Kernel Library (uKernel)

2.1 Library Infrastructure

2.2 Core Compute Kernels (GEMM / Matmul)

2.3 Convolution Kernels

2.4 Element-wise & Activation Kernels

2.5 Normalization & Reduction Kernels

2.6 Attention & Transformer-Specific Kernels

2.7 Pooling & Data Rearrangement

Phase 3: torch.compile Backend Extension

3.1 RISC-V Ratified Extensions: Native torch.compile Support

3.2 RISC-V Draft / Custom Extensions: torch.compile Lowering via RuyiAI Buddy Compiler

3.3 Validation and Benchmarking

Phase 4: Triton / TileLang RISC-V Support and PyTorch Integration

Call for Participation

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING

3.1 RISC-V Ratified Extensions: Native `torch.compile` Support

3.2 RISC-V Draft / Custom Extensions: `torch.compile` Lowering via RuyiAI Buddy Compiler