vllm: how to fix [Installation]: ERROR: Failed building wheel for vllm

GitHub stats
vllm-project/vllm#40901 (fetched 2026-04-27 05:29:29)
Comments: 0
Participants: 1
Timeline: 1
Reactions: 0
Timeline (top): labeled ×1

Error Message
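
The diagnostic that stops this build is `chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]`, raised on `static_assert(0 < M <= 16)` in `cpu_attn_vec16.hpp` and `static_assert(0 < M <= 8)` in `cpu_attn_vec.hpp`. In C++ such an expression parses as `(0 < M) <= 16`: the first comparison yields a `bool`, which is promoted to 0 or 1 and then compared with 16, so the assertion holds for every `M`. The sketch below illustrates the difference and the explicit form that would likely satisfy the compiler. The names `chained_as_parsed`, `in_range`, and `checked_rows` are hypothetical and exist only for illustration; they are not vLLM functions, and this is not necessarily the change made upstream.

```cpp
// What the compiler actually evaluates for `0 < m <= hi`:
// (0 < m) yields a bool, promoted to 0 or 1, which is then compared with hi.
constexpr bool chained_as_parsed(int m, int hi) {
  return static_cast<int>(0 < m) <= hi;
}

// The intended mathematical range check, with each comparison spelled out.
constexpr bool in_range(int m, int hi) {
  return 0 < m && m <= hi;
}

// Hypothetical stand-in for a tile kernel templated on the row count M.
template <int M>
constexpr int checked_rows() {
  static_assert(0 < M && M <= 16, "M must be in (0, 16]");
  return M;
}

// The chained form accepts values far outside the range...
static_assert(chained_as_parsed(100, 16), "chained form is true for 100");
static_assert(chained_as_parsed(-5, 16), "chained form is true for -5");
// ...while the explicit form rejects them and accepts valid ones.
static_assert(!in_range(100, 16), "100 is out of range");
static_assert(!in_range(-5, 16), "-5 is out of range");
static_assert(in_range(12, 16), "12 is in range");
static_assert(checked_rows<16>() == 16, "16 is the inclusive upper bound");
```

Because the chained form is always true, rewriting it as `0 < M && M <= 16` should not change which instantiations compile in this log (`gemm_micro<12>`, `gemm_micro<16>`, and `gemm_micro<1>` through `gemm_micro<8>` all fall inside their stated bounds); it only makes the check mean what it says.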

note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all) /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL' 13 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH' 190 | VA_ARGS) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL' 40 | VA_ARGS
| ^ In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1: In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:8: /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:71:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses] 71 | static_assert(0 < M <= 16); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:51:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm_micro<12>' requested here 51 | gemm_micro<12>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size, | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here 953 | tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:143:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm161<float>>::operator()' requested here 143 | attention_iteration(CPU_ATTENTION_PARAMS); | ^ 
/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention>' requested here 1707 | attn_impl.template execute_attention<Attention>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::operator()' requested here 186 | mainloop(&input); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:242:16: note: expanded from macro 'CPU_ATTN_DISPATCH' 242 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES' 24 | AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(VA_ARGS)) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:19:43: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES' 19 | AT_DISPATCH_CASE(at::ScalarType::Float, VA_ARGS)
| ^ note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all) /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL' 13 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH' 190 | VA_ARGS) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL' 40 | VA_ARGS
| ^ In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1: In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:8: /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:71:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses] 71 | static_assert(0 < M <= 16); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:58:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm_micro<16>' requested here 58 | gemm_micro<16>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size, | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here 953 | tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:143:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm161<float>>::operator()' requested here 143 | attention_iteration(CPU_ATTENTION_PARAMS); | ^ 
/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention>' requested here 1707 | attn_impl.template execute_attention<Attention>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::operator()' requested here 186 | mainloop(&input); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:242:16: note: expanded from macro 'CPU_ATTN_DISPATCH' 242 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES' 24 | AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(VA_ARGS)) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:19:43: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES' 19 | AT_DISPATCH_CASE(at::ScalarType::Float, VA_ARGS)
| ^ note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all) /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL' 13 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH' 190 | VA_ARGS) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL' 40 | VA_ARGS
| ^ In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1: In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7: /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses] 56 | static_assert(0 < M <= 8); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:24:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82c10::Half::gemm_micro<1>' requested here 24 | gemm_micro<1>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size, | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82c10::Half::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here 953 | tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82c10::Half>::operator()' requested here 138 | attention_iteration(CPU_ATTENTION_PARAMS); | ^ 
/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here 1707 | attn_impl.template execute_attention<Attention>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here 186 | mainloop(&input); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH' 236 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES' 24 | AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(VA_ARGS)) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES' 20 | AT_DISPATCH_CASE(at::ScalarType::Half, VA_ARGS)
| ^ note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all) /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL' 13 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH' 190 | VA_ARGS) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL' 40 | VA_ARGS
| ^ In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1: In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7: /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses] 56 | static_assert(0 < M <= 8); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:28:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82c10::Half::gemm_micro<2>' requested here 28 | gemm_micro<2>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size, | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82c10::Half::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here 953 | tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82c10::Half>::operator()' requested here 138 | attention_iteration(CPU_ATTENTION_PARAMS); | ^ 
/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here 1707 | attn_impl.template execute_attention<Attention>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here 186 | mainloop(&input); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH' 236 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES' 24 | AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(VA_ARGS)) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES' 20 | AT_DISPATCH_CASE(at::ScalarType::Half, VA_ARGS)
| ^ note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all) /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL' 13 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH' 190 | VA_ARGS) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL' 40 | VA_ARGS
| ^ In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1: In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7: /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses] 56 | static_assert(0 < M <= 8); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:33:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82c10::Half::gemm_micro<4>' requested here 33 | gemm_micro<4>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size, | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82c10::Half::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here 953 | tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82c10::Half>::operator()' requested here 138 | attention_iteration(CPU_ATTENTION_PARAMS); | ^ 
/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here 1707 | attn_impl.template execute_attention<Attention>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here 186 | mainloop(&input); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH' 236 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES' 24 | AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(VA_ARGS)) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES' 20 | AT_DISPATCH_CASE(at::ScalarType::Half, VA_ARGS)
| ^ note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all) /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL' 13 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH' 190 | VA_ARGS) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL' 40 | VA_ARGS
| ^ In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1: In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7: /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses] 56 | static_assert(0 < M <= 8); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:38:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82c10::Half::gemm_micro<6>' requested here 38 | gemm_micro<6>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size, | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82c10::Half::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here 953 | tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82c10::Half>::operator()' requested here 138 | attention_iteration(CPU_ATTENTION_PARAMS); | ^ 
/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here 1707 | attn_impl.template execute_attention<Attention>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here 186 | mainloop(&input); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH' 236 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES' 24 | AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(VA_ARGS)) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES' 20 | AT_DISPATCH_CASE(at::ScalarType::Half, VA_ARGS)
| ^ note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all) /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL' 13 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH' 190 | VA_ARGS) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL' 40 | VA_ARGS
| ^ In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1: In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7: /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses] 56 | static_assert(0 < M <= 8); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:43:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82c10::Half::gemm_micro<8>' requested here 43 | gemm_micro<8>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size, | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82c10::Half::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here 953 | tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82c10::Half>::operator()' requested here 138 | attention_iteration(CPU_ATTENTION_PARAMS); | ^ 
/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here 1707 | attn_impl.template execute_attention<Attention>( | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here 186 | mainloop(&input); | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH' 236 | return VA_ARGS();
| ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES' 24 | AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(VA_ARGS)) | ^ /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES' 20 | AT_DISPATCH_CASE(at::ScalarType::Half, VA_ARGS)
| ^ note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all) /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL' 13 | return VA_ARGS();
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      fatal error: too many errors emitted, stopping now [-ferror-limit=]
      1 warning and 20 errors generated.
      [2/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/mla_decode.cpp.o
      [3/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/layernorm.cpp.o
      [4/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/utils.cpp.o
      [5/9] Building CXX object CMakeFiles/_C.dir/csrc/moe/dynamic_4bit_int_moe_cpu.cpp.o
      [6/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/pos_encoding.cpp.o
      [7/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/activation.cpp.o
      [8/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/torch_bindings.cpp.o
      ninja: build stopped: subcommand failed.
      Traceback (most recent call last):
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 389, in <module>
          main()
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 373, in main
          json_out["return_val"] = hook(**hook_input["kwargs"])
                                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 280, in build_wheel
          return _build_backend().build_wheel(
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 439, in build_wheel
          return _build(['bdist_wheel', '--dist-info-dir', str(metadata_directory)])
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 427, in _build
          return self._build_with_temp_dir(
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 408, in _build_with_temp_dir
          self.run_setup()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 317, in run_setup
          exec(code, locals())
        File "<string>", line 1041, in <module>
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/__init__.py", line 117, in setup
          return distutils.core.setup(**attrs)  # type: ignore[return-value]
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/core.py", line 186, in setup
          return run_commands(dist)
                 ^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/core.py", line 202, in run_commands
          dist.run_commands()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1002, in run_commands
          self.run_command(cmd)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 1107, in run_command
          super().run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
          cmd_obj.run()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/command/bdist_wheel.py", line 370, in run
          self.run_command("build")
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/cmd.py", line 357, in run_command
          self.distribution.run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 1107, in run_command
          super().run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
          cmd_obj.run()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build.py", line 135, in run
          self.run_command(cmd_name)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/cmd.py", line 357, in run_command
          self.distribution.run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 1107, in run_command
          super().run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
          cmd_obj.run()
        File "<string>", line 346, in run
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/command/build_ext.py", line 97, in run
          _build_ext.run(self)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build_ext.py", line 368, in run
          self.build_extensions()
        File "<string>", line 315, in build_extensions
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/subprocess.py", line 413, in check_call
          raise CalledProcessError(retcode, cmd)
      subprocess.CalledProcessError: Command '['cmake', '--build', '.', '-j=10', '--target=_C']' returned non-zero exit status 1.
      [end of output]

note: This error originates from a subprocess, and is likely not a problem with pip.
ERROR: Failed building wheel for vllm
Successfully built tinker-atropos tinker
Failed to build vllm
error: failed-wheel-build-for-install

× Failed to build installable wheels for some pyproject.toml based projects
╰─> vllm

Fix Action

Fix / Workaround
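
The diagnostics reported at `cpu_attn_vec16.hpp:71` and `cpu_attn_vec.hpp:56` point at `static_assert(0 < M <= 16)` and `static_assert(0 < M <= 8)`. In C++ a chained comparison groups left to right, so `0 < M <= 16` means `(0 < M) <= 16`, which holds for every `M`. Below is a minimal sketch of one possible source-level fix, assuming the assertions are meant to bound `M` to the range (0, 16]. The names `chained_check`, `range_check`, and `checked_tile_rows` are hypothetical and only illustrate the pattern; this is not the vLLM code, and it is not a confirmed upstream patch.

```cpp
// Sketch only: checked_tile_rows is a hypothetical stand-in for the
// gemm_micro<M> templates named in the log, not vLLM's actual code.

// What `0 < m <= 16` really computes: `0 < m` yields a bool, which converts
// to 0 or 1 and is therefore always <= 16. The cast makes that conversion
// explicit so the sketch itself compiles without comparison warnings.
constexpr bool chained_check(int m) { return static_cast<int>(0 < m) <= 16; }

// The intended range check needs two comparisons joined with &&.
constexpr bool range_check(int m) { return 0 < m && m <= 16; }

// Hypothetical template showing the assertion written in the explicit form.
template <int M>
constexpr int checked_tile_rows() {
  static_assert(0 < M && M <= 16, "M must be in (0, 16]");
  return M;
}

static_assert(chained_check(100), "chained form accepts an out-of-range M");
static_assert(!range_check(100), "explicit form rejects an out-of-range M");
static_assert(checked_tile_rows<12>() == 12, "an in-range M compiles");
```

If that assumption about the intended range holds, the same rewrite (`0 < M && M <= 8`) would apply to the assertion in `cpu_attn_vec.hpp`.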

The error information is:

      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:8:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:71:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         71 |     static_assert(0 < M <= 16);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:51:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm_micro<12>' requested here
         51 |         gemm_micro<12>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:143:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm161<float>>::operator()' requested here
        143 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:242:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        242 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:19:43: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         19 |   AT_DISPATCH_CASE(at::ScalarType::Float, __VA_ARGS__) \
            |                                           ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:8:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:71:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         71 |     static_assert(0 < M <= 16);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:58:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm_micro<16>' requested here
         58 |         gemm_micro<16>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:143:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm161<float>>::operator()' requested here
        143 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:242:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        242 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:19:43: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         19 |   AT_DISPATCH_CASE(at::ScalarType::Float, __VA_ARGS__) \
            |                                           ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:24:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<1>' requested here
         24 |         gemm_micro<1>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:28:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<2>' requested here
         28 |         gemm_micro<2>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:33:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<4>' requested here
         33 |         gemm_micro<4>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:38:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<6>' requested here
         38 |         gemm_micro<6>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:43:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<8>' requested here
         43 |         gemm_micro<8>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      fatal error: too many errors emitted, stopping now [-ferror-limit=]
      1 warning and 20 errors generated.
      [2/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/mla_decode.cpp.o
      [3/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/layernorm.cpp.o
      [4/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/utils.cpp.o
      [5/9] Building CXX object CMakeFiles/_C.dir/csrc/moe/dynamic_4bit_int_moe_cpu.cpp.o
      [6/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/pos_encoding.cpp.o
      [7/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/activation.cpp.o
      [8/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/torch_bindings.cpp.o
      ninja: build stopped: subcommand failed.
      Traceback (most recent call last):
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 389, in <module>
          main()
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 373, in main
          json_out["return_val"] = hook(**hook_input["kwargs"])
                                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 280, in build_wheel
          return _build_backend().build_wheel(
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 439, in build_wheel
          return _build(['bdist_wheel', '--dist-info-dir', str(metadata_directory)])
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 427, in _build
          return self._build_with_temp_dir(
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 408, in _build_with_temp_dir
          self.run_setup()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 317, in run_setup
          exec(code, locals())
        File "<string>", line 1041, in <module>
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/__init__.py", line 117, in setup
          return distutils.core.setup(**attrs)  # type: ignore[return-value]
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/core.py", line 186, in setup
          return run_commands(dist)
                 ^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/core.py", line 202, in run_commands
          dist.run_commands()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1002, in run_commands
          self.run_command(cmd)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 1107, in run_command
          super().run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
          cmd_obj.run()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/command/bdist_wheel.py", line 370, in run
          self.run_command("build")
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/cmd.py", line 357, in run_command
          self.distribution.run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 1107, in run_command
          super().run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
          cmd_obj.run()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build.py", line 135, in run
          self.run_command(cmd_name)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/cmd.py", line 357, in run_command
          self.distribution.run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 1107, in run_command
          super().run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
          cmd_obj.run()
        File "<string>", line 346, in run
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/command/build_ext.py", line 97, in run
          _build_ext.run(self)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build_ext.py", line 368, in run
          self.build_extensions()
        File "<string>", line 315, in build_extensions
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/subprocess.py", line 413, in check_call
          raise CalledProcessError(retcode, cmd)
      subprocess.CalledProcessError: Command '['cmake', '--build', '.', '-j=10', '--target=_C']' returned non-zero exit status 1.
      [end of output]
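
Every `error:` line in the log above points at the same construct: `static_assert(0 < M <= 8)` in `csrc/cpu/cpu_attn_vec.hpp`, with the analogous `static_assert(0 < M <= 16)` in `cpu_attn_vec16.hpp`. In C++, `0 < M <= 8` parses as `(0 < M) <= 8`. The left side is a `bool` that converts to 0 or 1, so the assertion can never fail, and this Clang reports it under `-Wparentheses` as an error. Below is a minimal sketch of what a source-level fix might look like, assuming the intent is "M lies between 1 and 8 inclusive". The names `checked_rows`, `chained_form`, and `explicit_form` are hypothetical and are not taken from vLLM.

```cpp
#include <cassert>

// Hypothetical stand-in for the row-count check in the failing header;
// this is an illustration, not vLLM's actual code.
template <int M>
constexpr int checked_rows() {
  // Intended meaning of `0 < M <= 8`, spelled as two explicit comparisons.
  static_assert(0 < M && M <= 8, "M must be in the range [1, 8]");
  return M;
}

// What the chained form really computes: `0 < m <= 8` is `(0 < m) <= 8`.
constexpr bool chained_form(int m) {
  // The bool result converts to 0 or 1, which is always <= 8.
  return static_cast<int>(0 < m) <= 8;
}

// The range check the author most likely meant.
constexpr bool explicit_form(int m) { return 0 < m && m <= 8; }

// Compile-time demonstration of the difference.
static_assert(chained_form(100), "chained form accepts out-of-range values");
static_assert(!explicit_form(100), "explicit form rejects them");
```

Rewriting the two `static_assert` lines in the same two-comparison style would presumably let `cpu_attn.cpp` compile with this toolchain. The issue does not say whether upstream has already made that change.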

Code Example

[build-system]
requires = ["setuptools>=45", "wheel"]
build-backend = "setuptools.build_meta"

[project]
name = "tinker-atropos"
version = "0.1.0"
description = "Integration between Tinker training API and Atropos RL framework"
readme = "README.md"
requires-python = ">=3.9"
dependencies = [
    "atroposlib[all]",
    "tinker @ git+https://github.com/thinking-machines-lab/tinker.git",
    "fastapi>=0.104.0",
    "uvicorn[standard]>=0.24.0",
    "pydantic>=2.0.0",
    "requests>=2.31.0",
    "aiohttp>=3.9.0",
    "wandb>=0.15.0",
    "python-dotenv>=1.0.0",
]

[project.scripts]
rl-server = "tinker_atropos.rl_api_server:main"

[project.optional-dependencies]
dev = [
    "pytest>=7.4.0",
    "pytest-asyncio>=0.21.0",
    "black>=23.0.0",
    "ruff>=0.1.0",
    "mypy>=1.6.0",
    "pre-commit>=3.5.0",
]

[tool.setuptools.packages.find]
include = ["tinker_atropos*"]

[tool.black]
line-length = 100
target-version = ['py39']

[tool.ruff]
line-length = 100
target-version = "py39"

[tool.mypy]
python_version = "3.9"
warn_return_any = true
warn_unused_configs = true
disallow_untyped_defs = false

---

note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:8:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:71:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         71 |     static_assert(0 < M <= 16);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:51:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm_micro<12>' requested here
         51 |         gemm_micro<12>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:143:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm161<float>>::operator()' requested here
        143 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:242:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        242 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:19:43: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         19 |   AT_DISPATCH_CASE(at::ScalarType::Float, __VA_ARGS__) \
            |                                           ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:8:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:71:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         71 |     static_assert(0 < M <= 16);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:58:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm_micro<16>' requested here
         58 |         gemm_micro<16>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:143:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm161<float>>::operator()' requested here
        143 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:242:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        242 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:19:43: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         19 |   AT_DISPATCH_CASE(at::ScalarType::Float, __VA_ARGS__) \
            |                                           ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:24:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<1>' requested here
         24 |         gemm_micro<1>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:28:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<2>' requested here
         28 |         gemm_micro<2>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:33:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<4>' requested here
         33 |         gemm_micro<4>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:38:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<6>' requested here
         38 |         gemm_micro<6>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:43:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<8>' requested here
         43 |         gemm_micro<8>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      fatal error: too many errors emitted, stopping now [-ferror-limit=]
      1 warning and 20 errors generated.
      [2/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/mla_decode.cpp.o
      [3/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/layernorm.cpp.o
      [4/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/utils.cpp.o
      [5/9] Building CXX object CMakeFiles/_C.dir/csrc/moe/dynamic_4bit_int_moe_cpu.cpp.o
      [6/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/pos_encoding.cpp.o
      [7/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/activation.cpp.o
      [8/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/torch_bindings.cpp.o
      ninja: build stopped: subcommand failed.
      Traceback (most recent call last):
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 389, in <module>
          main()
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 373, in main
          json_out["return_val"] = hook(**hook_input["kwargs"])
                                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 280, in build_wheel
          return _build_backend().build_wheel(
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 439, in build_wheel
          return _build(['bdist_wheel', '--dist-info-dir', str(metadata_directory)])
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 427, in _build
          return self._build_with_temp_dir(
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 408, in _build_with_temp_dir
          self.run_setup()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 317, in run_setup
          exec(code, locals())
        File "<string>", line 1041, in <module>
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/__init__.py", line 117, in setup
          return distutils.core.setup(**attrs)  # type: ignore[return-value]
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/core.py", line 186, in setup
          return run_commands(dist)
                 ^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/core.py", line 202, in run_commands
          dist.run_commands()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1002, in run_commands
          self.run_command(cmd)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 1107, in run_command
          super().run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
          cmd_obj.run()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/command/bdist_wheel.py", line 370, in run
          self.run_command("build")
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/cmd.py", line 357, in run_command
          self.distribution.run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 1107, in run_command
          super().run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
          cmd_obj.run()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build.py", line 135, in run
          self.run_command(cmd_name)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/cmd.py", line 357, in run_command
          self.distribution.run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 1107, in run_command
          super().run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
          cmd_obj.run()
        File "<string>", line 346, in run
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/command/build_ext.py", line 97, in run
          _build_ext.run(self)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build_ext.py", line 368, in run
          self.build_extensions()
        File "<string>", line 315, in build_extensions
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/subprocess.py", line 413, in check_call
          raise CalledProcessError(retcode, cmd)
      subprocess.CalledProcessError: Command '['cmake', '--build', '.', '-j=10', '--target=_C']' returned non-zero exit status 1.
      [end of output]
  
  note: This error originates from a subprocess, and is likely not a problem with pip.
  ERROR: Failed building wheel for vllm
Successfully built tinker-atropos tinker
Failed to build vllm
error: failed-wheel-build-for-install

× Failed to build installable wheels for some pyproject.toml based projects
╰─> vllm
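
Every `error:` line in the log above points at the same construct: `static_assert(0 < M <= 8)` in `csrc/cpu/cpu_attn_vec.hpp:56` (and `static_assert(0 < M <= 16)` in `cpu_attn_vec16.hpp:71`). C++ does not chain comparisons, so `0 < M <= 8` parses as `(0 < M) <= 8`, which compares a `bool` against 8 and is always true. The sketch below illustrates the difference and the usual way to write the intended range check. The names `chained_form`, `range_form` and `checked_rows` are hypothetical and are not taken from vLLM, and this is not necessarily how the project patched it.

```cpp
// What the chained form actually computes: `0 < m <= hi` parses as
// `(0 < m) <= hi`, so the bool result of `0 < m` (0 or 1) is compared to `hi`.
constexpr bool chained_form(int m, int hi) {
  return static_cast<int>(0 < m) <= hi;
}

// The intended range check, written as two explicit comparisons.
constexpr bool range_form(int m, int hi) {
  return 0 < m && m <= hi;
}

// Hypothetical stand-in for a kernel templated on a row count M,
// using the explicit form inside static_assert.
template <int M>
constexpr int checked_rows() {
  static_assert(0 < M && M <= 8, "M must be in (0, 8]");
  return M;
}

// The chained form accepts out-of-range values; the explicit form does not.
static_assert(chained_form(100, 8), "chained form is true even for M = 100");
static_assert(!range_form(100, 8), "explicit form rejects M = 100");
static_assert(range_form(8, 8), "explicit form accepts M = 8");
```

Under this assumption, changing the two assertions in a local checkout to `0 < M && M <= 8` and `0 < M && M <= 16` should let those lines compile. Whether the rest of the CPU build then succeeds on this machine is not something the log can confirm.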

Your current environment

The pyproject.toml is as follows:

[build-system]
requires = ["setuptools>=45", "wheel"]
build-backend = "setuptools.build_meta"

[project]
name = "tinker-atropos"
version = "0.1.0"
description = "Integration between Tinker training API and Atropos RL framework"
readme = "README.md"
requires-python = ">=3.9"
dependencies = [
    "atroposlib[all]",
    "tinker @ git+https://github.com/thinking-machines-lab/tinker.git",
    "fastapi>=0.104.0",
    "uvicorn[standard]>=0.24.0",
    "pydantic>=2.0.0",
    "requests>=2.31.0",
    "aiohttp>=3.9.0",
    "wandb>=0.15.0",
    "python-dotenv>=1.0.0",
]

[project.scripts]
rl-server = "tinker_atropos.rl_api_server:main"

[project.optional-dependencies]
dev = [
    "pytest>=7.4.0",
    "pytest-asyncio>=0.21.0",
    "black>=23.0.0",
    "ruff>=0.1.0",
    "mypy>=1.6.0",
    "pre-commit>=3.5.0",
]

[tool.setuptools.packages.find]
include = ["tinker_atropos*"]

[tool.black]
line-length = 100
target-version = ['py39']

[tool.ruff]
line-length = 100
target-version = "py39"

[tool.mypy]
python_version = "3.9"
warn_return_any = true
warn_unused_configs = true
disallow_untyped_defs = false

The error output is:

      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:8:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:71:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         71 |     static_assert(0 < M <= 16);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:51:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm_micro<12>' requested here
         51 |         gemm_micro<12>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:143:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm161<float>>::operator()' requested here
        143 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:242:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        242 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:19:43: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         19 |   AT_DISPATCH_CASE(at::ScalarType::Float, __VA_ARGS__) \
            |                                           ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:8:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:71:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         71 |     static_assert(0 < M <= 16);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:58:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm_micro<16>' requested here
         58 |         gemm_micro<16>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm161<float>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec16.hpp:143:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm161<float>>::operator()' requested here
        143 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC16, float, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:242:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        242 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:19:43: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         19 |   AT_DISPATCH_CASE(at::ScalarType::Float, __VA_ARGS__) \
            |                                           ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:24:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<1>' requested here
         24 |         gemm_micro<1>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:28:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<2>' requested here
         28 |         gemm_micro<2>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:33:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<4>' requested here
         33 |         gemm_micro<4>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:38:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<6>' requested here
         38 |         gemm_micro<6>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:1:
      In file included from /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:7:
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:56:25: error: chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]
         56 |     static_assert(0 < M <= 8);
            |                         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:43:9: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm_micro<8>' requested here
         43 |         gemm_micro<8>(a_tile, b_tile, c_tile, lda, ldb, ldc, block_size,
            |         ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:953:35: note: in instantiation of function template specialization 'cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>::gemm<cpu_attention::AttentionGemmPhase::QK, 32>' requested here
        953 |             tile_gemm_t::template gemm<AttentionGemmPhase::QK, head_dim>(
            |                                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_vec.hpp:138:5: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention<cpu_attention::(anonymous namespace)::TileGemm82<c10::Half>>::operator()' requested here
        138 |     attention_iteration(CPU_ATTENTION_PARAMS);
            |     ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_impl.hpp:1707:38: note: in instantiation of function template specialization 'cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>::execute_attention<cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::Attention>' requested here
       1707 |                   attn_impl.template execute_attention<Attention>(
            |                                      ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn.cpp:186:19: note: in instantiation of member function 'cpu_attention::AttentionMainLoop<cpu_attention::AttentionImpl<cpu_attention::ISA::VEC, c10::Half, 32>>::operator()' requested here
        186 |           mainloop(&input);
            |                   ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_attn_dispatch_generated.h:236:16: note: expanded from macro 'CPU_ATTN_DISPATCH'
        236 |         return __VA_ARGS__(); \
            |                ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:24:68: note: expanded from macro 'VLLM_DISPATCH_FLOATING_TYPES'
         24 |   AT_DISPATCH_SWITCH(TYPE, NAME, VLLM_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))
            |                                                                    ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-install-x53o1r4f/vllm_f1ad1ecc0f5449c898df5f3e33d9481d/csrc/cpu/cpu_types_arm.hpp:20:42: note: expanded from macro 'VLLM_DISPATCH_CASE_FLOATING_TYPES'
         20 |   AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)  \
            |                                          ^
      note: (skipping 2 expansions in backtrace; use -fmacro-backtrace-limit=0 to see all)
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:13:12: note: expanded from macro 'THO_PRIVATE_CASE_TYPE_USING_HINT_TMPL'
         13 |     return __VA_ARGS__();                                                    \
            |            ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/ATen/Dispatch.h:190:7: note: expanded from macro 'AT_DISPATCH_SWITCH'
        190 |       __VA_ARGS__)
            |       ^
      /private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/torch/include/torch/headeronly/core/Dispatch.h:40:7: note: expanded from macro 'THO_DISPATCH_SWITCH_TMPL'
         40 |       __VA_ARGS__                                                           \
            |       ^
      fatal error: too many errors emitted, stopping now [-ferror-limit=]
      1 warning and 20 errors generated.
      [2/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/mla_decode.cpp.o
      [3/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/layernorm.cpp.o
      [4/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/utils.cpp.o
      [5/9] Building CXX object CMakeFiles/_C.dir/csrc/moe/dynamic_4bit_int_moe_cpu.cpp.o
      [6/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/pos_encoding.cpp.o
      [7/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/activation.cpp.o
      [8/9] Building CXX object CMakeFiles/_C.dir/csrc/cpu/torch_bindings.cpp.o
      ninja: build stopped: subcommand failed.
      Traceback (most recent call last):
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 389, in <module>
          main()
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 373, in main
          json_out["return_val"] = hook(**hook_input["kwargs"])
                                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 280, in build_wheel
          return _build_backend().build_wheel(
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 439, in build_wheel
          return _build(['bdist_wheel', '--dist-info-dir', str(metadata_directory)])
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 427, in _build
          return self._build_with_temp_dir(
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 408, in _build_with_temp_dir
          self.run_setup()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/build_meta.py", line 317, in run_setup
          exec(code, locals())
        File "<string>", line 1041, in <module>
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/__init__.py", line 117, in setup
          return distutils.core.setup(**attrs)  # type: ignore[return-value]
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/core.py", line 186, in setup
          return run_commands(dist)
                 ^^^^^^^^^^^^^^^^^^
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/core.py", line 202, in run_commands
          dist.run_commands()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1002, in run_commands
          self.run_command(cmd)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 1107, in run_command
          super().run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
          cmd_obj.run()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/command/bdist_wheel.py", line 370, in run
          self.run_command("build")
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/cmd.py", line 357, in run_command
          self.distribution.run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 1107, in run_command
          super().run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
          cmd_obj.run()
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build.py", line 135, in run
          self.run_command(cmd_name)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/cmd.py", line 357, in run_command
          self.distribution.run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/dist.py", line 1107, in run_command
          super().run_command(command)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
          cmd_obj.run()
        File "<string>", line 346, in run
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/command/build_ext.py", line 97, in run
          _build_ext.run(self)
        File "/private/var/folders/s1/bm4d9wrj3bd9hp4d65clcjb40000gn/T/pip-build-env-zf6xiy0f/overlay/lib/python3.12/site-packages/setuptools/_distutils/command/build_ext.py", line 368, in run
          self.build_extensions()
        File "<string>", line 315, in build_extensions
        File "/Users/yaoyaoyao/miniconda3/envs/evolve_opt/lib/python3.12/subprocess.py", line 413, in check_call
          raise CalledProcessError(retcode, cmd)
      subprocess.CalledProcessError: Command '['cmake', '--build', '.', '-j=10', '--target=_C']' returned non-zero exit status 1.
      [end of output]
  
  note: This error originates from a subprocess, and is likely not a problem with pip.
  ERROR: Failed building wheel for vllm
Successfully built tinker-atropos tinker
Failed to build vllm
error: failed-wheel-build-for-install

× Failed to build installable wheels for some pyproject.toml based projects
╰─> vllm

How you are installing vllm

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

extent analysis

TL;DR

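The build fails in the C++ compile step, not because of the Python version. The compiler rejects `static_assert(0 < M <= 8)` in `csrc/cpu/cpu_attn_vec.hpp` (and a similar line in `csrc/cpu/cpu_attn_vec16.hpp`) with a `-Wparentheses` chained-comparison error, so `cmake --build` exits non-zero and pip cannot build the vllm wheel.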

Guidance

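  1. Read the first compiler error, not the Python traceback: the traceback only reports that `cmake --build . -j=10 --target=_C` returned exit status 1. The underlying errors are `chained comparison 'X < Y <= Z' does not behave the same as a mathematical expression [-Wparentheses]` at `csrc/cpu/cpu_attn_vec.hpp:56` and `csrc/cpu/cpu_attn_vec16.hpp:71`.
  2. Rule out Python: the build environment uses Python 3.12, and setuptools runs normally under it up to the point where the compiler is invoked. Nothing in the log points to a Python incompatibility.
  3. Fix or avoid the rejected expression: when building from a source checkout, rewriting the assertions as `static_assert(0 < M && M <= 8)` (and `M <= 16` in the vec16 file) should satisfy the compiler. Alternatively, check whether a newer vllm release already contains such a change, or build with a compiler version that reports the chained comparison as a warning rather than an error.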

Example

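The failing construct, quoted from the log, is `static_assert(0 < M <= 8);` at `csrc/cpu/cpu_attn_vec.hpp:56`, with the same pattern as `static_assert(0 < M <= 16);` at `csrc/cpu/cpu_attn_vec16.hpp:71`. Each instantiation of `gemm_micro<M>` re-triggers the diagnostic, which is why the log repeats it until the compiler's error limit is reached.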

Notes

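The paths in the log (`/private/var/folders/...`, `cpu_types_arm.hpp`) suggest a CPU build on an ARM Mac, and the diagnostic format is Clang's. In C++, `0 < M <= 8` parses as `(0 < M) <= 8`: the left side is a `bool` that converts to 0 or 1, so the comparison is always true and the assertion checks nothing. The log reports 20 errors before the error limit stops the compiler; these may be repeats of the same diagnostic in other template instantiations or separate problems that surface once this one is fixed. The guidance above is inferred from the log alone.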

Recommendation

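Apply workaround: In a local source checkout, patch the two `static_assert` lines to compare each bound explicitly (`0 < M && M <= 8` and `0 < M && M <= 16`), then rebuild. If patching is not an option, try a newer vllm version or a different compiler version. Changing the Python version is unlikely to help, because the failure occurs in the C++ compile step.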
