[CI/Build] fix cpu_extension for apple silicon #21195

ignaciosica · 2025-07-18T15:40:28Z

Purpose

The purpose for this pr is to fix the cpu installation from source for apple silicon CPUs. Recently, basic serving capabilities and examples stopped working in my local setup (M3 pro). I bisected my way to this pr #14129 which introduced int8 quantization for ARM CPU. The problem lied on the fact that although apple silicon shared some codepath with the rest of the arm cpu in cpu_extension.cmake, it also had some specific configurations that ended up breaking build after the int8 support was introduced; more specifically, apple silicon pathway was not enabling ASIMD_FOUND thus not including quant.cpp (cpu_extension.cmake:284) source.

After a fresh install from source

git clone https://github.com/vllm-project/vllm.git
cd vllm
uv venv --python 3.12 --seed
source .venv/bin/activate
uv pip install -r requirements/cpu.txt
uv pip install -e .

Basic example failed with

python examples/offline_inference/basic/basic.py
> WARNING 07-18 12:13:35 [_custom_ops.py:20] Failed to import from vllm._C with ImportError("dlopen([...]/vllm/vllm/_C.abi3.so, 0x0002): symbol not found in flat namespace '__Z14int8_scaled_mmRN2at6TensorERKS0_S3_S3_S3_RKNSt3__18optionalIS0_EE'")
> [...]
> AttributeError: '_OpNamespace' '_C_cache_ops' object has no attribute 'reshape_and_cache'

In order to fix, this pr enabled ASIMD_FOUND for apple silicon as well. For safety, it checked for support with the following command: sysctl -n hw.optional.neon. As far as I know, all apple silicon, starting from M1 up to M4 generation support this feature, but still decided to gate ASIMD_FOUND on this check in the case the support is dropped for future generations.

After this fix, cpu installation from source started working again.

This pr also enables bf16 support for apple silicon. This feature is gated via the following check hw.optional.arm.FEAT_BF16. Based on this LLVM's commit, bf16 support was introduced for cpu in m2 generation.

Test Plan

Unfortunately I'm not familiar enough with build runs in CI, I would appreciate some guidance for this point.

github-actions · 2025-07-18T15:40:37Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

gemini-code-assist

Code Review

This pull request fixes a build issue for Apple Silicon CPUs by enabling ASIMD support and introducing a check_sysctl function. To improve robustness, I've identified a critical issue regarding argument quoting in the new check_sysctl function that should be addressed.

cmake/cpu_extension.cmake

ignaciosica · 2025-07-20T01:19:29Z

addressed commit signing issue

ignaciosica · 2025-07-20T01:27:48Z

cmake/cpu_extension.cmake

@@ -70,7 +86,10 @@ endfunction()
 is_avx512_disabled(AVX512_DISABLED)

 if (MACOSX_FOUND AND CMAKE_SYSTEM_PROCESSOR STREQUAL "arm64")
-    set(APPLE_SILICON_FOUND TRUE)


removed APPLE_SILICON_FOUND as it wasn't used downstream anymore but maybe it's still useful to keep it for future reference/use?

Yeah this would be useful to follow the pattern and use in downstream code

Signed-off-by: ignaciosica <[email protected]>

mgoin

LGTM I think this is reasonable and shouldn't be disruptive to other backends

Signed-off-by: ignaciosica <[email protected]>

Signed-off-by: ignaciosica <[email protected]> Signed-off-by: shuw <[email protected]>

Signed-off-by: ignaciosica <[email protected]> Signed-off-by: x22x22 <[email protected]>

Signed-off-by: ignaciosica <[email protected]>

Signed-off-by: ignaciosica <[email protected]> Signed-off-by: Jinzhen Lin <[email protected]>

Signed-off-by: ignaciosica <[email protected]> Signed-off-by: Paul Pak <[email protected]>

Signed-off-by: ignaciosica <[email protected]>

Signed-off-by: ignaciosica <[email protected]> Signed-off-by: Boyuan Feng <[email protected]>

Signed-off-by: ignaciosica <[email protected]> Signed-off-by: Diego-Castan <[email protected]>

Signed-off-by: ignaciosica <[email protected]>

mergify bot added the ci/build label Jul 18, 2025

gemini-code-assist bot reviewed Jul 18, 2025

View reviewed changes

cmake/cpu_extension.cmake Outdated Show resolved Hide resolved

DarkLight1337 requested a review from mgoin July 19, 2025 09:36

ignaciosica force-pushed the fix_fresh_source_install branch from 537a376 to 34d30da Compare July 20, 2025 01:18

ignaciosica commented Jul 20, 2025

View reviewed changes

ignaciosica added 3 commits July 24, 2025 20:29

fix cpu_extension for apple silicon

b8b9f82

Signed-off-by: ignaciosica <[email protected]>

revert whitespace change

6a92cbf

Signed-off-by: ignaciosica <[email protected]>

address gemini suggestion

e25e799

Signed-off-by: ignaciosica <[email protected]>

ignaciosica force-pushed the fix_fresh_source_install branch from 34d30da to e25e799 Compare July 24, 2025 23:29

mgoin added the ready ONLY add when PR is ready to merge/full CI is needed label Jul 24, 2025

mgoin approved these changes Jul 25, 2025

View reviewed changes

mgoin added the cpu Related to CPU backends label Jul 25, 2025

vllm-bot merged commit 5140f54 into vllm-project:main Jul 25, 2025
46 of 48 checks passed

liuyumoye pushed a commit to liuyumoye/vllm that referenced this pull request Jul 31, 2025

[CI/Build] fix cpu_extension for apple silicon (vllm-project#21195)

390ba7f

Signed-off-by: ignaciosica <[email protected]>

wenscarl pushed a commit to wenscarl/vllm that referenced this pull request Aug 4, 2025

[CI/Build] fix cpu_extension for apple silicon (vllm-project#21195)

df46c34

Signed-off-by: ignaciosica <[email protected]> Signed-off-by: shuw <[email protected]>

x22x22 pushed a commit to x22x22/vllm that referenced this pull request Aug 5, 2025

[CI/Build] fix cpu_extension for apple silicon (vllm-project#21195)

b634dcf

Signed-off-by: ignaciosica <[email protected]> Signed-off-by: x22x22 <[email protected]>

Pradyun92 pushed a commit to Pradyun92/vllm that referenced this pull request Aug 6, 2025

[CI/Build] fix cpu_extension for apple silicon (vllm-project#21195)

f3d2a70

Signed-off-by: ignaciosica <[email protected]>

npanpaliya pushed a commit to odh-on-pz/vllm-upstream that referenced this pull request Aug 6, 2025

[CI/Build] fix cpu_extension for apple silicon (vllm-project#21195)

a9bd55a

Signed-off-by: ignaciosica <[email protected]>

jinzhen-lin pushed a commit to jinzhen-lin/vllm that referenced this pull request Aug 9, 2025

[CI/Build] fix cpu_extension for apple silicon (vllm-project#21195)

8c643fa

Signed-off-by: ignaciosica <[email protected]> Signed-off-by: Jinzhen Lin <[email protected]>

paulpak58 pushed a commit to paulpak58/vllm that referenced this pull request Aug 13, 2025

[CI/Build] fix cpu_extension for apple silicon (vllm-project#21195)

59ed2e9

Signed-off-by: ignaciosica <[email protected]> Signed-off-by: Paul Pak <[email protected]>

taneem-ibrahim pushed a commit to taneem-ibrahim/vllm that referenced this pull request Aug 14, 2025

[CI/Build] fix cpu_extension for apple silicon (vllm-project#21195)

2ef395e

Signed-off-by: ignaciosica <[email protected]>

BoyuanFeng pushed a commit to BoyuanFeng/vllm that referenced this pull request Aug 14, 2025

[CI/Build] fix cpu_extension for apple silicon (vllm-project#21195)

d7ea126

Signed-off-by: ignaciosica <[email protected]> Signed-off-by: Boyuan Feng <[email protected]>

diegocastanibm pushed a commit to diegocastanibm/vllm that referenced this pull request Aug 15, 2025

[CI/Build] fix cpu_extension for apple silicon (vllm-project#21195)

befccb9

Signed-off-by: ignaciosica <[email protected]> Signed-off-by: Diego-Castan <[email protected]>

epwalsh pushed a commit to epwalsh/vllm that referenced this pull request Aug 28, 2025

[CI/Build] fix cpu_extension for apple silicon (vllm-project#21195)

5f2c11e

Signed-off-by: ignaciosica <[email protected]>

googlercolin pushed a commit to googlercolin/vllm that referenced this pull request Aug 29, 2025

[CI/Build] fix cpu_extension for apple silicon (vllm-project#21195)

d1a8580

Signed-off-by: ignaciosica <[email protected]>

ignaciosica mentioned this pull request Sep 2, 2025

[Hardware][Apple-CPU] Enable native bfloat16 on Apple Silicon (M2 and later) #24129

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[CI/Build] fix cpu_extension for apple silicon #21195

[CI/Build] fix cpu_extension for apple silicon #21195

Uh oh!

ignaciosica commented Jul 18, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Jul 18, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

ignaciosica commented Jul 20, 2025

Uh oh!

ignaciosica Jul 20, 2025

Uh oh!

mgoin Jul 25, 2025

Uh oh!

mgoin left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[CI/Build] fix cpu_extension for apple silicon #21195

[CI/Build] fix cpu_extension for apple silicon #21195

Uh oh!

Conversation

ignaciosica commented Jul 18, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Uh oh!

github-actions bot commented Jul 18, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

ignaciosica commented Jul 20, 2025

Uh oh!

ignaciosica Jul 20, 2025

Choose a reason for hiding this comment

Uh oh!

mgoin Jul 25, 2025

Choose a reason for hiding this comment

Uh oh!

mgoin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ignaciosica commented Jul 18, 2025 •

edited by github-actions bot

Loading