[CPU] Disable oneDNN linear on non-x86 platforms #25166

bigPYJ1151 · 2025-09-18T12:57:16Z

Purpose

The recently added oneDNN linear kernels have issues on non-x86 platforms. This PR enables them only on x86.

Fix #25155 #24976

Test Plan

Test Result

Signed-off-by: jiang1.li <jiang1.li@intel.com>

gemini-code-assist

Code Review

This pull request aims to disable problematic oneDNN linear kernels on non-x86 platforms. The change correctly adds an architecture check for the unquantized GEMM path. However, the review identifies that this fix is likely incomplete, as the quantized oneDNN path does not seem to be covered by this change and may still cause issues on non-x86 platforms.

gemini-code-assist · 2025-09-18T12:59:06Z

vllm/model_executor/layers/utils.py

+    elif (ops._supports_onednn
+          and current_platform.get_cpu_architecture() == CpuArchEnum.X86):


This fix appears to be incomplete. While it correctly disables the oneDNN kernel for the unquantized path on non-x86 platforms, the quantized path, which also utilizes oneDNN, seems to be unaddressed. If the underlying issue with oneDNN on non-x86 platforms is general, this omission could lead to incorrect behavior or crashes when running quantized models on those platforms. A similar architecture check should be implemented for the quantized oneDNN dispatch path to ensure a complete fix.
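A minimal sketch of what gating both paths on one shared check could look like. It is self-contained and does not import vLLM: `CpuArch`, `onednn_allowed`, `select_linear_backend`, and the backend names are hypothetical stand-ins for illustration. Only the shape of the condition (`ops._supports_onednn` combined with an x86 architecture check) comes from the diff above. Whether the quantized path actually needs this guard is not confirmed in this thread.

```python
from enum import Enum


class CpuArch(Enum):
    """Hypothetical stand-in for the CpuArchEnum used in the diff."""
    X86 = "x86"
    ARM = "arm"
    POWERPC = "powerpc"


def onednn_allowed(supports_onednn: bool, arch: CpuArch) -> bool:
    """Single predicate mirroring the condition added in the diff:
    oneDNN must be built in AND the CPU must be x86."""
    return supports_onednn and arch is CpuArch.X86


def select_linear_backend(supports_onednn: bool, arch: CpuArch,
                          quantized: bool) -> str:
    """Pick a (hypothetical) backend name for a linear layer.

    Both the unquantized and quantized branches consult the same
    predicate, so neither can reach oneDNN on a non-x86 CPU.
    """
    if onednn_allowed(supports_onednn, arch):
        return "onednn_quantized" if quantized else "onednn"
    # Fallback names are placeholders, not real vLLM kernels.
    return "fallback_quantized" if quantized else "fallback"


if __name__ == "__main__":
    for arch in CpuArch:
        for quantized in (False, True):
            backend = select_linear_backend(True, arch, quantized)
            print(arch.name, "quantized" if quantized else "unquantized",
                  "->", backend)
```

Routing both branches through one predicate keeps the architecture rule in a single place, so a later change to the supported platforms would not need to be repeated per dispatch path.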

fix

77702c1

Signed-off-by: jiang1.li <jiang1.li@intel.com>

gemini-code-assist bot reviewed Sep 18, 2025

jikunshang approved these changes Sep 19, 2025

jikunshang enabled auto-merge (squash) September 19, 2025 05:54

github-actions bot added the "ready" label (ONLY add when PR is ready to merge/full CI is needed) Sep 19, 2025

jikunshang merged commit 8c1d4ac into vllm-project:main Sep 19, 2025
54 checks passed

bigPYJ1151 mentioned this pull request Sep 19, 2025

[Bug]: [CPU-only][Graviton3] oneDNN matmul primitive creation fails running examples/offline_inference/basic/basic.py #24976

Closed

1 task

debroy-rh pushed a commit to debroy-rh/vllm that referenced this pull request Sep 19, 2025

[CPU] Disable oneDNN linear on non-x86 platforms (vllm-project#25166)

d3ed9f6

Signed-off-by: jiang1.li <jiang1.li@intel.com>

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

[CPU] Disable oneDNN linear on non-x86 platforms (vllm-project#25166)

00a0dd4

Signed-off-by: jiang1.li <jiang1.li@intel.com>

charlifu pushed a commit to ROCm/vllm that referenced this pull request Sep 25, 2025

[CPU] Disable oneDNN linear on non-x86 platforms (vllm-project#25166)

bcbfd0f

Signed-off-by: jiang1.li <jiang1.li@intel.com> Signed-off-by: charlifu <charlifu@amd.com>
