[NativeCPU] Simplify enqueue. #19550

hvdijk · 2025-07-22T14:04:37Z

We were creating excessive numbers of threads. When we know we want a given amount of threads, just divide the number of workgroups by the number of threads and have each thread process that many workgroups.

This implementation also means we no longer need to resize workgroups, which was not generally safe.

We were creating excessive numbers of threads. When we know we want a given amount of threads, just divide the number of workgroups by the number of threads and have each thread process that many workgroups.

Testing on Pointnet showed that the nesting we originally had is faster.

uwedolinsky

It seems this PR also removes the unsafe enqueuing optimization for nd_range kernels. If so it's probably worth also mentioning this in the description.

unified-runtime/source/adapters/native_cpu/enqueue.cpp

hvdijk · 2025-07-23T12:14:05Z

It seems this PR also removes the unsafe enqueuing optimization for nd_range kernels. If so it's probably worth also mentioning this in the description.

Done, thanks.

unified-runtime/source/adapters/native_cpu/enqueue.cpp

hvdijk · 2025-07-23T17:34:47Z

@intel/llvm-gatekeepers This can be merged, thanks.

[NativeCPU] Simplify enqueue.

d4a93d1

We were creating excessive numbers of threads. When we know we want a given amount of threads, just divide the number of workgroups by the number of threads and have each thread process that many workgroups.

hvdijk requested a review from a team as a code owner July 22, 2025 14:04

hvdijk temporarily deployed to WindowsCILock July 22, 2025 14:04 — with GitHub Actions Inactive

Fix formatting.

ef993ce

hvdijk temporarily deployed to WindowsCILock July 22, 2025 14:25 — with GitHub Actions Inactive

Revert loop switching.

90bae9a

Testing on Pointnet showed that the nesting we originally had is faster.

hvdijk temporarily deployed to WindowsCILock July 22, 2025 15:02 — with GitHub Actions Inactive

hvdijk had a problem deploying to WindowsCILock July 22, 2025 15:36 — with GitHub Actions Failure

hvdijk temporarily deployed to WindowsCILock July 22, 2025 15:36 — with GitHub Actions Inactive

hvdijk temporarily deployed to WindowsCILock July 22, 2025 16:38 — with GitHub Actions Inactive

uwedolinsky reviewed Jul 23, 2025

View reviewed changes

unified-runtime/source/adapters/native_cpu/enqueue.cpp Outdated Show resolved Hide resolved

Use size_t and check for impossibly large dimensions.

b2b9b15

hvdijk temporarily deployed to WindowsCILock July 23, 2025 12:36 — with GitHub Actions Inactive

hvdijk temporarily deployed to WindowsCILock July 23, 2025 12:57 — with GitHub Actions Inactive

uwedolinsky reviewed Jul 23, 2025

View reviewed changes

unified-runtime/source/adapters/native_cpu/enqueue.cpp Outdated Show resolved Hide resolved

[NFC] Reduce the amount of captured data.

73ac8c9

hvdijk temporarily deployed to WindowsCILock July 23, 2025 14:51 — with GitHub Actions Inactive

hvdijk commented Jul 23, 2025

View reviewed changes

unified-runtime/source/adapters/native_cpu/enqueue.cpp Outdated Show resolved Hide resolved

hvdijk temporarily deployed to WindowsCILock July 23, 2025 15:31 — with GitHub Actions Inactive

Only use __builtin_umulll_overflow with GNU C compatible compilers.

b90df43

hvdijk temporarily deployed to WindowsCILock July 23, 2025 15:47 — with GitHub Actions Inactive

hvdijk temporarily deployed to WindowsCILock July 23, 2025 16:16 — with GitHub Actions Inactive

uwedolinsky approved these changes Jul 23, 2025

View reviewed changes

igchor merged commit 63c70a1 into intel:sycl Jul 23, 2025
48 of 49 checks passed

uwedolinsky mentioned this pull request Aug 7, 2025

[NATIVECPU] Emit Native CPU properties #19429

Open

hvdijk deleted the simplify-enqueue branch September 4, 2025 09:37

hvdijk mentioned this pull request Sep 24, 2025

[DRAFT][NATIVECPU] using tbb::parallel_for when oneTBB is enabled #20064

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[NativeCPU] Simplify enqueue. #19550

[NativeCPU] Simplify enqueue. #19550

Uh oh!

hvdijk commented Jul 22, 2025 •

edited

Loading

Uh oh!

uwedolinsky left a comment

Uh oh!

Uh oh!

hvdijk commented Jul 23, 2025

Uh oh!

Uh oh!

Uh oh!

hvdijk commented Jul 23, 2025

Uh oh!

Uh oh!

Uh oh!

[NativeCPU] Simplify enqueue. #19550

[NativeCPU] Simplify enqueue. #19550

Uh oh!

Conversation

hvdijk commented Jul 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

uwedolinsky left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

hvdijk commented Jul 23, 2025

Uh oh!

Uh oh!

Uh oh!

hvdijk commented Jul 23, 2025

Uh oh!

Uh oh!

Uh oh!

hvdijk commented Jul 22, 2025 •

edited

Loading