[SYCL] avoid lock and wait in KernelProgramCache::getOrBuild #20780

lslusarczyk · 2025-11-28T12:20:15Z

Optimized common path when kernel is already build and exists in the cache. Use BuildStatus already provided by compare_exchange_strong instead of taking lock, calling wait and reading it again.

performance impact

Visible in most benchmarks.
Examples

Instructions decreased from 159.8k to 158.2k over UR baseline 133.7k, that is by 6.1%, see rightmost dots at:

Instructions decreased from 133.4k to 132.3k over UR baseline 119.2k, that is by 7.7%, see rightmost dots at:

And some example of time over L0 (but note time has high variance, so it is less certain)
Time overhead over L0 before: 18.0% (SYCL 17.65us, L0 14.96us), after 15.7% (SYCL 17.13us, L0 14.80us), reduced by ~2.2%

…ache::getOrBuild

steffenlarsen

LGTM!

lslusarczyk · 2025-11-28T15:31:36Z

@intel/llvm-gatekeepers please merge,
Failed CI also fails on upstream

E2E (Intel / Ponte Vecchio GPU) fails with HostInteropTask/host-task-failure.cpp timeout, seen also in other PR, e.g. here: https://github.com/intel/llvm/actions/runs/19766696194/job/56642904239?pr=20782
"SYCL Pre Commit on Linux / build_run_native_cpu_e2e_tests" fail with the same reason (SYCL :: DeviceLib/assert.cpp) as in other PRs, e.g. here https://github.com/intel/llvm/actions/runs/19757562868/job/56612822758?pr=20777

[SYCL] avoid taking lock and waiting in common path of KernelProgramC…

2f26840

…ache::getOrBuild

lslusarczyk requested a review from a team as a code owner November 28, 2025 12:20

lslusarczyk requested a review from vinser52 November 28, 2025 12:20

lslusarczyk temporarily deployed to WindowsCILock November 28, 2025 12:20 — with GitHub Actions Inactive

steffenlarsen approved these changes Nov 28, 2025

View reviewed changes

lslusarczyk temporarily deployed to WindowsCILock November 28, 2025 12:50 — with GitHub Actions Inactive

lslusarczyk had a problem deploying to WindowsCILock November 28, 2025 12:50 — with GitHub Actions Failure

vinser52 approved these changes Nov 28, 2025

View reviewed changes

lslusarczyk had a problem deploying to WindowsCILock November 28, 2025 15:15 — with GitHub Actions Failure

lslusarczyk temporarily deployed to WindowsCILock November 28, 2025 15:25 — with GitHub Actions Inactive

sergey-semenov merged commit e15c474 into intel:sycl Nov 28, 2025
79 of 89 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL] avoid lock and wait in KernelProgramCache::getOrBuild #20780

[SYCL] avoid lock and wait in KernelProgramCache::getOrBuild #20780

Uh oh!

lslusarczyk commented Nov 28, 2025

Uh oh!

steffenlarsen left a comment

Uh oh!

lslusarczyk commented Nov 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[SYCL] avoid lock and wait in KernelProgramCache::getOrBuild #20780

[SYCL] avoid lock and wait in KernelProgramCache::getOrBuild #20780

Uh oh!

Conversation

lslusarczyk commented Nov 28, 2025

performance impact

Uh oh!

steffenlarsen left a comment

Choose a reason for hiding this comment

Uh oh!

lslusarczyk commented Nov 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants