[SYCL] Run CompileTimePropertiesPass early in the pipeline #20602

bader · 2025-11-07T23:23:19Z

Some compile time properties work as a replacement for kernel
attributes. For example, work_group_size semantics must be identical to
sycl::reqd_work_group_size kernel attribute. The problem is kernel
attributes are lowered to LLVM metadata by Clang, but work_group_size
represented as an LLVM attribute.
CompileTimePropertiesPass converts attribute to canonical metadata
representation, but does it late in the opimization pipeline.
This patch moves CompileTimePropertiesPass to the beginning of the
optimization pipeline to keep canonical representation for SYCL kernel
attributes information passes via compile-time properties.

Some compile time properties work as a replacement for kernel attributes. For example, work_group_size semantics must be identical to sycl::reqd_work_group_size kernel attribute. The problem is kernel attributes are lowered to LLVM metadata by Clang, but work_group_size represented as an LLVM attribute. CompileTimePropertiesPass converts attribute to canonical metadata representation, but does it late in the opimization pipeline. This patch moves CompileTimePropertiesPass to the beginning of the optimization pipeline to keep canonical representation for SYCL kernel attributes information passes via compile-time properties.

steffenlarsen · 2025-11-10T06:42:30Z

Discussed shortly offline. It looks like some of the metadata gets lost if added early. An option could be to run the pass twice and separate the transformations into early and late transformations. The kernel properties, i.e. the ones that have OpenCL parallels (for example sycl-work-group-size -> reqd_work_group_size) could be done early while the rest could be done in the slot we apply it today.

elizabethandrews · 2025-11-10T16:22:45Z

Out of curiosity how/why is metadata lost if pass is earlier?

steffenlarsen · 2025-11-11T08:49:52Z

Out of curiosity how/why is metadata lost if pass is earlier?

Seemingly it affects the cache control properties the worst. Looks to me like the !spirv.Decorations metadata on the loads and stores are lost if it is added early, which is probably not very surprising. Kernel functions are unlikely to transform drastically, but pointer loads and stores are likely the target of a lot of transformations, and since metadata isn't guaranteed to be preserved it is likely lost along some of those.

bader · 2025-11-20T22:18:34Z

@aratajew, can we change the cache control LLVM representation from instruction metadata to something that can survive LLVM optimizations (e.g. llvm.ptr.annotation)?

Today, SYCL compiler emits llvm.ptr.annotation intrinsic annotating pointer with cache control hints, which works well for that purpose, but we have to maintain another LLVM pass to convert the intrinsic to SPIR-V metadata with the restriction to run this pass as close to SPIR-V CodeGen (or LLVM-SPIRV-Translator) as possible. I would prefer to have a single representation in LLVM for cache control hints respected by the LLVM optimizations.

SYCL compiler choice of llvm.ptr.annotation intrinsic might not be the best solution due to semantic definition (https://llvm.org/docs/LangRef.html#llvm-ptr-annotation-intrinsic):

Semantics:
This intrinsic allows annotation of a pointer to an integer with arbitrary strings. This can be useful for special purpose optimizations that want to look for these annotations. These have no other defined use; transformations preserve annotations on a best-effort basis but are allowed to replace the intrinsic with its first argument without breaking semantics and the intrinsic is completely dropped during instruction selection.

So far, all standard LLVM passes seem to do a good job with preserving annotations. Better alternatives are welcome.

…ontrols handling.

bader · 2025-11-21T02:14:52Z

An option could be to run the pass twice and separate the transformations into early and late transformations.

@steffenlarsen, I implemented this suggestion in the 2c8d041.

maarquitos14

Just a nit, otherwise LGTM.

clang/lib/CodeGen/BackendUtil.cpp

aratajew · 2025-11-21T10:07:17Z

@aratajew, can we change the cache control LLVM representation from instruction metadata to something that can survive LLVM optimizations (e.g. llvm.ptr.annotation)?

Do you generate cache control metadata attached to the pointer used by load/store instruction, or to the load/store instruction itself? The Khronos SPIRV-LLVM Translator initially supported only the former approach, which was indeed very prone to being optimized out. However, when the Triton Compiler faced this issue, the solution was this change: KhronosGroup/SPIRV-LLVM-Translator#2587. This change allows cache control metadata to be generated directly on a load/store instruction. The SPIRV-LLVM Translator then automatically generates a dummy GEP and reattaches the metadata to it, ensuring that proper SPIR-V can be generated.

steffenlarsen · 2025-11-21T11:41:48Z

llvm/include/llvm/SYCLLowerIR/CompileTimePropertiesPass.h

 class CompileTimePropertiesPass
    : public PassInfoMixin<CompileTimePropertiesPass> {
 public:
+  CompileTimePropertiesPass(bool ConvertCacheControls = true)


Nit; I think I would have preferred to call it something like EarlyRun or EarlyPass, so if other features are added that care whether it's transformed early or late, the bool still represents their needs.

bader · 2025-11-21T21:43:28Z

Do you generate cache control metadata attached to the pointer used by load/store instruction, or to the load/store instruction itself?

We generate cache control metadata attached to the instruction emitting the pointer argument for load/store instruction.

The Khronos SPIRV-LLVM Translator initially supported only the former approach, which was indeed very prone to being optimized out. However, when the Triton Compiler faced this issue, the solution was this change: KhronosGroup/SPIRV-LLVM-Translator#2587. This change allows cache control metadata to be generated directly on a load/store instruction. The SPIRV-LLVM Translator then automatically generates a dummy GEP and reattaches the metadata to it, ensuring that proper SPIR-V can be generated.

@aratajew, thanks for the hint! Let me try it out.

@steffenlarsen, @maarquitos14, @elizabethandrews, for the reviews. I really don't like the current solution with running the pass twice, so I'm going to try attaching the metadata to load/store instructions. If it doesn't work, I'll address your comments for the current patch.

Ideally, I would like Clang's IRGen to emit SPIR-V metadata for cache controls and avoid using CompileTimeProperties pass for that.

… cache controls handling." This reverts commit 2c8d041.

…ions producing a pointer argument for load/store instructions.

bader · 2025-11-22T00:34:05Z

sycl/test/check_device_code/extensions/properties/properties_cache_control.cpp


 // CHECK: spir_kernel{{.*}}cache_control_read_hint_func
-// CHECK: {{.*}}addrspacecast ptr addrspace(1){{.*}}!spirv.Decorations [[RHINT:.*]]
+// CHECK:  store float 5.500000e+01, ptr addrspace(1) %{{.*}}, !spirv.Decorations ![[RHINT:[0-9]+]]


NOTE: There is a bug in the test, which I reported here: #20718.
RHINT must be applied to load instructions.

bader · 2025-11-22T00:34:17Z

@aratajew, thanks again! New approach works like a charm. The metadata attached to the load/store instructions survives optimization. The least for the test we have in our pre-commit.

@steffenlarsen, I'm not sure if cache control hints feature is covered well. I updated the test checking LLVM IR, but it would be nice to check that hints are applied correctly at SPIR-V level as well. Do we have such tests?

steffenlarsen · 2025-11-24T08:03:54Z

@steffenlarsen, I'm not sure if cache control hints feature is covered well. I updated the test checking LLVM IR, but it would be nice to check that hints are applied correctly at SPIR-V level as well. Do we have such tests?

https://github.com/KhronosGroup/SPIRV-LLVM-Translator/tree/2f2a95e686e72ec77e6d0dfbf22413cf46c0e338/test/extensions/INTEL/SPV_INTEL_cache_controls has some tests related to the SPIR-V code generation. Is this what you had in mind?

steffenlarsen

Looking at the changes here, we may need to make sure that the SPIR-V translator is ready to make the proper conversions from the load/store instructions rather than the GEPs. Based on the testing in test/extensions/INTEL/SPV_INTEL_cache_controls it doesn't look like we have testing for such a case.

Tag @MrSidims & @maarquitos14

llvm/lib/SYCLLowerIR/CompileTimePropertiesPass.cpp

Co-authored-by: Steffen Larsen <steffen.larsen@intel.com>

bader · 2025-11-24T14:51:46Z

@steffenlarsen, I'm not sure if cache control hints feature is covered well. I updated the test checking LLVM IR, but it would be nice to check that hints are applied correctly at SPIR-V level as well. Do we have such tests?

https://github.com/KhronosGroup/SPIRV-LLVM-Translator/tree/2f2a95e686e72ec77e6d0dfbf22413cf46c0e338/test/extensions/INTEL/SPV_INTEL_cache_controls has some tests related to the SPIR-V code generation. Is this what you had in mind?

This is a bare minimum. In addition to these, it might be worth adding a test checking the SPIR-V emitted from SYCL sources. What do you think?

steffenlarsen · 2025-11-24T15:00:04Z

This is a bare minimum. In addition to these, it might be worth adding a test checking the SPIR-V emitted from SYCL sources. What do you think?

Off the top of my head, it's not something we usually do. Typically we would check the resulting LLVM-IR, then we could have LLVM SPIR-V translator tests that check that uses the LLVM-IR output from the SYCL tests as the input of their tests. I personally like that structure as it separates the responsibilities of the tooling.

bader · 2025-11-24T16:15:05Z

This is a bare minimum. In addition to these, it might be worth adding a test checking the SPIR-V emitted from SYCL sources. What do you think?

Off the top of my head, it's not something we usually do. Typically we would check the resulting LLVM-IR, then we could have LLVM SPIR-V translator tests that check that uses the LLVM-IR output from the SYCL tests as the input of their tests. I personally like that structure as it separates the responsibilities of the tooling.

I'm okay with that. If there are no objections, this patch should be ready for merge.
As you mentioned in this comment, SPIR-V translator part is not covered. I rely on @maarquitos14 and/or @MrSidims to add the translator part when they back to work.

I hope integration is validated by the end-to-end tests.

bader · 2025-11-25T00:02:19Z

The pre-commit failure is not related to the patch and tracked by #20750.

bader requested review from a team as code owners November 7, 2025 23:23

bader had a problem deploying to WindowsCILock November 7, 2025 23:23 — with GitHub Actions Failure

bader requested a review from steffenlarsen November 7, 2025 23:24

bader temporarily deployed to WindowsCILock November 7, 2025 23:58 — with GitHub Actions Inactive

Parametrize CompileTimePropertiesPass with a flag controlling cache c…

2c8d041

…ontrols handling.

bader temporarily deployed to WindowsCILock November 21, 2025 02:13 — with GitHub Actions Inactive

bader had a problem deploying to WindowsCILock November 21, 2025 03:15 — with GitHub Actions Failure

bader temporarily deployed to WindowsCILock November 21, 2025 03:15 — with GitHub Actions Inactive

bader had a problem deploying to WindowsCILock November 21, 2025 03:15 — with GitHub Actions Failure

maarquitos14 approved these changes Nov 21, 2025

View reviewed changes

clang/lib/CodeGen/BackendUtil.cpp Show resolved Hide resolved

steffenlarsen reviewed Nov 21, 2025

View reviewed changes

steffenlarsen approved these changes Nov 21, 2025

View reviewed changes

elizabethandrews approved these changes Nov 21, 2025

View reviewed changes

bader added 4 commits November 21, 2025 15:16

Revert "Parametrize CompileTimePropertiesPass with a flag controlling…

c102d96

… cache controls handling." This reverts commit 2c8d041.

Apply metadata to load/store instructions rather than to the instruct…

0914222

…ions producing a pointer argument for load/store instructions.

Merge remote-tracking branch 'origin/sycl' into compile-time-properties

2b7a124

Update properties_cache_control test checks.

1026c0e

bader requested a review from a team as a code owner November 22, 2025 00:25

bader requested a review from sergey-semenov November 22, 2025 00:25

bader had a problem deploying to WindowsCILock November 22, 2025 00:25 — with GitHub Actions Error

bader commented Nov 22, 2025

View reviewed changes

bader had a problem deploying to WindowsCILock November 24, 2025 07:31 — with GitHub Actions Failure

bader temporarily deployed to WindowsCILock November 24, 2025 08:11 — with GitHub Actions Inactive

bader had a problem deploying to WindowsCILock November 24, 2025 08:11 — with GitHub Actions Failure

bader temporarily deployed to WindowsCILock November 24, 2025 08:11 — with GitHub Actions Inactive

steffenlarsen reviewed Nov 24, 2025

View reviewed changes

llvm/lib/SYCLLowerIR/CompileTimePropertiesPass.cpp Outdated Show resolved Hide resolved

Update llvm/lib/SYCLLowerIR/CompileTimePropertiesPass.cpp

178a445

Co-authored-by: Steffen Larsen <steffen.larsen@intel.com>

bader temporarily deployed to WindowsCILock November 24, 2025 14:49 — with GitHub Actions Inactive

bader had a problem deploying to WindowsCILock November 24, 2025 15:38 — with GitHub Actions Failure

bader temporarily deployed to WindowsCILock November 24, 2025 15:38 — with GitHub Actions Inactive

Merge remote-tracking branch 'origin/sycl' into compile-time-properties

6c4adb1

bader temporarily deployed to WindowsCILock November 24, 2025 22:15 — with GitHub Actions Inactive

bader temporarily deployed to WindowsCILock November 24, 2025 23:03 — with GitHub Actions Inactive

bader had a problem deploying to WindowsCILock November 24, 2025 23:03 — with GitHub Actions Failure

bader mentioned this pull request Nov 25, 2025

SYCL Pre Commit on Windows fails on "Detect hung tests" check on BMG machine. #20750

Open

bader merged commit 02d8168 into intel:sycl Nov 25, 2025
28 of 29 checks passed

bader deleted the compile-time-properties branch November 25, 2025 00:02

[SYCL] Run CompileTimePropertiesPass early in the pipeline #20602

[SYCL] Run CompileTimePropertiesPass early in the pipeline #20602

Uh oh!

Conversation

bader commented Nov 7, 2025

Uh oh!

steffenlarsen commented Nov 10, 2025

Uh oh!

elizabethandrews commented Nov 10, 2025

Uh oh!

steffenlarsen commented Nov 11, 2025

Uh oh!

bader commented Nov 20, 2025

Uh oh!

bader commented Nov 21, 2025

Uh oh!

maarquitos14 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

aratajew commented Nov 21, 2025

Uh oh!

steffenlarsen Nov 21, 2025

Choose a reason for hiding this comment

Uh oh!

bader commented Nov 21, 2025

Uh oh!

bader Nov 22, 2025

Choose a reason for hiding this comment

Uh oh!

bader commented Nov 22, 2025

Uh oh!

steffenlarsen commented Nov 24, 2025

Uh oh!

steffenlarsen left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

bader commented Nov 24, 2025

Uh oh!

steffenlarsen commented Nov 24, 2025

Uh oh!

bader commented Nov 24, 2025

Uh oh!

bader commented Nov 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants