[NFC][HIP] Disable device-side kernel launches for HIP #171043

AlexVlx · 2025-12-07T15:05:25Z

#165519 added support for launching kernels from the device side. This is only available in CUDA at the moment. We have to explicitly check whether we are compiling for HIP to guard against this path being exercised, since the CUDA and HIP languages rely on the same CUDAIsDevice bit to check for device side compilation, and it is not possible to disambiguate otherwise.

llvmbot · 2025-12-07T15:05:54Z

@llvm/pr-subscribers-clang-codegen

@llvm/pr-subscribers-clang

Author: Alex Voicu (AlexVlx)

Changes

#165519 added support for launching kernels from the device side. This is only available in CUDA at the moment. We have to explicitly check whether we are compiling for HIP to guard against this path being exercised, since the CUDA and HIP languages rely on the same CUDAIsDevice bit to check for device side compilation, and it is not possible to disambiguate otherwise.

Full diff: https://github.com/llvm/llvm-project/pull/171043.diff

1 Files Affected:

(modified) clang/lib/CodeGen/CGExprCXX.cpp (+2-1)

diff --git a/clang/lib/CodeGen/CGExprCXX.cpp b/clang/lib/CodeGen/CGExprCXX.cpp
index ce2ed9026fa1f..3f4f61db8d3a4 100644
--- a/clang/lib/CodeGen/CGExprCXX.cpp
+++ b/clang/lib/CodeGen/CGExprCXX.cpp
@@ -504,7 +504,8 @@ RValue CodeGenFunction::EmitCUDAKernelCallExpr(const CUDAKernelCallExpr *E,
                                                ReturnValueSlot ReturnValue,
                                                llvm::CallBase **CallOrInvoke) {
   // Emit as a device kernel call if CUDA device code is to be generated.
-  if (getLangOpts().CUDAIsDevice)
+  // TODO: implement for HIP
+  if (!getLangOpts().HIP && getLangOpts().CUDAIsDevice)
     return CGM.getCUDARuntime().EmitCUDADeviceKernelCallExpr(
         *this, E, ReturnValue, CallOrInvoke);
   return CGM.getCUDARuntime().EmitCUDAKernelCallExpr(*this, E, ReturnValue,

…fix_hip_disable_self_enqueue

darkbuck · 2025-12-07T22:56:54Z

clang/lib/CodeGen/CGExprCXX.cpp

  // Emit as a device kernel call if CUDA device code is to be generated.
-  if (getLangOpts().CUDAIsDevice)
+  // TODO: implement for HIP
+  if (!getLangOpts().HIP && getLangOpts().CUDAIsDevice)


We added sema check @

llvm-project/clang/lib/Sema/SemaCUDA.cpp

Line 83 in 8378a6f

if (IsDeviceKernelCall && getLangOpts().HIP)

to generate error message on HIP based on Sam's request as HIP currently doesnt' support device-side kernel calls. I don't follow how we could have CUDAKernelCallExpr in the device compilation. Could you elaborate in details?

AlexVlx

We added sema check @

llvm-project/clang/lib/Sema/SemaCUDA.cpp

Line 83 in 8378a6f

if (IsDeviceKernelCall && getLangOpts().HIP)

to generate error message on HIP based on Sam's request as HIP currently doesnt' support device-side kernel calls. I don't follow how we could have CUDAKernelCallExpr in the device compilation. Could you elaborate in details?

The sema check doesn't work as is for hipstdpar, because it's gated on the current target being either a __global__ function or a __device__ function. What happens is that we do the parsing on a normal function, the <<<>>> expression is semantically valid, and then we try to EmitCUDAKernelCallExpr, because at CodeGen that is gated on whether the entire compilation is host or device, not on whether or not the caller is __global__ or __device__. So either the latter check should actually establish the caller's context, or we should bypass this altogether when compiling for hipstdpar. This is the simplest NFC workaround to unbreak things.

darkbuck · 2025-12-09T03:43:47Z

We added sema check @

llvm-project/clang/lib/Sema/SemaCUDA.cpp

Line 83 in 8378a6f

if (IsDeviceKernelCall && getLangOpts().HIP)

to generate error message on HIP based on Sam's request as HIP currently doesnt' support device-side kernel calls. I don't follow how we could have CUDAKernelCallExpr in the device compilation. Could you elaborate in details?

The sema check doesn't work as is for hipstdpar, because it's gated on the current target being either a __global__ function or a __device__ function. What happens is that we do the parsing on a normal function, the <<<>>> expression is semantically valid, and then we try to EmitCUDAKernelCallExpr, because at CodeGen that is gated on whether the entire compilation is host or device, not on whether or not the caller is __global__ or __device__. So either the latter check should actually establish the caller's context, or we should bypass this altogether when compiling for hipstdpar. This is the simplest NFC workaround to unbreak things.

Why not add getLangOpts().HIPStdPar check in sema to skip generating device-side kernel call? So that we have a central place to make that decision?

AlexVlx

We added sema check @

llvm-project/clang/lib/Sema/SemaCUDA.cpp

Line 83 in 8378a6f

if (IsDeviceKernelCall && getLangOpts().HIP)

to generate error message on HIP based on Sam's request as HIP currently doesnt' support device-side kernel calls. I don't follow how we could have CUDAKernelCallExpr in the device compilation. Could you elaborate in details?

The sema check doesn't work as is for hipstdpar, because it's gated on the current target being either a __global__ function or a __device__ function. What happens is that we do the parsing on a normal function, the <<<>>> expression is semantically valid, and then we try to EmitCUDAKernelCallExpr, because at CodeGen that is gated on whether the entire compilation is host or device, not on whether or not the caller is __global__ or __device__. So either the latter check should actually establish the caller's context, or we should bypass this altogether when compiling for hipstdpar. This is the simplest NFC workaround to unbreak things.

Why not add getLangOpts().HIPStdPar check in sema to skip generating device-side kernel call? So that we have a central place to make that decision?

Because, as far as I can ascertain, the Sema check is insufficient / the separate assert in EmitCUDAKernelCallExpr is disjoint. Here's what would happen:

In Sema what we see is that IsDeviceKernelCall is false - this is fine, but we still would emit a CudaKernelCallExpr for the <<<>>> callsite, which was the case anyways before this change;
Later on, when we get to CodeGen, we see the CudaKernelCallExpr, and try to handle it, except now the assumption is that if we're compiling for device and we see that, it must be a device side launch, and go look up a non-existent symbol, and run into the bug.

Disable device-side kernel launches for HIP

c145482

llvmbot added clang Clang issues not falling into any other category clang:codegen IR generation bugs: mangling, exceptions, etc. labels Dec 7, 2025

AlexVlx requested review from darkbuck and yxsamliu December 7, 2025 15:05

AlexVlx added 2 commits December 7, 2025 18:43

Merge branch 'main' of https://github.com/llvm/llvm-project into nfc_…

8311760

…fix_hip_disable_self_enqueue

Merge branch 'main' of https://github.com/llvm/llvm-project into nfc_…

4c35e28

…fix_hip_disable_self_enqueue

darkbuck reviewed Dec 7, 2025

View reviewed changes

AlexVlx commented Dec 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[NFC][HIP] Disable device-side kernel launches for HIP #171043

[NFC][HIP] Disable device-side kernel launches for HIP #171043

AlexVlx commented Dec 7, 2025

Uh oh!

llvmbot commented Dec 7, 2025 •

edited

Loading

Uh oh!

darkbuck Dec 7, 2025

Uh oh!

AlexVlx left a comment

Uh oh!

darkbuck commented Dec 9, 2025

Uh oh!

AlexVlx left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[NFC][HIP] Disable device-side kernel launches for HIP #171043

Are you sure you want to change the base?

[NFC][HIP] Disable device-side kernel launches for HIP #171043

Conversation

AlexVlx commented Dec 7, 2025

Uh oh!

llvmbot commented Dec 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

darkbuck Dec 7, 2025

Choose a reason for hiding this comment

Uh oh!

AlexVlx left a comment

Choose a reason for hiding this comment

Uh oh!

darkbuck commented Dec 9, 2025

Uh oh!

AlexVlx left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

llvmbot commented Dec 7, 2025 •

edited

Loading