[Offload] Guard HSA implicit arguments if they aren't created #133073

jhuber6 · 2025-03-26T13:07:24Z

Summary:
We conditionally allocate the implicit arguments, so they possibly are
null. The flang compiler seems to hit this case, even though it
shouldn't when it's supposed to conform to the HSA code object. For now
guard this to fix the regression and cover a case in the future where
someone rolls a fully custom implementatation.

Fixes: #132982

llvmbot · 2025-03-26T13:07:56Z

@llvm/pr-subscribers-offload

@llvm/pr-subscribers-backend-amdgpu

Author: Joseph Huber (jhuber6)

Changes

Summary:
We conditionally allocate the implicit arguments, so they possibly are
null. The flang compiler seems to hit this case, even though it
shouldn't when it's supposed to conform to the HSA code object. For now
guard this to fix the regression and cover a case in the future where
someone rolls a fully custom implementatation.

Fixes: #132982

Full diff: https://github.com/llvm/llvm-project/pull/133073.diff

1 Files Affected:

(modified) offload/plugins-nextgen/amdgpu/src/rtl.cpp (+12-10)

diff --git a/offload/plugins-nextgen/amdgpu/src/rtl.cpp b/offload/plugins-nextgen/amdgpu/src/rtl.cpp
index b2ede888b542d..097a324aa04a1 100644
--- a/offload/plugins-nextgen/amdgpu/src/rtl.cpp
+++ b/offload/plugins-nextgen/amdgpu/src/rtl.cpp
@@ -3386,16 +3386,18 @@ Error AMDGPUKernelTy::launchImpl(GenericDeviceTy &GenericDevice,
     return Err;
 
   // Set the COV5+ implicit arguments to the appropriate values.
-  ImplArgs->BlockCountX = NumBlocks[0];
-  ImplArgs->BlockCountY = NumBlocks[1];
-  ImplArgs->BlockCountZ = NumBlocks[2];
-  ImplArgs->GroupSizeX = NumThreads[0];
-  ImplArgs->GroupSizeY = NumThreads[1];
-  ImplArgs->GroupSizeZ = NumThreads[2];
-  ImplArgs->GridDims = NumBlocks[2] * NumThreads[2] > 1
-                           ? 3
-                           : 1 + (NumBlocks[1] * NumThreads[1] != 1);
-  ImplArgs->DynamicLdsSize = KernelArgs.DynCGroupMem;
+  if (ImplArgs) {
+    ImplArgs->BlockCountX = NumBlocks[0];
+    ImplArgs->BlockCountY = NumBlocks[1];
+    ImplArgs->BlockCountZ = NumBlocks[2];
+    ImplArgs->GroupSizeX = NumThreads[0];
+    ImplArgs->GroupSizeY = NumThreads[1];
+    ImplArgs->GroupSizeZ = NumThreads[2];
+    ImplArgs->GridDims = NumBlocks[2] * NumThreads[2] > 1
+                             ? 3
+                             : 1 + (NumBlocks[1] * NumThreads[1] != 1);
+    ImplArgs->DynamicLdsSize = KernelArgs.DynCGroupMem;
+  }
 
   // Push the kernel launch into the stream.
   return Stream->pushKernelLaunch(*this, AllArgs, NumThreads, NumBlocks,

offload/plugins-nextgen/amdgpu/src/rtl.cpp

Summary: We conditionally allocate the implicit arguments, so they possibly are null. The flang compiler seems to hit this case, even though it shouldn't when it's supposed to conform to the HSA code object. For now guard this to fix the regression and cover a case in the future where someone rolls a fully custom implementatation. Fixes: llvm#132982

jhuber6 requested review from carlobertolli, jplehr, ronlieb, saiislam and shiltian March 26, 2025 13:07

llvmbot added backend:AMDGPU offload labels Mar 26, 2025

arsenm reviewed Mar 26, 2025

View reviewed changes

offload/plugins-nextgen/amdgpu/src/rtl.cpp Outdated Show resolved Hide resolved

jhuber6 force-pushed the FixFlang branch from 3d50325 to 2928db4 Compare March 26, 2025 13:13

arsenm approved these changes Mar 26, 2025

View reviewed changes

jhuber6 merged commit 75f810e into llvm:main Mar 26, 2025
9 checks passed

jhuber6 deleted the FixFlang branch March 26, 2025 13:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Offload] Guard HSA implicit arguments if they aren't created #133073

[Offload] Guard HSA implicit arguments if they aren't created #133073

jhuber6 commented Mar 26, 2025

llvmbot commented Mar 26, 2025 •

edited

Loading

[Offload] Guard HSA implicit arguments if they aren't created #133073

[Offload] Guard HSA implicit arguments if they aren't created #133073

Conversation

jhuber6 commented Mar 26, 2025

llvmbot commented Mar 26, 2025 • edited Loading

llvmbot commented Mar 26, 2025 •

edited

Loading