
[AMDGPU] Set MaxAtomicSizeInBitsSupported. #75185

Merged
merged 2 commits into llvm:main from jyknight:atomic-max-amdgpu on Dec 18, 2023

Conversation

jyknight
Member

This will result in larger atomic operations getting expanded to __atomic_* libcalls via AtomicExpandPass, which matches what Clang already does in the frontend.

While AMDGPU currently disables the use of all libcalls, I've changed it to instead disable all of them except the atomic ones. Those are already emitted by the Clang frontend, and enabling them in the backend allows the same behavior there.
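
For context, here is a minimal sketch (mine, not part of the PR) of what that expansion looks like in IR. The i128 load mirrors the new test below; the `__atomic_load_16` signature and the memory-order value of 5 follow the C ABI for the `__atomic_*` runtime calls:

```llvm
; Before AtomicExpandPass: an atomic access wider than the 64 bits the
; target now reports via MaxAtomicSizeInBitsSupported.
define i128 @oversize_load(ptr %a) {
  %v = load atomic i128, ptr %a seq_cst, align 16
  ret i128 %v
}

; After AtomicExpandPass (sketch): the load is rewritten into a call to
; the sized runtime helper; i32 5 is the C ABI encoding of
; memory_order_seq_cst.
define i128 @oversize_load.expanded(ptr %a) {
  %v = call i128 @__atomic_load_16(ptr %a, i32 5)
  ret i128 %v
}

declare i128 @__atomic_load_16(ptr, i32)
```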

@llvmbot
Collaborator

llvmbot commented Dec 12, 2023

@llvm/pr-subscribers-llvm-transforms

@llvm/pr-subscribers-backend-amdgpu

Author: James Y Knight (jyknight)



Full diff: https://github.com/llvm/llvm-project/pull/75185.diff

2 Files Affected:

  • (modified) llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp (+7-3)
  • (added) llvm/test/CodeGen/AMDGPU/atomic-oversize.ll (+10)
diff --git a/llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp b/llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
index fcbdf51b03c1f..78092675057df 100644
--- a/llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
+++ b/llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
@@ -506,9 +506,11 @@ AMDGPUTargetLowering::AMDGPUTargetLowering(const TargetMachine &TM,
   setOperationAction(ISD::SELECT, MVT::v12f32, Promote);
   AddPromotedToType(ISD::SELECT, MVT::v12f32, MVT::v12i32);
 
-  // There are no libcalls of any kind.
-  for (int I = 0; I < RTLIB::UNKNOWN_LIBCALL; ++I)
-    setLibcallName(static_cast<RTLIB::Libcall>(I), nullptr);
+  // Disable most libcalls.
+  for (int I = 0; I < RTLIB::UNKNOWN_LIBCALL; ++I) {
+    if (I < RTLIB::ATOMIC_LOAD || I > RTLIB::ATOMIC_FETCH_NAND_16)
+      setLibcallName(static_cast<RTLIB::Libcall>(I), nullptr);
+  }
 
   setSchedulingPreference(Sched::RegPressure);
   setJumpIsExpensive(true);
@@ -556,6 +558,8 @@ AMDGPUTargetLowering::AMDGPUTargetLowering(const TargetMachine &TM,
                        ISD::FSUB,       ISD::FNEG,
                        ISD::FABS,       ISD::AssertZext,
                        ISD::AssertSext, ISD::INTRINSIC_WO_CHAIN});
+
+  setMaxAtomicSizeInBitsSupported(64);
 }
 
 bool AMDGPUTargetLowering::mayIgnoreSignedZero(SDValue Op) const {
diff --git a/llvm/test/CodeGen/AMDGPU/atomic-oversize.ll b/llvm/test/CodeGen/AMDGPU/atomic-oversize.ll
new file mode 100644
index 0000000000000..f62a93f523365
--- /dev/null
+++ b/llvm/test/CodeGen/AMDGPU/atomic-oversize.ll
@@ -0,0 +1,10 @@
+; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx900 -verify-machineinstrs < %s | FileCheck %s
+
+define void @test(ptr %a) nounwind {
+; CHECK-LABEL: test:
+; CHECK: __atomic_load_16
+; CHECK: __atomic_store_16
+  %1 = load atomic i128, ptr %a seq_cst, align 16
+  store atomic i128 %1, ptr %a seq_cst, align 16
+  ret void
+}
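
A note on the mechanics, since the diff is terse (this is my reading, not text from the PR): `RTLIB::ATOMIC_LOAD` through `RTLIB::ATOMIC_FETCH_NAND_16` bracket the contiguous block of `__atomic_*` entries in the `RTLIB::Libcall` enum, so the rewritten loop clears every libcall name except that block. `setMaxAtomicSizeInBitsSupported(64)` then instructs AtomicExpandPass to rewrite any atomic wider than 64 bits into calls to those surviving entries, which is exactly what the new atomic-oversize.ll test checks for.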

@arsenm
Contributor

arsenm commented Dec 18, 2023

What's the point of doing this? There aren't any libcalls, and the libcall handling is unaware of the address space anyway. This is just changing a hard error into producing nonworking code.

@jyknight
Member Author

The purpose is to match what's currently produced by Clang, with the eventual goal of deleting parts of the Clang atomic lowering, leaning on the backend instead.

@arsenm
Contributor

arsenm left a comment

I bet this breaks differently with 32-bit address space pointers

@jyknight
Member Author

I don't know. I guess we can deal with that if it comes up in the future.

@jyknight jyknight merged commit 137f785 into llvm:main Dec 18, 2023
5 checks passed
@jyknight jyknight deleted the atomic-max-amdgpu branch December 18, 2023 21:51