
[AMDGPU] Prevent folding of the negative i32 literals as i64 #70274

Merged: 5 commits into llvm:main, Oct 30, 2023

Conversation

rampitec (Collaborator)

We can use sign-extended 64-bit literals, but only for signed operands. At the moment we do not know whether an operand is signed, so such an operand will be encoded as its low 32 bits and then either correctly sign-extended or incorrectly zero-extended by the hardware.

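As a quick standalone illustration (not part of the patch; all names below are mine), the sketch shows why only a signed operand can safely take such a literal: the hardware sees only the low 32 bits, and the final 64-bit value depends on how it widens them.

```cpp
#include <cinttypes>
#include <cstdint>
#include <cstdio>

int main() {
  // The immediate used in the MIR test further down: 0xffffffff80000000.
  int64_t Imm = -2147483648LL;

  // Only the low 32 bits of the literal reach the hardware.
  uint32_t Lo = static_cast<uint32_t>(Imm); // 0x80000000

  // A signed operand would be sign-extended back to the original value...
  int64_t SignExt = static_cast<int32_t>(Lo);
  // ...while an unsigned operand is zero-extended to a different value.
  int64_t ZeroExt = Lo;

  std::printf("original:      0x%016" PRIx64 "\n", static_cast<uint64_t>(Imm));     // ffffffff80000000
  std::printf("sign-extended: 0x%016" PRIx64 "\n", static_cast<uint64_t>(SignExt)); // ffffffff80000000
  std::printf("zero-extended: 0x%016" PRIx64 "\n", static_cast<uint64_t>(ZeroExt)); // 0000000080000000
  return 0;
}
```

Since the operand's signedness is unknown, the patch conservatively assumes the zero-extending case and refuses to fold any negative 32-bit value into a 64-bit integer operand.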
@llvmbot (Collaborator) commented Oct 26, 2023

@llvm/pr-subscribers-backend-amdgpu

Author: Stanislav Mekhanoshin (rampitec)

Changes

We can use sign-extended 64-bit literals, but only for signed operands. At the moment we do not know whether an operand is signed, so such an operand will be encoded as its low 32 bits and then either correctly sign-extended or incorrectly zero-extended by the hardware.


Full diff: https://github.com/llvm/llvm-project/pull/70274.diff

2 Files Affected:

  • (modified) llvm/lib/Target/AMDGPU/SIInstrInfo.cpp (+9)
  • (added) llvm/test/CodeGen/AMDGPU/folding-of-i32-as-i64.mir (+20)
diff --git a/llvm/lib/Target/AMDGPU/SIInstrInfo.cpp b/llvm/lib/Target/AMDGPU/SIInstrInfo.cpp
index 827c2c156638468..355805e053f38df 100644
--- a/llvm/lib/Target/AMDGPU/SIInstrInfo.cpp
+++ b/llvm/lib/Target/AMDGPU/SIInstrInfo.cpp
@@ -5500,6 +5500,15 @@ bool SIInstrInfo::isOperandLegal(const MachineInstr &MI, unsigned OpIdx,
     if (Is64BitOp && !AMDGPU::isValid32BitLiteral(Imm, Is64BitFPOp) &&
         !AMDGPU::isInlinableLiteral64(Imm, ST.hasInv2PiInlineImm()))
       return false;
+
+    // FIXME: We can use sign extended 64-bit literals, but only for signed
+    //        operands. At the moment we do not know if an operand is signed.
+    //        Such operand will be encoded as its low 32 bits and then either
+    //        correctly sign extended or incorrectly zero extended by HW.
+    if (Is64BitOp && !Is64BitFPOp && isInt<32>(Imm) &&
+        (int32_t)Lo_32(Imm) < 0 &&
+        !AMDGPU::isInlinableLiteral64(Imm, ST.hasInv2PiInlineImm()))
+      return false;
   }
 
   // Handle non-register types that are treated like immediates.
diff --git a/llvm/test/CodeGen/AMDGPU/folding-of-i32-as-i64.mir b/llvm/test/CodeGen/AMDGPU/folding-of-i32-as-i64.mir
new file mode 100644
index 000000000000000..7cfa67d86fbd94e
--- /dev/null
+++ b/llvm/test/CodeGen/AMDGPU/folding-of-i32-as-i64.mir
@@ -0,0 +1,20 @@
+# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py UTC_ARGS: --version 3
+# RUN: llc -march=amdgcn -mcpu=gfx900 -verify-machineinstrs -run-pass=si-fold-operands -o - %s | FileCheck -check-prefix=GCN %s
+
+# The constant is 0xffffffff80000000. It is 64-bit negative constant, but it passes the test
+# isInt<32>(). Nonetheless it is not a legal literal for a binary or unsigned operand and
+# cannot be used right in the shift as HW will zero extend it.
+
+---
+name:            imm64_shift_int32_const
+body: |
+  bb.0:
+    ; GCN-LABEL: name: imm64_shift_int32_const
+    ; GCN: [[S_MOV_B:%[0-9]+]]:sreg_64 = S_MOV_B64_IMM_PSEUDO -2147483648
+    ; GCN-NEXT: [[S_LSHL_B64_:%[0-9]+]]:sreg_64 = S_LSHL_B64 [[S_MOV_B]], 1, implicit-def $scc
+    ; GCN-NEXT: S_ENDPGM 0, implicit [[S_LSHL_B64_]]
+    %0:sreg_64 = S_MOV_B64_IMM_PSEUDO 18446744071562067968
+    %1:sreg_64 = S_LSHL_B64 %0, 1, implicit-def $scc
+    S_ENDPGM 0, implicit %1
+
+...

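For reference, below is a minimal standalone sketch of the added legality condition. The function name is hypothetical, the LLVM helpers isInt<32> and Lo_32 are spelled out in plain C++, and the escape hatch for 64-bit inline constants (AMDGPU::isInlinableLiteral64) is only noted in a comment.

```cpp
#include <cstdint>

// Hypothetical standalone mirror of the condition added to
// SIInstrInfo::isOperandLegal(): a negative immediate that fits in 32 bits
// must not feed a 64-bit integer (non-FP) operand, since the hardware may
// zero-extend the low 32 bits instead of sign-extending them.
// The real code additionally lets values accepted by
// AMDGPU::isInlinableLiteral64 through.
bool foldWouldMisencode(int64_t Imm, bool Is64BitOp, bool Is64BitFPOp) {
  bool FitsInSigned32 = Imm >= INT32_MIN && Imm <= INT32_MAX;  // isInt<32>(Imm)
  auto Lo = static_cast<int32_t>(static_cast<uint32_t>(Imm));  // (int32_t)Lo_32(Imm)
  return Is64BitOp && !Is64BitFPOp && FitsInSigned32 && Lo < 0;
}
```

With the test's immediate 0xffffffff80000000 this returns true, so the fold is rejected and the S_MOV_B64_IMM_PSEUDO in the MIR test above is kept rather than folded into the S_LSHL_B64.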
@rampitec (Collaborator, Author)

In the long term, I think we need to replace the INT64 operands with SINT64 and UINT64. That said, there are no MVT::s64 and MVT::u64 types to use in an instruction profile.

@arsenm (Contributor) commented Oct 26, 2023

> In the long term, I think we need to replace the INT64 operands with SINT64 and UINT64. That said, there are no MVT::s64 and MVT::u64 types to use in an instruction profile.

You would have to bundle that somehow in the profile.

@rampitec (Collaborator, Author)

> > In the long term, I think we need to replace the INT64 operands with SINT64 and UINT64. That said, there are no MVT::s64 and MVT::u64 types to use in an instruction profile.
>
> You would have to bundle that somehow in the profile.

Right, a VT to use in the profile is also missing.

rampitec requested a review from arsenm on October 26, 2023 at 18:41
@jayfoad (Contributor) left a comment

LGTM if Matt is happy with the tests.

Review comment on llvm/lib/Target/AMDGPU/SIInstrInfo.cpp (outdated, resolved)
rampitec merged commit ee6d62d into llvm:main on Oct 30, 2023
3 checks passed
rampitec deleted the folding-of-i32-as-i64 branch on October 30, 2023 at 15:07