AMDGPU: Partially clean up canonicalized predicates in tablegen #85404

arsenm · 2024-03-15T14:15:24Z

This was the easy case; some of the other is_canonicalized* patterns have further issues. First, there appears to be a TableGen bug where the predicate is silently ignored when it is used as a ComplexPattern source. Second, we probably also need a version with an operand.

llvmbot · 2024-03-15T14:15:42Z

@llvm/pr-subscribers-backend-amdgpu

Author: Matt Arsenault (arsenm)

Changes

This was the easy case; some of the other is_canonicalized* patterns have further issues. First, there appears to be a TableGen bug where the predicate is silently ignored when it is used as a ComplexPattern source. Second, we probably also need a version with an operand.

Full diff: https://github.com/llvm/llvm-project/pull/85404.diff

2 Files Affected:

(modified) llvm/lib/Target/AMDGPU/SIInstrInfo.td (+13)
(modified) llvm/lib/Target/AMDGPU/SIInstructions.td (+6-24)

diff --git a/llvm/lib/Target/AMDGPU/SIInstrInfo.td b/llvm/lib/Target/AMDGPU/SIInstrInfo.td
index 835a5a24723154..1694436bad15ce 100644
--- a/llvm/lib/Target/AMDGPU/SIInstrInfo.td
+++ b/llvm/lib/Target/AMDGPU/SIInstrInfo.td
@@ -835,6 +835,19 @@ def fp16_zeros_high_16bits : PatLeaf<(f16 VGPR_32:$src), [{
   return fp16SrcZerosHighBits(N->getOpcode());
 }]>;
 
+def is_canonicalized : PatLeaf<(fAny srcvalue:$src), [{
+  const SITargetLowering &Lowering =
+      *static_cast<const SITargetLowering *>(getTargetLowering());
+  return Lowering.isCanonicalized(*CurDAG, SDValue(N, 0));
+}]> {
+  let GISelPredicateCode = [{
+    const SITargetLowering *TLI = static_cast<const SITargetLowering *>(
+        MF.getSubtarget().getTargetLowering());
+    const MachineOperand &Dst = MI.getOperand(0);
+    assert(Dst.isDef());
+    return TLI->isCanonicalized(Dst.getReg(), MF);
+   }];
+}
 
 //===----------------------------------------------------------------------===//
 // MUBUF/SMEM Patterns
diff --git a/llvm/lib/Target/AMDGPU/SIInstructions.td b/llvm/lib/Target/AMDGPU/SIInstructions.td
index 3ab788406ecb28..1c942dcefdacea 100644
--- a/llvm/lib/Target/AMDGPU/SIInstructions.td
+++ b/llvm/lib/Target/AMDGPU/SIInstructions.td
@@ -2946,30 +2946,12 @@ def : GCNPat<
 
 // If fcanonicalize's operand is implicitly canonicalized, we only need a copy.
 let AddedComplexity = 1000 in {
-def : GCNPat<
-  (is_canonicalized_1<fcanonicalize> f16:$src),
-  (COPY f16:$src)
->;
-
-def : GCNPat<
-  (is_canonicalized_1<fcanonicalize> v2f16:$src),
-  (COPY v2f16:$src)
->;
-
-def : GCNPat<
-  (is_canonicalized_1<fcanonicalize> f32:$src),
-  (COPY f32:$src)
->;
-
-def : GCNPat<
-  (is_canonicalized_1<fcanonicalize> v2f32:$src),
-  (COPY v2f32:$src)
->;
-
-def : GCNPat<
-  (is_canonicalized_1<fcanonicalize> f64:$src),
-  (COPY f64:$src)
->;
+foreach vt = [f16, v2f16, f32, v2f32, f64] in {
+  def : GCNPat<
+    (fcanonicalize (vt is_canonicalized:$src)),
+    (COPY vt:$src)
+  >;
+}
 }
 
 // Prefer selecting to max when legal, but using mul is always valid.
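
For reference, the `foreach` loop in the SIInstructions.td hunk yields one anonymous pattern per listed type, replacing the five hand-written `is_canonicalized_1<fcanonicalize>` patterns. The fragment below is a hand-written sketch of the first two expansions (not TableGen output); the remaining three follow the same shape for `f32`, `v2f32`, and `f64`.

```tablegen
// Hand-expanded sketch of the foreach body for vt = f16 and vt = v2f16.
// The is_canonicalized PatLeaf now guards the source operand directly.
def : GCNPat<
  (fcanonicalize (f16 is_canonicalized:$src)),
  (COPY f16:$src)
>;

def : GCNPat<
  (fcanonicalize (v2f16 is_canonicalized:$src)),
  (COPY v2f16:$src)
>;
```

Note that the predicate moves from wrapping the `fcanonicalize` node itself to being a leaf on its source, which is what lets a single `PatLeaf` definition serve every value type.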

jayfoad

Nice.

arsenm added the backend:AMDGPU label Mar 15, 2024

arsenm requested review from jayfoad and hvdijk March 15, 2024 14:15

jayfoad approved these changes Mar 15, 2024

hvdijk approved these changes Mar 15, 2024

arsenm merged commit fc06c8e into llvm:main Mar 15, 2024
5 of 6 checks passed

arsenm deleted the is-canonicalized-cleanup branch March 15, 2024 15:39
