[DAG] Add legalization handling for AVGCEIL/AVGFLOOR nodes #92096

RKSimon · 2024-05-14T11:13:53Z

Always match AVG patterns pre-legalization, and use TargetLowering::expandAVG to expand again during legalization.

I've removed the X86 custom AVGCEILU pattern detection and replaced it with combines that try to convert other AVG nodes to AVGCEILU.
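For readers unfamiliar with the AVG nodes, here is a minimal standalone sketch of the overflow-free identities that an expansion of the four nodes can be built from. It is an illustration on 8-bit values, not the code in TargetLowering::expandAVG; all function names are hypothetical, and the signed variants assume `>>` on a negative value is an arithmetic shift (guaranteed since C++20).

```cpp
#include <cstdint>

// Hypothetical helper names for illustration; these are not LLVM APIs.
// C++ promotes the operands to int, but every intermediate value below
// also fits in 8 bits, which is why these forms need no wider type.

// Unsigned floor average: shared bits plus half of the differing bits.
uint8_t avgFloorU(uint8_t a, uint8_t b) {
  return static_cast<uint8_t>((a & b) + ((a ^ b) >> 1));
}

// Unsigned ceiling average: union of bits minus half of the differing bits.
uint8_t avgCeilU(uint8_t a, uint8_t b) {
  return static_cast<uint8_t>((a | b) - ((a ^ b) >> 1));
}

// Signed variants: same shape, assuming >> is an arithmetic shift.
int8_t avgFloorS(int8_t a, int8_t b) {
  return static_cast<int8_t>((a & b) + ((a ^ b) >> 1));
}

int8_t avgCeilS(int8_t a, int8_t b) {
  return static_cast<int8_t>((a | b) - ((a ^ b) >> 1));
}

// Reference: floor(s / 2) computed on a widened sum.
int floorHalf(int s) { return s >= 0 ? s / 2 : -((-s + 1) / 2); }

// Exhaustively compares the four helpers against the widened reference.
bool checkAllPairs() {
  for (int a = -128; a <= 127; ++a) {
    for (int b = -128; b <= 127; ++b) {
      auto sa = static_cast<int8_t>(a);
      auto sb = static_cast<int8_t>(b);
      auto ua = static_cast<uint8_t>(a + 128);
      auto ub = static_cast<uint8_t>(b + 128);
      if (avgFloorS(sa, sb) != floorHalf(a + b)) return false;
      if (avgCeilS(sa, sb) != floorHalf(a + b + 1)) return false;
      if (avgFloorU(ua, ub) != (ua + ub) / 2) return false;
      if (avgCeilU(ua, ub) != (ua + ub + 1) / 2) return false;
    }
  }
  return true;
}
```

The identities follow from a + b = (a ^ b) + 2 * (a & b) = 2 * (a | b) - (a ^ b); calling `checkAllPairs()` compares them against a widened sum for every 8-bit input pair.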

github-actions · 2024-05-14T11:17:11Z

⚠️ C/C++ code formatter, clang-format found issues in your code. ⚠️

You can test this locally with the following command:

git-clang-format --diff 0e346eeac676d909402abe01fb23248bb3efc5e0 58c869b8dd4bf1f2929d06bc244ee97b3bde5fa1 -- llvm/include/llvm/CodeGen/TargetLowering.h llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp llvm/lib/Target/X86/X86ISelLowering.cpp

View the diff from clang-format here.

diff --git a/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp b/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
index f435a36305..fb4ac238e3 100644
--- a/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
@@ -2823,9 +2823,11 @@ void DAGTypeLegalizer::ExpandIntegerResult(SDNode *N, unsigned ResNo) {
   case ISD::USHLSAT: ExpandIntRes_SHLSAT(N, Lo, Hi); break;
 
   case ISD::AVGCEILS:
-  case ISD::AVGCEILU: 
+  case ISD::AVGCEILU:
   case ISD::AVGFLOORS:
-  case ISD::AVGFLOORU: ExpandIntRes_AVG(N, Lo, Hi); break;
+  case ISD::AVGFLOORU:
+    ExpandIntRes_AVG(N, Lo, Hi);
+    break;
 
   case ISD::SMULFIX:
   case ISD::SMULFIXSAT:
diff --git a/llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h b/llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h
index 82c39f4613..f561e80e25 100644
--- a/llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h
+++ b/llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h
@@ -479,7 +479,7 @@ private:
   void ExpandIntRes_SADDSUBO          (SDNode *N, SDValue &Lo, SDValue &Hi);
   void ExpandIntRes_UADDSUBO          (SDNode *N, SDValue &Lo, SDValue &Hi);
   void ExpandIntRes_XMULO             (SDNode *N, SDValue &Lo, SDValue &Hi);
-  void ExpandIntRes_AVG               (SDNode *N, SDValue &Lo, SDValue &Hi);
+  void ExpandIntRes_AVG(SDNode *N, SDValue &Lo, SDValue &Hi);
   void ExpandIntRes_ADDSUBSAT         (SDNode *N, SDValue &Lo, SDValue &Hi);
   void ExpandIntRes_SHLSAT            (SDNode *N, SDValue &Lo, SDValue &Hi);
   void ExpandIntRes_MULFIX            (SDNode *N, SDValue &Lo, SDValue &Hi);

…n looking for a splat constant. Limit the isConstOrConstSplat call to the vector elements we care about. Noticed while investigating regressions in #92096.

goldsteinn · 2024-05-16T16:34:51Z

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

+      if (KnownAmt.isConstant() && KnownAmt.getConstant().ult(VTBits))
+        Tmp = std::min<uint64_t>(Tmp + KnownAmt.getConstant().getZExtValue(),
+                                 VTBits);
+    }


This seems like an unrelated change?

If only... it's to fix a Thumb2 regression, as that target lowers a v2i64 constant as bitcast(v4i32 constant).

Once this draft has addressed all the regressions I'll turn my attention to pulling out some of these changes.

This is proving tricky to pull out, but I've confirmed that it doesn't cause any notable compile-time diff, since we fall back to a ComputeKnownBits call, which would call computeKnownBits on the shift amount anyway.

Isn't there any helper simpler than computeKnownBits that can look through bitcasts to find a constant?

If you are going to use computeKnownBits, why not use KnownAmt.getMinValue() instead of KnownAmt.getConstant()?

Updated to use getMaxValue() (for the upper bound) and getMinValue() (for the minimum sign extension). The shift amount isn't just a constant hidden behind bitcast(v4i32 constant), so we do need the abilities of computeKnownBits.

We could update getValidMinimumShiftAmountConstant (et al.) to return std::optional<APInt> to allow it to fall back to computeKnownBits, although that would mean the function could return a value that might not actually exist in the shift amount. I don't think we've relied on that property, but it would still be a change.

I've created #93182 as a possible cleanup for this (the pull requests are independent though, so we can go with the above approach for now). #93182 should get picked up by llvm-compile-time-tracker in the next hour or so.

I think you should use getMinValue in both places. It doesn't matter if we don't know an upper bound for the shift amount.

Actually I'm not even sure what the ult check is for, except perhaps to guard against Tmp + getMinValue() overflowing?

Yes, it's mainly just a sanity/overflow check (somebody always comes along eventually with an i1024 fuzz test or something that makes getZExtValue() assert or causes weird getLimitedValue() behaviour).

Using getMaxValue() was mainly to try and keep closer to the behaviour of getValidMinimumShiftAmountConstant, which doesn't accept out-of-bounds shift amounts.
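To make the arithmetic in this thread concrete, here is a minimal standalone sketch. The enclosing code is not shown above, so this assumes the snippet refines the sign-bit count of an arithmetic right shift: shifting right by at least `minAmt` adds at least `minAmt` copies of the sign bit, capped at the type width. The function names, the 32-bit width and the guard are hypothetical stand-ins, not the SelectionDAG code.

```cpp
#include <algorithm>
#include <cstdint>

// Number of leading bits equal to the sign bit, counting the sign bit itself.
unsigned numSignBits(int32_t v) {
  uint32_t u = static_cast<uint32_t>(v);
  uint32_t sign = u >> 31;
  unsigned n = 1;
  for (int bit = 30; bit >= 0 && ((u >> bit) & 1u) == sign; --bit)
    ++n;
  return n;
}

// Hypothetical helper: lower bound on the sign bits of (x >> amt), given a
// lower bound on the sign bits of x and a minimum shift amount.
unsigned sraSignBitsLowerBound(unsigned srcSignBits, uint64_t minAmt,
                               unsigned vtBits) {
  // Guard in the spirit of the ult(VTBits) check: an out-of-range amount
  // gives no refinement, and the addition below cannot overflow.
  if (minAmt >= vtBits)
    return srcSignBits;
  return static_cast<unsigned>(std::min<uint64_t>(srcSignBits + minAmt, vtBits));
}
```

For example, -256 has 24 sign bits as an int32_t, so shifting it right by at least 4 yields at least `sraSignBitsLowerBound(24, 4, 32)`, i.e. 28, sign bits, which matches `numSignBits(-256 >> 4)` on compilers where `>>` on a negative value is an arithmetic shift (guaranteed since C++20).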

RKSimon · 2024-06-03T08:16:11Z

ping - #93182 is now finished, so this PR is ready to go.

RKSimon · 2024-06-06T08:14:38Z

ping? any objections to me getting this committed now please?

RKSimon · 2024-06-07T15:00:57Z

@jayfoad any more comments?

jayfoad

No objection from me. The logic looks good. But I don't feel I know enough about any of the affected targets to approve it.

RKSimon · 2024-06-07T16:55:27Z

@davemgreen @goldsteinn any objections?

RKSimon · 2024-06-12T08:36:00Z

ping?

jayfoad

LGTM

dtcxzyw · 2024-06-12T14:26:02Z

Hi @RKSimon, I think this patch causes some regressions on riscv: dtcxzyw/llvm-codegen-benchmark@97ad8e7

Reproducer:

; llc -mtriple=riscv64 test.ll -o -
define signext i64 @func000000000000002b(i32 signext %0) #0 {
entry:
  %1 = zext nneg i32 %0 to i64
  %2 = add nsw i64 %1, -1
  %3 = lshr i64 %2, 1
  %4 = add nuw nsw i64 %3, 1
  %5 = and i64 %4, 9223372036854775806
  ret i64 %5
}

Before (74f200b):

func000000000000002b:
        addi    a0, a0, -1
        srli    a0, a0, 1
        addi    a0, a0, 1
        andi    a0, a0, -2
        ret

After (47afa10):

func000000000000002b:
        addi    a0, a0, -1
        srli    a0, a0, 1
        addi    a0, a0, 1
        li      a1, -3
        srli    a1, a1, 1
        and     a0, a0, a1
        ret
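As a reading aid for the two listings, here is a small standalone sketch of the mask arithmetic. It only models the instructions shown above: `li a1, -3` followed by `srli a1, a1, 1` rebuilds the full 64-bit constant from the IR, while `andi a0, a0, -2` uses a sign-extended immediate that differs from it only in bit 63, presumably valid because that bit is known to be zero at that point. The function names are hypothetical.

```cpp
#include <cstdint>

// Mask constant taken from the `and` in the reproducer IR (0x7FFFFFFFFFFFFFFE).
constexpr uint64_t kIRMask = 9223372036854775806ULL;

// "After" sequence: li a1, -3 ; srli a1, a1, 1 (logical shift of a 64-bit register).
constexpr uint64_t maskAfter() {
  return static_cast<uint64_t>(int64_t{-3}) >> 1;
}

// "Before" sequence: andi sign-extends its immediate -2 to 64 bits.
constexpr uint64_t maskBefore() {
  return static_cast<uint64_t>(int64_t{-2});
}

// Hypothetical model of the function body for a given input and mask.
constexpr uint64_t body(uint32_t x, uint64_t mask) {
  uint64_t t = static_cast<uint64_t>(x); // zext i32 -> i64
  t = t - 1;                             // add -1
  t = t >> 1;                            // lshr 1
  t = t + 1;                             // add 1
  return t & mask;                       // and with the mask
}
```

For inputs from 1 to 0x7FFFFFFF, where the `nneg` and `nsw` flags in the reproducer hold, bit 63 of the intermediate value is clear, so `body(x, maskBefore())` and `body(x, maskAfter())` agree; the regression is the two extra instructions, not a change in the result.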

dtcxzyw · 2024-06-12T14:29:53Z

SelectionDAG has 17 nodes:
  t0: ch,glue = EntryToken
                  t2: i64,ch = CopyFromReg t0, Register:i64 %0
                t4: i64 = AssertSext t2, ValueType:ch:i32
              t5: i32 = truncate t4
            t6: i64 = sign_extend t5
          t8: i64 = add nsw t6, Constant:i64<-1>
        t10: i64 = srl t8, Constant:i64<1>
      t11: i64 = add nuw nsw t10, Constant:i64<1>
    t13: i64 = and t11, Constant:i64<9223372036854775806>
  t15: ch,glue = CopyToReg t0, Register:i64 $x10, t13
  t16: ch = RISCVISD::RET_GLUE t15, Register:i64 $x10, t15:1



Combining: t16: ch = RISCVISD::RET_GLUE t15, Register:i64 $x10, t15:1

Combining: t15: ch,glue = CopyToReg t0, Register:i64 $x10, t13

Combining: t14: i64 = Register $x10

Combining: t13: i64 = and t11, Constant:i64<9223372036854775806>
Creating constant: t17: i32 = Constant<-1>
Creating new node: t18: i32 = avgfloors t5, Constant:i32<-1>
Creating new node: t19: i64 = sign_extend t18

RKSimon · 2024-06-12T14:42:53Z

cheers - looking at this now

vitalybuka · 2024-06-12T17:48:49Z

This failure was probably caused by this patch, as it is the only DAG change in the blame list:
https://lab.llvm.org/buildbot/#/builders/237/builds/7908

Can you please fix or revert?

FYI @fmayer

dtcxzyw · 2024-06-12T18:01:41Z

Should be fixed by ca33796.

RKSimon requested review from jayfoad, davemgreen, topperc and goldsteinn May 14, 2024 11:13

RKSimon force-pushed the legal-avg branch from e1f4018 to 40d1b4c Compare May 14, 2024 11:55

jayfoad reviewed May 14, 2024

llvm/include/llvm/CodeGen/TargetLowering.h

jayfoad reviewed May 14, 2024

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

RKSimon force-pushed the legal-avg branch 6 times, most recently from c5278b3 to 57017b3 Compare May 14, 2024 14:38

RKSimon commented May 14, 2024

llvm/test/CodeGen/RISCV/rvv/fixed-vectors-vaaddu.ll

RKSimon force-pushed the legal-avg branch from 57017b3 to cf0be51 Compare May 15, 2024 14:53

jayfoad reviewed May 15, 2024

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp

RKSimon force-pushed the legal-avg branch 4 times, most recently from 016927e to 2f9a4fb Compare May 16, 2024 09:56

RKSimon force-pushed the legal-avg branch 2 times, most recently from ad8ab1e to 9c6aa40 Compare May 16, 2024 15:18

goldsteinn reviewed May 16, 2024

RKSimon force-pushed the legal-avg branch from 9c6aa40 to 33adfc8 Compare May 16, 2024 16:48

RKSimon mentioned this pull request May 16, 2024

[DAG] Fold AVGU(ZEXT(X),ZEXT(Y)) -> ZEXT(AVGU(X,Y)) #86301

Closed

RKSimon force-pushed the legal-avg branch from 33adfc8 to 0790923 Compare May 17, 2024 11:28

RKSimon added a commit that referenced this pull request May 19, 2024

[DAG] visitAVG - rewrite "fold (avgfloor x, 0) -> x >> 1" to use SDPatternMatch

9f5c8de

No need for this to be vector specific, and it's more likely that scalar cases will appear after #92096

RKSimon force-pushed the legal-avg branch from 4de1df7 to a970e86 Compare June 1, 2024 15:52

RKSimon force-pushed the legal-avg branch from a970e86 to 062c407 Compare June 3, 2024 09:44

RKSimon mentioned this pull request Jun 5, 2024

[SelectionDAG] Fold (avg x, 0) -> x >> 1 #85581

Closed

RKSimon force-pushed the legal-avg branch from 062c407 to 6855fb7 Compare June 6, 2024 08:09

jayfoad reviewed Jun 6, 2024

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp

RKSimon force-pushed the legal-avg branch 3 times, most recently from 54b366e to fa32106 Compare June 7, 2024 12:15

jayfoad reviewed Jun 7, 2024

RKSimon mentioned this pull request Jun 12, 2024

[X86] Failure to produce chained PAVG's #51473

Closed

RKSimon force-pushed the legal-avg branch from fa32106 to 5d15207 Compare June 12, 2024 12:05

jayfoad approved these changes Jun 12, 2024

RKSimon force-pushed the legal-avg branch from 5d15207 to 58c869b Compare June 12, 2024 12:51

RKSimon merged commit ea2ee5d into llvm:main Jun 12, 2024
3 of 6 checks passed

RKSimon deleted the legal-avg branch June 12, 2024 13:11

dtcxzyw mentioned this pull request Jun 12, 2024

Update diff June 12th 2024, 1:51:24 pm dtcxzyw/llvm-codegen-benchmark#66

Closed

RKSimon mentioned this pull request Jun 12, 2024

[DAG] Regression due to creation of avgfloors nodes #95284

Closed
