[mlir][arith] fix wrong floordivsi fold (#83079) #83248

lipracer · 2024-02-28T11:17:51Z

Fixs #83079

llvmbot · 2024-02-28T11:18:21Z

@llvm/pr-subscribers-llvm-adt
@llvm/pr-subscribers-llvm-support
@llvm/pr-subscribers-mlir

@llvm/pr-subscribers-mlir-arith

Author: long.chen (lipracer)

Changes

Fixs #83079

Full diff: https://github.com/llvm/llvm-project/pull/83248.diff

2 Files Affected:

(modified) mlir/lib/Dialect/Arith/IR/ArithOps.cpp (+9-3)
(modified) mlir/test/Transforms/canonicalize.mlir (+9)

diff --git a/mlir/lib/Dialect/Arith/IR/ArithOps.cpp b/mlir/lib/Dialect/Arith/IR/ArithOps.cpp
index 0f71c19c23b654..d370b7d04dea7e 100644
--- a/mlir/lib/Dialect/Arith/IR/ArithOps.cpp
+++ b/mlir/lib/Dialect/Arith/IR/ArithOps.cpp
@@ -709,19 +709,25 @@ OpFoldResult arith::FloorDivSIOp::fold(FoldAdaptor adaptor) {
         }
         if (!aGtZero && !bGtZero) {
           // Both negative, return -a / -b.
-          APInt posA = zero.ssub_ov(a, overflowOrDiv0);
-          APInt posB = zero.ssub_ov(b, overflowOrDiv0);
-          return posA.sdiv_ov(posB, overflowOrDiv0);
+          return a.sdiv_ov(b, overflowOrDiv0);
         }
         if (!aGtZero && bGtZero) {
           // A is negative, b is positive, return - ceil(-a, b).
           APInt posA = zero.ssub_ov(a, overflowOrDiv0);
+          if (overflowOrDiv0)
+            return a;
           APInt ceil = signedCeilNonnegInputs(posA, b, overflowOrDiv0);
+          if (overflowOrDiv0)
+            return a;
           return zero.ssub_ov(ceil, overflowOrDiv0);
         }
         // A is positive, b is negative, return - ceil(a, -b).
         APInt posB = zero.ssub_ov(b, overflowOrDiv0);
+        if (overflowOrDiv0)
+          return a;
         APInt ceil = signedCeilNonnegInputs(a, posB, overflowOrDiv0);
+        if (overflowOrDiv0)
+          return a;
         return zero.ssub_ov(ceil, overflowOrDiv0);
       });
 
diff --git a/mlir/test/Transforms/canonicalize.mlir b/mlir/test/Transforms/canonicalize.mlir
index 2cf86b50d432f6..d2c2c12d323892 100644
--- a/mlir/test/Transforms/canonicalize.mlir
+++ b/mlir/test/Transforms/canonicalize.mlir
@@ -989,6 +989,15 @@ func.func @tensor_arith.floordivsi_by_one(%arg0: tensor<4x5xi32>) -> tensor<4x5x
   return %res : tensor<4x5xi32>
 }
 
+// CHECK-LABEL: func @arith.floordivsi_by_one_overflow
+func.func @arith.floordivsi_by_one_overflow() -> i64 {
+  %neg_one = arith.constant -1 : i64
+  %min_int = arith.constant -9223372036854775808 : i64
+  // CHECK: arith.floordivsi
+  %poision = arith.floordivsi %min_int, %neg_one : i64
+  return %poision : i64
+}
+
 // -----
 
 // CHECK-LABEL: func @arith.ceildivsi_by_one

kuhar

Looks fine upon a quick scan of the patch. Given the number or error code paths, I think it could be made more readable with something like constFoldBinaryOpConditional IIRC.

pingshiyu · 2024-02-28T13:59:32Z

Not sure this is the right patch, please see this comment

lipracer · 2024-03-03T06:41:54Z

@kuhar There are indeed many overflow states that need to be checked here, but I haven't come up with a good method yet. Can you give me a specific suggestion? Thank you.

kuhar · 2024-03-03T18:07:31Z

@kuhar There are indeed many overflow states that need to be checked here, but I haven't come up with a good method yet. Can you give me a specific suggestion? Thank you.

Without knowing the exact details of the floordivsi algorithm, the way I'd think about it is that we only need to detect if the final overflow happened or not. And of course fold correctly when there is no overflow. If there are intermediate overflows, I'd think that they can fold into one of the two buckets: (a) those that indicate that the floordivsi overflows for these arguments, and (b) implementation bugs.

It might be helpful to extract this fold code to a unit test (say one of the APInt test files) and go from there. Pick a few inputs that are known to overflow and trace the values and intermediate overflows. Maybe write a separate check to decide if overflow happens for the given input instead of relying on some intermediate checks.

kuhar · 2024-03-03T18:09:29Z

@lipracer Another idea: we could perform the division on APInt with more bits than the bitwidth of the folded integer type. Maybe this way it'd be easier to detect overflow. And then once we are confident there's no overflow, truncate it back to the desired bitwidth.

kuhar

@lipracer could you open a separate PR for the APInt change? This will make it easier to review and land properly.

lipracer · 2024-03-11T02:47:45Z

Ok.

lipracer · 2024-03-11T05:21:23Z

submit a APInt change PR.

lipracer · 2024-03-18T06:32:14Z

@lipracer could you open a separate PR for the APInt change? This will make it easier to review and land properly.

APInt change has already merged.

kuhar

Looks fine upon a quick scan but I won't have the time to understand the details of the expansion in near future. Please get a second approval before landing.

lipracer · 2024-03-18T07:45:54Z

Thank you very much for your valuable suggestion.

lipracer · 2024-03-20T08:51:18Z

@joker-eph I'd like someone to review my code, could you help me with that?

Mogball

This change almost certainly has performance implications. Do we have some way to measure or even guess at that? If this is fixing a correctness bug, I am also not the right person to verify that, but I trust the author in this

joker-eph · 2024-03-20T18:02:21Z

@pingshiyu would you be able to review the correctness of this change?

lipracer · 2024-03-20T18:18:05Z

This change almost certainly has performance implications. Do we have some way to measure or even guess at that? If this is fixing a correctness bug, I am also not the right person to verify that, but I trust the author in this

Yes, this may indeed affect performance. This implementation is based on tensorflow xla expansion. For performance considerations, I think we can have a compilation option similar to ‘fast-expand’ for expansion. This change is only consistent with the behavior of llvm.

pingshiyu · 2024-03-21T00:24:58Z

@pingshiyu would you be able to review the correctness of this change?

sure, I've just taken some time to check through the cases, lgtm re correctness.

thanks @lipracer!

maybe some extra tests would be nice for the edge cases, e.g. floordivsi(min_val, 1), and floordivsi(max_val, -1).

lipracer · 2024-03-21T07:31:57Z

@pingshiyu Thanks, I have added more corner tests.

lipracer · 2024-03-22T04:46:59Z

The CI error seems unrelated to this change. If the performance drops after merging, I will submit another change to enable "fast expand".

joker-eph · 2024-03-22T04:53:34Z

The premerge is green at HEAD: can you rebase? https://lab.llvm.org/buildbot/#/builders/271

joker-eph · 2024-03-22T04:56:37Z

Nevermind I misread the log, it's failing on building flang by running out of memory:

�_bk;t=1711039712398�C:\ws\src\flang\include\flang\Evaluate\tools.h(236): fatal error C1060: compiler is out of heap space

I concur that it is unrelated, and all the MLIR tests are passing.

lipracer · 2024-03-22T04:58:12Z

Thanks, I will rebase on head.

1) fix floordivsi error expand logic 2) fix floordivsi fold did't check overflow stat Fixs llvm#83079

Fixs llvm#83079

llvmbot added mlir mlir:arith labels Feb 28, 2024

kuhar requested review from joker-eph, kuhar and zero9178 February 28, 2024 12:38

kuhar reviewed Feb 28, 2024

View reviewed changes

lipracer force-pushed the fix-arith-fold branch from 911bc26 to 1fff7f9 Compare March 9, 2024 07:54

llvmbot added llvm:support llvm:adt labels Mar 9, 2024

lipracer force-pushed the fix-arith-fold branch from c4bfc60 to 52ff01d Compare March 9, 2024 13:38

lipracer requested a review from kuhar March 11, 2024 02:37

kuhar requested changes Mar 11, 2024

View reviewed changes

lipracer force-pushed the fix-arith-fold branch from f231e71 to 0939c26 Compare March 11, 2024 05:09

lipracer requested a review from kuhar March 11, 2024 14:36

lipracer force-pushed the fix-arith-fold branch 2 times, most recently from c241567 to fcaa9e5 Compare March 15, 2024 11:11

lipracer removed llvm:support llvm:adt labels Mar 15, 2024

kuhar approved these changes Mar 18, 2024

View reviewed changes

lipracer requested a review from Mogball March 19, 2024 02:17

Mogball approved these changes Mar 20, 2024

View reviewed changes

lipracer force-pushed the fix-arith-fold branch from f3bcea4 to 1217ab5 Compare March 22, 2024 05:01

lipracer added 2 commits March 22, 2024 13:01

[mlir][arith] fix wrong floordivsi fold (llvm#83079)

ec294c9

1) fix floordivsi error expand logic 2) fix floordivsi fold did't check overflow stat Fixs llvm#83079

add more corner test

1217ab5

lipracer merged commit 631e54a into llvm:main Mar 22, 2024
4 checks passed

chencha3 pushed a commit to chencha3/llvm-project that referenced this pull request Mar 23, 2024

[mlir][arith] fix wrong floordivsi fold (llvm#83248)

1918dcb

Fixs llvm#83079

lipracer deleted the fix-arith-fold branch March 24, 2024 15:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[mlir][arith] fix wrong floordivsi fold (#83079) #83248

[mlir][arith] fix wrong floordivsi fold (#83079) #83248

lipracer commented Feb 28, 2024

llvmbot commented Feb 28, 2024 •

edited

kuhar left a comment

pingshiyu commented Feb 28, 2024

lipracer commented Mar 3, 2024

kuhar commented Mar 3, 2024

kuhar commented Mar 3, 2024

kuhar left a comment •

edited

lipracer commented Mar 11, 2024

lipracer commented Mar 11, 2024

lipracer commented Mar 18, 2024

kuhar left a comment

lipracer commented Mar 18, 2024

lipracer commented Mar 20, 2024

Mogball left a comment

joker-eph commented Mar 20, 2024

lipracer commented Mar 20, 2024 •

edited

pingshiyu commented Mar 21, 2024 •

edited

lipracer commented Mar 21, 2024

lipracer commented Mar 22, 2024

joker-eph commented Mar 22, 2024

joker-eph commented Mar 22, 2024

lipracer commented Mar 22, 2024

[mlir][arith] fix wrong floordivsi fold (#83079) #83248

[mlir][arith] fix wrong floordivsi fold (#83079) #83248

Conversation

lipracer commented Feb 28, 2024

llvmbot commented Feb 28, 2024 • edited

kuhar left a comment

Choose a reason for hiding this comment

pingshiyu commented Feb 28, 2024

lipracer commented Mar 3, 2024

kuhar commented Mar 3, 2024

kuhar commented Mar 3, 2024

kuhar left a comment • edited

Choose a reason for hiding this comment

lipracer commented Mar 11, 2024

lipracer commented Mar 11, 2024

lipracer commented Mar 18, 2024

kuhar left a comment

Choose a reason for hiding this comment

lipracer commented Mar 18, 2024

lipracer commented Mar 20, 2024

Mogball left a comment

Choose a reason for hiding this comment

joker-eph commented Mar 20, 2024

lipracer commented Mar 20, 2024 • edited

pingshiyu commented Mar 21, 2024 • edited

lipracer commented Mar 21, 2024

lipracer commented Mar 22, 2024

joker-eph commented Mar 22, 2024

joker-eph commented Mar 22, 2024

lipracer commented Mar 22, 2024

llvmbot commented Feb 28, 2024 •

edited

kuhar left a comment •

edited

lipracer commented Mar 20, 2024 •

edited

pingshiyu commented Mar 21, 2024 •

edited