[InstCombine] Missed optimization: Fold `usub_sat((sub nuw C1, A), C2)` to `usub_sat(C1 - C2, A)` or `0` #82177

XChy · 2024-02-18T17:02:00Z

Alive2 proof: https://alive2.llvm.org/ce/z/Bre2we

Motivating example

define i32 @src(i32 %a){
entry:
  %add = sub nuw i32 64, %a
  %cond = call i32 @llvm.usub.sat.i32(i32 %add, i32 14)
  ret i32 %cond
}

can be folded to:

define i32 @tgt(i32 %a){
entry:
  %cond = call i32 @llvm.usub.sat.i32(i32 50, i32 %a)
  ret i32 %cond
}

When C2 u< C1, we get usub_sat(C1 - C2, A), otherwise we get 0. See also the examples in alive2 proof.

Real-world motivation

This snippet of IR is derived from jemalloc/src/psset.c@psset_maybe_remove_purge_list (after O3 pipeline).
The example above is a reduced version. If you're interested in the original suboptimal IR and optimal IR, contact me to get it, please. Actually such pattern is found frequently in the IRs in jemalloc project.

Let me know if you can confirm that it's an optimization opportunity, thanks.

The text was updated successfully, but these errors were encountered:

elhewaty · 2024-02-19T01:47:39Z

I will work on this.

elhewaty · 2024-02-19T02:30:29Z

If I need to confirm the transformation for vectors.
what can I use? I think llvm.assume doesn't work with vectors.

cyk2018 · 2024-02-19T04:36:47Z

It seems we must keep the equality between two expression, in your example, it is C1 - A u< C2 with C1 - C2 u< A。And can you give the example such as how can I get this IR from C or C++ Code.

XChy · 2024-02-19T05:38:51Z

If I need to confirm the transformation for vectors. what can I use? I think llvm.assume doesn't work with vectors.

Since this pattern doesn't involve vector operation, the correctness for scalars implies the correctness for vectors. However, If you do want to verify this pattern in Alive2, you can transform %cmp = icmp ult <4 x i32> %c2, %c1 into a single i1 with @llvm.vector.reduce.and.i1, and then apply llvm.assume.

XChy · 2024-02-19T05:52:09Z

@cyk2018, tools like https://github.com/travitch/whole-program-llvm and https://github.com/SRI-CSL/gllvm can extract IRs when compiling the whole project. For a single C/C++ file, just clang -emit-llvm -S example.c.

cyk2018 · 2024-02-20T11:57:05Z

@cyk2018, tools like https://github.com/travitch/whole-program-llvm and https://github.com/SRI-CSL/gllvm can extract IRs when compiling the whole project. For a single C/C++ file, just clang -emit-llvm -S example.c.

Thanks for your links. I am confused in how to generate nuw flag. I have found this method that SCCPSolver can generate this after analysis. The question is not very relevant with this issue. Still sincerely thanks.

… A) or 0 (#82280) - Fixes: #82177 - Alive2: https://alive2.llvm.org/ce/z/Q7mMC3

github-actions bot added the new issue label Feb 18, 2024

XChy changed the title ~~[InstCombine] Fold usub_sat((sub nuw C1, A), C2) to usub_sat(C1 - C2, A) or 0~~ [InstCombine] Missed optimization: Fold usub_sat((sub nuw C1, A), C2) to usub_sat(C1 - C2, A) or 0 Feb 18, 2024

nikic added llvm:instcombine missed-optimization and removed new issue labels Feb 18, 2024

XChy assigned elhewaty Feb 19, 2024

elhewaty mentioned this issue Feb 19, 2024

[InstCombine] Fold usub_sat((sub nuw C1, A), C2) to usub_sat(C1 - C2, A) or 0 #82280

Merged

XChy mentioned this issue Mar 5, 2024

Replace unsigned induction variable with size_t in background_threads jemalloc/jemalloc#2611

Merged

nikic closed this as completed in #82280 Mar 11, 2024

nikic pushed a commit that referenced this issue Mar 11, 2024

[InstCombine] Fold usub_sat((sub nuw C1, A), C2) to usub_sat(C1 - C2,…

3f302ea

… A) or 0 (#82280) - Fixes: #82177 - Alive2: https://alive2.llvm.org/ce/z/Q7mMC3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[InstCombine] Missed optimization: Fold `usub_sat((sub nuw C1, A), C2)` to `usub_sat(C1 - C2, A)` or `0` #82177

[InstCombine] Missed optimization: Fold `usub_sat((sub nuw C1, A), C2)` to `usub_sat(C1 - C2, A)` or `0` #82177

XChy commented Feb 18, 2024

elhewaty commented Feb 19, 2024

elhewaty commented Feb 19, 2024

cyk2018 commented Feb 19, 2024

XChy commented Feb 19, 2024

XChy commented Feb 19, 2024

cyk2018 commented Feb 20, 2024

[InstCombine] Missed optimization: Fold usub_sat((sub nuw C1, A), C2) to usub_sat(C1 - C2, A) or 0 #82177

[InstCombine] Missed optimization: Fold usub_sat((sub nuw C1, A), C2) to usub_sat(C1 - C2, A) or 0 #82177

Comments

XChy commented Feb 18, 2024

Motivating example

Real-world motivation

elhewaty commented Feb 19, 2024

elhewaty commented Feb 19, 2024

cyk2018 commented Feb 19, 2024

XChy commented Feb 19, 2024

XChy commented Feb 19, 2024

cyk2018 commented Feb 20, 2024

[InstCombine] Missed optimization: Fold `usub_sat((sub nuw C1, A), C2)` to `usub_sat(C1 - C2, A)` or `0` #82177

[InstCombine] Missed optimization: Fold `usub_sat((sub nuw C1, A), C2)` to `usub_sat(C1 - C2, A)` or `0` #82177