Missed optimization of uaddo(x, x) #57330

chfast · 2022-08-24T07:51:35Z

_Bool src(unsigned x, unsigned* acc) {
    unsigned s = x + x;
    *acc = s;
    return (s < x);
}

_Bool tgt(unsigned x, unsigned* acc) {
    return __builtin_uadd_overflow(x, x, acc);
}

https://godbolt.org/z/PsGoGGWj8

The uaddo is not created for the case of x + x. This would save us single cmp instruction. This may be because the x + x is optimized to x << 1.

Discovered while analyzing #57316.

The text was updated successfully, but these errors were encountered:

llvmbot · 2022-08-24T07:53:56Z

@llvm/issue-subscribers-backend-x86

RKSimon · 2022-08-24T09:17:05Z

@rotateright Do you think we'd be better off handling this in InstCombine?

https://alive2.llvm.org/ce/z/ZHhUPZ suggests we don't match the general pattern either

chfast · 2022-08-24T09:25:52Z

This pattern is matched on IR level but only in CodeGenPrepare (so after opt -O3). I have no idea why. https://alive2.llvm.org/ce/z/3z32e3

rotateright · 2022-08-24T12:09:40Z

The choice to not canonicalize to the intrinsics in IR (move the transform to CGP) was made with:
https://reviews.llvm.org/D8889

Even in CGP, we've made several adjustments since then to get/avoid different asm for various targets.

It's possible that we've progressed enough in analysis that the decision can be revisited, but it requires looking at the output for several different patterns on multiple targets to make sure nothing regresses.

rotateright · 2022-08-24T12:20:27Z

Filed a beginner bug -- #57338 -- to reduce the icmp:
https://alive2.llvm.org/ce/z/sTrumT

rotateright · 2022-08-24T12:41:12Z

This example (even if it seems unlikely in practice) also demonstrates the difficulty of reconciling IR and codegen on patterns like this:
https://godbolt.org/z/TxKqhbecE
After we decouple the math from the overflow calc, it could be hard to bring them back together (for example, they moved to different basic blocks).

chfast added llvm:codegen missed-optimization backend:X86 labels Aug 24, 2022

rotateright mentioned this issue Aug 24, 2022

[InstCombine] reduce test-for-overflow of shifted value #57338

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Missed optimization of uaddo(x, x) #57330

Missed optimization of uaddo(x, x) #57330

chfast commented Aug 24, 2022

llvmbot commented Aug 24, 2022

RKSimon commented Aug 24, 2022

chfast commented Aug 24, 2022

rotateright commented Aug 24, 2022

rotateright commented Aug 24, 2022

rotateright commented Aug 24, 2022

Missed optimization of uaddo(x, x) #57330

Missed optimization of uaddo(x, x) #57330

Comments

chfast commented Aug 24, 2022

llvmbot commented Aug 24, 2022

RKSimon commented Aug 24, 2022

chfast commented Aug 24, 2022

rotateright commented Aug 24, 2022

rotateright commented Aug 24, 2022

rotateright commented Aug 24, 2022