Comparison patterns for PPC #289

dbalatoni13 · 2025-03-22T08:18:52Z

I added a test case for how MWCC 1.3.2/2.6 handles a != 0 and a != b and code to handle it. In this PR I plan to fix all possible combinations from the previous and new test.

Feel free to give feedback to the commits in the meantime! As I wrote on Discord, I'm still learning when to use early_unwrap_ints over early_unwrap. I have a TODO comment and I'm also not sure why the input of replace_clz_shift is first passed to fold_divmod, should we do that here too?

dbalatoni13 · 2025-03-22T08:35:55Z

I'll add a comment about the version info and flags.

simonlindholm · 2025-03-22T09:22:58Z

I'm also not sure why the input of replace_clz_shift is first passed to fold_divmod, should we do that here too?

That's just a case of "try all the patterns one at a time", first fold_divmod, then replace_clz_shift, then (newly added) replace_or_shift. The way you've added it looks good to me.

simonlindholm

Looks good. Happy to merge this now and do the other comparisons in a follow-up if you feel like it, but also ok with awaiting those

m2c/evaluate.py

dbalatoni13 · 2025-03-22T13:50:18Z

That's just a case of "try all the patterns one at a time", first fold_divmod, then replace_clz_shift, then (newly added) replace_or_shift. The way you've added it looks good to me.

Oh, right. Should I leave it like this then or can I chain it one more time? I think the other new patterns will go to the same place, so I'm not sure.

dbalatoni13 · 2025-03-22T13:52:03Z

Looks good. Happy to merge this now and do the other comparisons in a follow-up if you feel like it, but also ok with awaiting those

I won't take long, so I'd like to do it here to avoid cluttering PRs

Co-authored-by: Simon Lindholm <simon.lindholm10@gmail.com>

simonlindholm · 2025-03-22T14:17:38Z

Oh, right. Should I leave it like this then or can I chain it one more time?

Either way is fine. Probably best to be consistent, i.e. either

            lower_bits = replace_or_shift(replace_clz_shift(fold_divmod(lower_bits)))

or

            lower_bits = fold_divmod(lower_bits)
            lower_bits = replace_clz_shift(lower_bits)
            lower_bits = replace_or_shift(lower_bits)

Maybe the latter is a bit more readable?

dbalatoni13 · 2025-03-22T14:29:58Z

Maybe the latter is a bit more readable?

Yes, especially if considering that I'll have to add more.

dbalatoni13 · 2025-03-23T04:40:22Z

a < b and b > a generate the exact same code, I wonder which one I should go for.

dbalatoni13 · 2025-03-23T06:35:16Z

Hmm, if you look at a <= b in the comparison2 test, it's decompiled perfectly. But a >= b has casts, even though it's the exact same thing in the asm. This is how the expressions look like:
((arg1 >> 0x1F) + (arg0 >> 0x1FU) + M2C_CARRY)
(((s32) arg0 >> 0x1F) + ((u32) arg1 >> 0x1FU) + M2C_CARRY)

Do you think we can do something about that safely?

dbalatoni13 · 2025-03-23T06:53:22Z

m2c/evaluate.py

+    left_expr = early_unwrap_ints(expr.left)
+    if not (isinstance(left_expr, BinaryOp) and left_expr.op == "+"):
+        return expr
+    # We call this function only from carry_add_to so it's not necessary to check the right side


Now that I think about it, the expression might be changed by fold_divmod by this point. I think I should either change the order or just add an extra check here.

simonlindholm · 2025-03-23T10:38:00Z

Btw, it would be worth adding a test for unsigned comparisons as well.

dbalatoni13 · 2025-03-23T19:53:00Z

Oh wow, that generates code that uses an instruction that's not even implemented yet: orc. I'll take a look at the patterns from the old comparison test and leave the orc and the pattern using it for other PRs.

dbalatoni13 · 2025-03-23T21:58:07Z

Actually, let's look at the patterns from the old comparison test some other time. Looking forward to your feedback to merge this :)

simonlindholm · 2025-03-23T23:06:42Z

I'll take a look tomorrow!

dbalatoni13 · 2025-03-24T06:23:14Z

Thanks! I just had this idea: if we know that a certain pattern is only generated if the args are u32 for example, can't we use this information to guess the types better? 🤔

simonlindholm · 2025-03-24T07:13:18Z

Yes, that would be ideal! Comparisons have different semantics between signed and unsigned so it's natural that they would give different codegen, and it's certainly useful to expose the types to the user.

dbalatoni13 · 2025-03-25T23:05:21Z

May I ask how to do that? :D Though the problem is that in the a < b case we don't know which one from the two is u32. It generates this if either of them is u32.

dbalatoni13 · 2025-03-27T02:52:38Z

I feel like arg0 >> 31 and (arg0 >> 31) ^ 1 might be too short to not create false positives. On the other hand: should I implement these three?:

temp_r0 = CLZ(arg0);
global = ((1 << (temp_r0 & 0x1F)) & 1) | ((1 >> (0x20 - (temp_r0 & 0x1F))) & 1);
global = (u32) (-(s32) arg0 & ~arg0) >> 0x1FU;
global = (u32) ((arg1 | ~arg0) - ((u32) (arg1 - arg0) >> 1U)) >> 0x1FU;

dbalatoni13 · 2025-03-28T04:53:13Z

I ran this through the whole Mario Party 4 decomp and these new patterns caused 30 files to be changed. Only one of them is a false positive, but that function is a mess anyways. I noticed that the order in a < b actually matters when matching, because of the order the values are loaded in, we've sadly lost this info by the point we handle the expression as a whole.

simonlindholm · 2025-03-28T23:33:00Z

May I ask how to do that? :D Though the problem is that in the a < b case we don't know which one from the two is u32. It generates this if either of them is u32.

I would suggest using as_uintish() on both operands, which will try to coerce them both to unsigned, falling back to a cast if it fails. It might be wrong (as you say only one has to be u32 for C to auto-convert to u32) but it still feels like a better guess than nothing at all?

On the other hand: should I implement these three?:

Well, sure, sounds good to me. Do you want to do it in the same PR or another? (I'm already taking too long to get around to reviewing this one I feel like...)

simonlindholm · 2025-03-28T23:43:31Z

Actually, for comparisons specifically you can use BinaryOp.ucmp() for uint compares, which is shorthand for doing as_uintish() on each argument.

tests/end_to_end/comparison2/orig.c

simonlindholm · 2025-03-28T23:37:09Z

m2c/evaluate.py

+
+def replace_shift_add_carry(expr: BinaryOp) -> BinaryOp:
+    """
+    Simplify the expressions matching `((b >> 31) + (a >> 31U)) + M2C_CARRY`


(Hm, I wish M2C_CARRY expressions kept the addition/subtraction that resulted in the carry, it's tricky to understand why the pattern makes sense right now)

m2c/evaluate.py

Handle a != 0 and a != b on MWCC 1.3.2/2.6

329865a

simonlindholm approved these changes Mar 22, 2025

View reviewed changes

m2c/evaluate.py Outdated Show resolved Hide resolved

m2c/evaluate.py Outdated Show resolved Hide resolved

m2c/evaluate.py Outdated Show resolved Hide resolved

Remove comment

6bc325f

Co-authored-by: Simon Lindholm <simon.lindholm10@gmail.com>

Implement changes based on what we discussed

f937f1e

dbalatoni13 added 4 commits March 23, 2025 07:00

Add a < b pattern for MWCC 2.6

c415903

Remove duplicated check

fb529cb

Add a <= b pattern for MWCC 2.6

90f487f

Ran the formatter

cf05664

dbalatoni13 commented Mar 23, 2025

View reviewed changes

dbalatoni13 added 2 commits March 23, 2025 22:20

Check carry bit and extend the tests

0192bd8

Add pattern for u32 a < b

9e97433

dbalatoni13 and others added 2 commits March 27, 2025 03:29

Merge branch 'matt-kempster:master' into comparisons

2eb04ec

Overwrite tests

8f9ee85

simonlindholm reviewed Mar 28, 2025

View reviewed changes

dbalatoni13 added 2 commits March 30, 2025 01:05

Delete u8 and s8 tests as they are identical to u16 and s16

27007b8

Remove early_unwraps

7655bac

Comparison patterns for PPC #289

Are you sure you want to change the base?

Comparison patterns for PPC #289

Uh oh!

Conversation

dbalatoni13 commented Mar 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dbalatoni13 commented Mar 22, 2025

Uh oh!

simonlindholm commented Mar 22, 2025

Uh oh!

simonlindholm left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dbalatoni13 commented Mar 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dbalatoni13 commented Mar 22, 2025

Uh oh!

simonlindholm commented Mar 22, 2025

Uh oh!

dbalatoni13 commented Mar 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dbalatoni13 commented Mar 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dbalatoni13 commented Mar 23, 2025

Uh oh!

dbalatoni13 Mar 23, 2025

Choose a reason for hiding this comment

Uh oh!

simonlindholm commented Mar 23, 2025

Uh oh!

dbalatoni13 commented Mar 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dbalatoni13 commented Mar 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

simonlindholm commented Mar 23, 2025

Uh oh!

dbalatoni13 commented Mar 24, 2025

Uh oh!

simonlindholm commented Mar 24, 2025

Uh oh!

dbalatoni13 commented Mar 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dbalatoni13 commented Mar 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dbalatoni13 commented Mar 28, 2025

Uh oh!

simonlindholm commented Mar 28, 2025

Uh oh!

simonlindholm commented Mar 28, 2025

Uh oh!

Uh oh!

simonlindholm Mar 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

dbalatoni13 commented Mar 22, 2025 •

edited

Loading

dbalatoni13 commented Mar 22, 2025 •

edited

Loading

dbalatoni13 commented Mar 22, 2025 •

edited

Loading

dbalatoni13 commented Mar 23, 2025 •

edited

Loading

dbalatoni13 commented Mar 23, 2025 •

edited

Loading

dbalatoni13 commented Mar 23, 2025 •

edited

Loading

dbalatoni13 commented Mar 25, 2025 •

edited

Loading

dbalatoni13 commented Mar 27, 2025 •

edited

Loading