Arm64Emitter: Simplify LogicalImm logic #10807

merryhime · 2022-07-03T17:28:32Z

Simplify logical immediate encoding logic.

This is based on the observation that if a valid repeating element exists, it repeats throughout `value`, so it does not matter which repetition you analyse. We therefore skip over the least significant element if LSB = 1 by masking it out with `inverse_mask_from_trailing_ones`, which avoids the degenerate case of a stretch of 1 bits going 'round the end' of the word.

dougallj · 2022-07-07T06:26:42Z

Source/Core/Common/Arm64Emitter.h

-    // it.
-    n = out_n;
+    const u64 inverse_mask_from_trailing_ones = ~value | (value + 1);
+    const size_t rotation = Common::LeastSignificantSetBit(value & inverse_mask_from_trailing_ones);


LeastSignificantSetBit(x) can just return __builtin_ctzll(x) - the result of which is undefined if x is 0, which it can be in this case. Maybe add a Common::CountTrailingZeros matching the zero-handling of Common::CountLeadingZeros?
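
A minimal sketch of what a zero-safe trailing-zero count could look like. The name `CountTrailingZerosSketch` is hypothetical, and returning 64 for a zero input is an assumption about the desired zero-handling, not a statement of what the Dolphin helper actually does.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: counts trailing zero bits of a 64-bit value.
// Assumption: a zero input should report the full bit width (64)
// instead of hitting the undefined behaviour of __builtin_ctzll(0).
inline int CountTrailingZerosSketch(std::uint64_t value)
{
  if (value == 0)
    return 64;
#if defined(__GNUC__) || defined(__clang__)
  return __builtin_ctzll(value);  // Well-defined here because value != 0.
#else
  // Portable fallback: shift right until the lowest bit is set.
  int count = 0;
  while ((value & 1) == 0)
  {
    value >>= 1;
    ++count;
  }
  return count;
#endif
}
```

With this shape, a caller that passes `value & (value + 1)` gets 64 when `value` has the form 2^k - 1, and can reduce that modulo 64 to obtain a rotation of zero.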

I'd usually clear the trailing ones with just value & (value + 1), but otherwise this looks good.
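
The two forms agree because `value & (~value | (value + 1))` distributes to `(value & ~value) | (value & (value + 1))`, and the first term is always zero. A small sketch that checks this on a few inputs (both function names are hypothetical, chosen for illustration only):

```cpp
#include <cassert>
#include <cstdint>

// Form used in the diff: build the mask first, then apply it.
constexpr std::uint64_t ClearTrailingOnesViaMask(std::uint64_t value)
{
  return value & (~value | (value + 1));
}

// Shorter form suggested in the review.
constexpr std::uint64_t ClearTrailingOnes(std::uint64_t value)
{
  return value & (value + 1);
}

// 0xB = 0b1011 has two trailing ones; clearing them leaves 0b1000.
static_assert(ClearTrailingOnes(0xB) == 0x8, "trailing ones cleared");
static_assert(ClearTrailingOnesViaMask(0xB) == 0x8, "same result via mask");
// A value of the form 2^k - 1 is cleared entirely.
static_assert(ClearTrailingOnes(0xFF) == 0, "all-trailing-ones input");
```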

(In case it's of interest, I wrote a blog post on optimising logical immediate encoding a while ago, which also removes the initial loop - it'd probably look something like this.)

Ha! Why am I not surprised this isn't a novel insight! Neat!

I intuitively knew you could mask trailing ones again after rotation to determine z+o, but the observation that verifying rotation also ensured esize was a power of two wasn't obvious to me.

Thanks! Yeah, I don't know if that one's obvious to anyone. (I'd consider adding an explicit "is power of two" check to spare future people who stumble upon this code from having to understand that, but it feels silly to add code that I know is redundant.)
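
For readers following the thread, here is a rough sketch of the analysis being discussed, with the redundant power-of-two check expressed as a debug assertion. All names (`LogicalImmPattern`, `AnalyseLogicalImm64`, `Ctz64`, `Clz64`, `Ror64`) are hypothetical, and the sketch only recovers the element size, run length and rotation; it stops short of packing the N/immr/imms fields and is not the code merged in this PR.

```cpp
#include <cassert>
#include <cstdint>

struct LogicalImmPattern
{
  int esize;     // Element size in bits (2, 4, ..., 64).
  int ones;      // Length of the run of 1 bits inside one element.
  int rotation;  // Left-rotation that maps the normalized pattern to value.
};

// Zero-safe bit helpers (assumption: zero input reports 64).
inline int Ctz64(std::uint64_t v)
{
  if (v == 0)
    return 64;
  int n = 0;
  while ((v & 1) == 0) { v >>= 1; ++n; }
  return n;
}

inline int Clz64(std::uint64_t v)
{
  if (v == 0)
    return 64;
  int n = 0;
  while ((v >> 63) == 0) { v <<= 1; ++n; }
  return n;
}

inline std::uint64_t Ror64(std::uint64_t v, int r)
{
  r &= 63;  // A rotation by 64 is treated as a rotation by 0.
  return r == 0 ? v : (v >> r) | (v << (64 - r));
}

inline bool AnalyseLogicalImm64(std::uint64_t value, LogicalImmPattern* out)
{
  // All-zeros and all-ones are not encodable as logical immediates.
  if (value == 0 || ~value == 0)
    return false;

  // Skip any trailing run of ones, then find where the next run starts.
  const int rotation = Ctz64(value & (value + 1));
  const std::uint64_t normalized = Ror64(value, rotation);

  // After normalizing, the word starts with zeros and ends with ones.
  const int zeroes = Clz64(normalized);
  const int ones = Ctz64(~normalized);
  const int esize = zeroes + ones;

  // The value must repeat with period esize.
  if (Ror64(value, esize) != value)
    return false;

  // Redundant given the check above, kept only as documentation.
  assert((esize & (esize - 1)) == 0);

  out->esize = esize;
  out->ones = ones;
  out->rotation = rotation & (esize - 1);
  return true;
}
```

The assertion costs nothing in release builds, which is one way to document the invariant without shipping code that is known to be redundant.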

Heavily simplify logical immediate encoding. This is based on the observation that if a valid repeating element exists, it repeats throughout `value`. Thus it does not matter which one you analyse. Thus we skip over the least significant element if LSB = 1 by masking it out with `inverse_mask_from_trailing_ones`, to avoid the degenerate case of a stretch of 1 bits going 'round the end' of the word.

Source/Core/Common/Arm64Emitter.h

JosJuice

I didn't review the intermediate version of the algorithm (the one in the first commit), but the final version looks good. Just some style nits.

Source/UnitTests/Core/PowerPC/JitArm64/MovI2R.cpp

Source/Core/Common/Arm64Emitter.h

JosJuice · 2022-07-10T20:15:14Z

LGTM after squashing the changes in the last commit into the appropriate commits.

merryhime force-pushed the LogicalImm branch from 76fe9c8 to de9a1bd Compare July 3, 2022 17:39

dougallj reviewed Jul 7, 2022

merryhime force-pushed the LogicalImm branch from de9a1bd to dcb2269 Compare July 7, 2022 22:06

dougallj reviewed Jul 8, 2022

merryhime force-pushed the LogicalImm branch from 68a7d22 to cf99ade Compare July 9, 2022 07:18

JosJuice reviewed Jul 10, 2022

merryhime force-pushed the LogicalImm branch from d7a0053 to 581498b Compare July 10, 2022 19:08

merryhime added 3 commits July 10, 2022 22:17

UnitTests/MovI2R: Test all logical immediates

4d99506

BitUtils: Implement CountTrailingZeros

20ccc38

Arm64Emitter: Simplify LogicalImm further

0d947ed

h/t @dougallj

merryhime force-pushed the LogicalImm branch from 581498b to 0d947ed Compare July 10, 2022 21:17

JosJuice approved these changes Jul 10, 2022

JMC47 merged commit 38cb76d into dolphin-emu:master Jul 10, 2022

merryhime deleted the LogicalImm branch July 10, 2022 23:10
