CodeGen: Use more efficient lowering for UNM_* #1177

zeux · 2024-03-02T19:10:32Z

UNM_NUM and UNM_VEC were both implemented assuming SSE-style restrictions (2-argument form), but using AVX that doesn't have them. There's no need to copy source to destination separately - we can just vxorpd into destination.

Most occurrences of UNM_NUM/UNM_VEC followed the self-xor path, but this saves a couple instructions in trig benchmark and makes it execute ~0.1% fewer instructions (the actual runtime delta is within the noise).

UNM_NUM and UNM_VEC were both implemented assuming SSE-style restrictions (2-argument form), but using AVX that doesn't have them. There's no need to copy source to destination separately - we can just vxorpd into destination. Most occurrences of UNM_NUM/UNM_VEC followed the self-xor path, but this saves a couple instructions in trig benchmark and makes it execute ~0.1% fewer instructions (the actual runtime delta is within the noise).

vegorov-rbx · 2024-03-04T12:01:03Z

CodeGen/src/IrLoweringX64.cpp

-            build.vxorpd(inst.regX64, inst.regX64, build.f64(-0.0));
-        }
-
+        build.vxorpd(inst.regX64, regOp(inst.a), build.f64(-0.0));


Please flag the changes, as we are raising code flagging requirements internally.

Closing this, feel free to make this change internally.

We have added these changes internally.

vegorov-rbx reviewed Mar 4, 2024

View reviewed changes

zeux closed this Mar 4, 2024

zeux deleted the unm-opt branch March 4, 2024 16:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CodeGen: Use more efficient lowering for UNM_* #1177

CodeGen: Use more efficient lowering for UNM_* #1177

zeux commented Mar 2, 2024

vegorov-rbx Mar 4, 2024

zeux Mar 4, 2024

vegorov-rbx Mar 5, 2024

CodeGen: Use more efficient lowering for UNM_* #1177

CodeGen: Use more efficient lowering for UNM_* #1177

Conversation

zeux commented Mar 2, 2024

vegorov-rbx Mar 4, 2024

Choose a reason for hiding this comment

zeux Mar 4, 2024

Choose a reason for hiding this comment

vegorov-rbx Mar 5, 2024

Choose a reason for hiding this comment