SPU LLVM: Rearange FM instruction for better performance #9896

Whatcookie · 2021-03-04T17:57:10Z

Rearranges the FM instruction to allow the comparisons and the multiplication to be processed in parallel. This doesn't save any instructions, but still results in a speedup.

In the mandelbrot homebrew performance increased from 120 --> 122fps on my 7700K at 5ghz.

On a cpu with more out of order execution resources available, such as my i5-1135G7 at 2.6ghz performance was increased from 69 --> 74fps.

- Doesn't eliminate any instructions, but allows for better out of order execution.

Yahfz · 2021-03-04T19:14:44Z

Got a nice boost here.
149-151 -> 161
9900KS 5.3

elad335 · 2021-03-05T08:35:21Z

rpcs3/Emu/Cell/SPURecompiler.cpp

-			const auto cb = eval(bitcast<f32[4]>(bitcast<s32[4]>(b) & ma));
-			set_vr(op.rt, fm(ca, cb));
+			const auto cx = eval(ma & mb);
+			const auto x = fm(a, b);


Shouldn't you modify fm function as well? llvm expressions detection relies on this.

fm is currently just an unnecessary alias for multiplication operator

Shouldn't you modify fm function as well? llvm expressions detection relies on this.

The only time we look for the fm pattern is in is_input_positive, which should still be working since it only looks for the case when a = b

Nekotekina · 2021-03-08T12:48:24Z

rpcs3/Emu/Cell/SPURecompiler.cpp

-			const auto ca = eval(bitcast<f32[4]>(bitcast<s32[4]>(a) & mb));
-			const auto cb = eval(bitcast<f32[4]>(bitcast<s32[4]>(b) & ma));
-			set_vr(op.rt, fm(ca, cb));
+			const auto cx = eval(ma & mb);


I wonder if it could be more correct to & first, then use sext, but I won't bother trying it for now.

SPU LLVM: Rearange FM instruction for better performance

7871872

- Doesn't eliminate any instructions, but allows for better out of order execution.

elad335 reviewed Mar 5, 2021

View reviewed changes

Megamouse added the Optimization Optimizes existing code label Mar 6, 2021

Nekotekina reviewed Mar 8, 2021

View reviewed changes

Nekotekina merged commit e5d0e03 into RPCS3:master Mar 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SPU LLVM: Rearange FM instruction for better performance #9896

SPU LLVM: Rearange FM instruction for better performance #9896

Whatcookie commented Mar 4, 2021

Yahfz commented Mar 4, 2021 •

edited

elad335 Mar 5, 2021

Nekotekina Mar 5, 2021

Whatcookie Mar 5, 2021

Nekotekina Mar 8, 2021

SPU LLVM: Rearange FM instruction for better performance #9896

SPU LLVM: Rearange FM instruction for better performance #9896

Conversation

Whatcookie commented Mar 4, 2021

Yahfz commented Mar 4, 2021 • edited

elad335 Mar 5, 2021

Choose a reason for hiding this comment

Nekotekina Mar 5, 2021

Choose a reason for hiding this comment

Whatcookie Mar 5, 2021

Choose a reason for hiding this comment

Nekotekina Mar 8, 2021

Choose a reason for hiding this comment

Yahfz commented Mar 4, 2021 •

edited