SPU LLVM: add AVX-512 SPU verification #10113

Whatcookie · 2021-04-14T07:54:29Z

This PR adds a new setting, called "Full Width AVX-512". At the moment only 512 bit wide SPU verification is hidden behind this option, but in the future more code can be put behind it.

This code needs to be hidden behind a setting thanks to some cpus that downclock upon executing 512 bit wide AVX-512 code. While newer cpus like the 11900K don't experience any downclocking (see https://travisdowns.github.io/blog/2020/08/19/icl-avx512-freq.html), older cpus based on skylake-x will aggressively downclock, and the setting is better off for these cpus.

Users may also prefer this option disabled if they've overclocked their CPU with a negative AVX-512 offset.

This new setting is off by default and cannot be toggled if the users cpu doesn't support AVX-512.

For an example this reduces the code size of the largest .obj file in the mandelbrot homebrew from 29.5kb down to 27.1kb.

Megamouse · 2021-04-14T08:00:53Z

rpcs3/rpcs3qt/settings_dialog.cpp

@@ -202,6 +202,10 @@ settings_dialog::settings_dialog(std::shared_ptr<gui_settings> gui_settings, std
 	m_emu_settings->EnhanceCheckBox(ui->accurateXFloat, emu_settings_type::AccurateXFloat);
 	SubscribeTooltip(ui->accurateXFloat, tooltips.settings.accurate_xfloat);

+	m_emu_settings->EnhanceCheckBox(ui->fullWidthAVX512, emu_settings_type::FullWidthAVX512);
+	SubscribeTooltip(ui->fullWidthAVX512, tooltips.settings.full_width_avx512);
+	ui->fullWidthAVX512->setDisabled(!utils::has_avx512());


Can just say setEnabled without the !.
But I guess that's preference

Oh, I didn't notice that there's a setEnabled. I was looking just looking at the LLVMdfma option. I'll just use that instead then

Nekotekina · 2021-04-14T08:07:12Z

Hmm, do those new CPU have 512-bit bus? From my tests with asmjit (which has these paths disabled) it was significantly slower than its AVX2 counterpart. That's for skylake-x though, I don't know about newer processors.

ASMJIT has those paths disabled, I guess you can enable one of them with this setting.

Whatcookie · 2021-04-14T08:14:36Z

I think both the newer cpus and skylake-x can do two 512bit loads per clock, but I believe they can only achieve that speed if the data is in the L1 cache.

ASMJIT has those paths disabled, I guess you can enable one of them with this setting.

ok.

Nekotekina · 2021-04-14T11:01:13Z

Also new code path for LLVM is almost the same as existing one, can you change existing to be more "variable" instead? In a sense of not hardcoding vector size (could be 128, 256, 512, maybe 1024 in future).

Whatcookie · 2021-04-15T07:16:51Z

Hmm, I'm not really sure on how to make the code more "variable" in a clean way.
For example the size of the shufflevector is based off of the size of the indices array. I can copy to smaller arrays as needed, but the code becomes full of branches and is difficult to read as a result.

Is there something obvious I'm missing?

Nekotekina · 2021-04-15T07:18:30Z

Ah, ArrayRef constructor... I believe you can specify length manually as a second argument, just checked.

- This is hidden behind a new setting, as some cpus may downclock agressively when executing 512 wide instructions

Whatcookie · 2021-04-15T23:39:49Z

should be good now

Megamouse reviewed Apr 14, 2021

View reviewed changes

Whatcookie force-pushed the spu-512 branch from d57a508 to ca508ea Compare April 14, 2021 08:06

Whatcookie force-pushed the spu-512 branch from ca508ea to a81309e Compare April 14, 2021 08:19

SPU LLVM: add AVX-512 SPU verification

b94e57b

- This is hidden behind a new setting, as some cpus may downclock agressively when executing 512 wide instructions

Whatcookie force-pushed the spu-512 branch from 8635e23 to b94e57b Compare April 15, 2021 22:31

Nekotekina merged commit 0a7df9d into RPCS3:master Apr 16, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SPU LLVM: add AVX-512 SPU verification #10113

SPU LLVM: add AVX-512 SPU verification #10113

Whatcookie commented Apr 14, 2021

Megamouse Apr 14, 2021

Whatcookie Apr 14, 2021

Nekotekina commented Apr 14, 2021 •

edited

Whatcookie commented Apr 14, 2021

Nekotekina commented Apr 14, 2021

Whatcookie commented Apr 15, 2021

Nekotekina commented Apr 15, 2021

Whatcookie commented Apr 15, 2021

SPU LLVM: add AVX-512 SPU verification #10113

SPU LLVM: add AVX-512 SPU verification #10113

Conversation

Whatcookie commented Apr 14, 2021

Megamouse Apr 14, 2021

Choose a reason for hiding this comment

Whatcookie Apr 14, 2021

Choose a reason for hiding this comment

Nekotekina commented Apr 14, 2021 • edited

Whatcookie commented Apr 14, 2021

Nekotekina commented Apr 14, 2021

Whatcookie commented Apr 15, 2021

Nekotekina commented Apr 15, 2021

Whatcookie commented Apr 15, 2021

Nekotekina commented Apr 14, 2021 •

edited