SPU LLVM: Improve runtime SPU compilation preferences #15250

elad335 · 2024-02-27T09:05:56Z

Prefer using inactive worker threads to ompile new SPU blocks for maximum concurrency.
Postpone thread notifications to when block queue is drained so all threads would start at once, not delaying the managing thread to push more blocks.
If more than one block has the same use count as others, apply for compilation first the one which has been queued earlier. Similarly to distance / time = speed, here the time here is compared (estimation) for rate of block usage.

Some of these affect only CPUs with 12 or more threads at the current implementation, point 3 affects all.

Tests: Tested to improve massively the performance of SPU LLVM ingame compilation for Red Dead Redemption on 14600KF

Megamouse · 2024-02-27T11:57:25Z

Instead of just making changes that supposedly are faster, how about adding some benchmarks for once, so people can actually grasp the benefits when seeing auch a PR?

Megamouse · 2024-02-27T12:00:08Z

Utilities/lockless.h

@@ -390,9 +390,19 @@ class lf_queue final
 			item->m_link = load(oldv);
 		}

-		if (!oldv)
+		if (!oldv && Notify)


Wouldn't this reduce instructions if it was behind if constexpr Notify?

Not really, compilers detect that Notify is known. if-constexpr is not technically for optimizations, but to avoid compiling code that would otherwise not compile for template functions.

elad335 · 2024-02-27T13:19:03Z

Instead of just making changes that supposedly are faster, how about adding some benchmarks for once, so people can actually grasp the benefits when seeing auch a PR?

Need someone with 12 or more threads to test this on empty SPU cache.

elad335 · 2024-02-27T13:21:50Z

I guess I'll also make it possible to test its performance somewhat accurately mid-game, because the only way the user known now is how much stuttery the game is while compiling.

readywer · 2024-02-27T13:38:13Z

Instead of just making changes that supposedly are faster, how about adding some benchmarks for once, so people can actually grasp the benefits when seeing auch a PR?

Need someone with 12 or more threads to test this on empty SPU cache.

I have a 13600k later today i will try to compere the performace to master in some games(GT5,6, R&C games).

elad335 · 2024-02-27T14:47:32Z

keep in mind it's not an FPS test, it's to test how relatively long SPU compilation takes (those green "compiled block successfully messages")

EmulationChannel · 2024-02-27T20:56:04Z

@elad335 I tested 14600KF 14 /20 THE LAST OF US and RED DEAD REDEMPTION very quickly SPU CACHE

SPU LLVM: Improve runtime SPU compilation preferences

d026c11

elad335 added the LLVM Related to LLVM instruction decoders label Feb 27, 2024

elad335 force-pushed the hdd1 branch from 32767ce to d026c11 Compare February 27, 2024 09:06

elad335 added 2 commits February 27, 2024 11:56

Update SPUCommonRecompiler.cpp

0c30f42

Update SPUCommonRecompiler.cpp

4ba67f6

Megamouse reviewed Feb 27, 2024

View reviewed changes

Update SPUCommonRecompiler.cpp

c0a7dc9

Merge branch 'master' into hdd1

53384fd

elad335 merged commit 75ef154 into RPCS3:master Feb 28, 2024
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SPU LLVM: Improve runtime SPU compilation preferences #15250

SPU LLVM: Improve runtime SPU compilation preferences #15250

elad335 commented Feb 27, 2024 •

edited

Megamouse commented Feb 27, 2024

Megamouse Feb 27, 2024

elad335 Feb 27, 2024

elad335 commented Feb 27, 2024

elad335 commented Feb 27, 2024

readywer commented Feb 27, 2024

elad335 commented Feb 27, 2024

EmulationChannel commented Feb 27, 2024 •

edited

SPU LLVM: Improve runtime SPU compilation preferences #15250

SPU LLVM: Improve runtime SPU compilation preferences #15250

Conversation

elad335 commented Feb 27, 2024 • edited

Megamouse commented Feb 27, 2024

Megamouse Feb 27, 2024

Choose a reason for hiding this comment

elad335 Feb 27, 2024

Choose a reason for hiding this comment

elad335 commented Feb 27, 2024

elad335 commented Feb 27, 2024

readywer commented Feb 27, 2024

elad335 commented Feb 27, 2024

EmulationChannel commented Feb 27, 2024 • edited

elad335 commented Feb 27, 2024 •

edited

EmulationChannel commented Feb 27, 2024 •

edited