[Test] Big atomic cleanup and futex_waitv support for Linux #14403

Nekotekina · 2023-07-31T21:06:38Z

Some time ago I noticed when profiling on Linux that atomic waiting implementation may be possibly inefficient. A huge portion of the CPU time was outside of the underlying futex syscall but inside the atomic wait routines. But what atomic wait does is very similar to futex. Hence the idea to use futex directly but this would need to remove some superfluous features from atomic wait support. I'm not sure how it will work out. Please test for regressions.

Megamouse · 2023-07-31T21:09:26Z

rpcs3/util/media_utils.h

@@ -77,8 +77,8 @@ namespace utils
 		std::vector<u8> data;
 		atomic_t<u64> m_size = 0;
 		atomic_t<u64> duration_ms = 0;
-		atomic_t<bool> track_fully_decoded{false};
-		atomic_t<bool> track_fully_consumed{false};


I don't like this. Can't you make it opaque to the user (developer) with some template magic?

I don't think it's possible to change atomic bool to use 32-bit storage without breaking some PS3 struct (possibly in future as well)

Megamouse · 2023-07-31T21:19:57Z

rpcs3/headless_application.cpp

@@ -60,8 +60,7 @@ void headless_application::InitializeCallbacks()

 		return false;
 	};
-	callbacks.call_from_main_thread = [this](std::function<void()> func, atomic_t<bool>* wake_up)
-	{
+	callbacks.call_from_main_thread = [this](std::function<void()> func, atomic_t<u32>* wake_up) {


this is exactly why the clang-format is the way it is right now.

I tweaked it for the lesser evil, because otherwise the whole lambda gets shifted right.

But that's fine. GalCiv and I weighed the pro's and cons last time we updated the clang-format.
Having the indentation in a lambda or not is both allowed and not nitpicked at the moment.

There are some bugs in clangformat with the inline version.
There are worse format issues that happen in different cases.
I don't remember the details, but I remember that it was easier to use the current settings all things considered.

llvm/llvm-project#54827

Ok, I'll remove AllowShortLambdasOnASingleLine. The other tweak should be fine though?

Idk. I'd have to test with existing code

elad335 · 2023-08-01T13:12:40Z

Utilities/lockless.h

 		{
-			m_head.template wait<Flags>(nullptr);
+			utils::bless<atomic_t<u32>>(&m_head)[1].wait(0);


Just like lf_queue has a specalization for waiting on nullptr, so do pointers IMO.

solarmystic · 2023-08-01T15:29:26Z

Just simply booting up rpcs3 with this test pr build causes some irregular log entries that is not typically seen on the master build. Enumeration of PC Specs would occur without any of those entries on master in a single block of information.

Test build

Master for comparison

Besides that observation, tested Persona 5 and found no significant difference in performance between Master and PR.

Master - 105/80/68 FPS (Average/1%/0.1% FPS)

Screenshot

Test build - 106/80/69 FPS (Average/1%/0.1% FPS)

Screenshot

Nekotekina · 2023-08-01T16:41:30Z

Updated, please retest

solarmystic · 2023-08-01T21:42:29Z

Updated, please retest

It just produces this Fatal Error on boot now in Windows.

solarmystic · 2023-08-01T23:13:45Z

Latest commit boots up fine. System Specs enumeration is back to normal now.

Persona 5 performance remains unaffected compared to master at 106 FPS on average.

Screenshot

Nekotekina · 2023-08-02T16:25:04Z

This PR includes potential fix for arm64 architecture related to incorrect pointer size assumptions.

Prevents implementing thread priority on Linux.

In order to make this possible, some unnecessary features were removed.

oltolm · 2023-08-02T19:37:26Z

3rdparty/llvm/CMakeLists.txt

@@ -14,8 +14,7 @@ if(WITH_LLVM)
 		option(LLVM_INCLUDE_TESTS OFF)
 		option(LLVM_INCLUDE_TOOLS OFF)
 		option(LLVM_INCLUDE_UTILS OFF)
-		# we globally enable ccache
-		set(LLVM_CCACHE_BUILD OFF CACHE BOOL "Set to ON for a ccache enabled build")
+		option(LLVM_CCACHE_BUILD ON)


Why did you set LLVM_CCACHE_BUILD to ON? On Linux the command line for LLVM files looks like this:

ccache ccache c++ ...

ccache is called twice.

Somehow it worked fine until the changes in LLVM_CCACHE_BUILD, afterwards it started to rebuild llvm after unrelated changes.

cipherxof · 2023-08-19T06:42:33Z

I'm just posting this here for future reference. This PR seems to have massively boosted performance for MGS4 under Linux. In some cases I'm gaining +40fps.

(custom build here, same results on master though)

cipherxof · 2023-08-20T20:23:40Z

Seeing improvements in RDR as well. Granted, I am assuming this PR is what caused these gains because I'm not seeing the improvements in Windows and I was unable to test linux until a recent crash regression was fixed.

Nekotekina · 2023-08-20T20:46:24Z

@cipherxof what is your distro/kernel? Also you can try to disable futex_waitv in atomic.cpp and compare

cipherxof · 2023-08-20T22:46:49Z

@cipherxof what is your distro/kernel? Also you can try to disable futex_waitv in atomic.cpp and compare

OS: Arch Linux
Kernel: x86_64 Linux 6.2.6-273-tkg-cfs

This PR is definitely the reason for the performance gains. However, even with it disabled I am still getting slightly better performance than I was previously. For some reason my frame times are spiking more than before, although that may just be something with my system so I'll need to re-compile an old custom build to test that.

futex_waitv disabled

futex_waitv enabled

Another thing worth noting (bare with me here) is that before this PR, Metal Gear Online required some specific changes in order have playable framerates. Now, with futex_waitv enabled I no longer need these changes and the game performs even better now. The problem with this of course is that this change only affects Linux.

Metal Gear Online

without the changes above:

with the changes:

futex_waitv enabled:

Nekotekina · 2023-08-20T23:24:47Z

@cipherxof do you use mitigations=off on Linux by any chance?

cipherxof · 2023-08-20T23:39:27Z

@cipherxof do you use mitigations=off on Linux by any chance?

I do, yes.

I also re-tested with a different custom kernel (Liquorix) and my frametimes are back to normal 👍

Megamouse reviewed Jul 31, 2023

View reviewed changes

Nekotekina force-pushed the typei branch 3 times, most recently from 1095e61 to b0b9146 Compare August 1, 2023 13:05

elad335 reviewed Aug 1, 2023

View reviewed changes

Nekotekina force-pushed the typei branch from b0b9146 to 4bee0c2 Compare August 1, 2023 16:24

Nekotekina force-pushed the typei branch from 4bee0c2 to c89d148 Compare August 1, 2023 22:15

Nekotekina force-pushed the typei branch from c89d148 to 5891f67 Compare August 2, 2023 10:53

Nekotekina added 3 commits August 2, 2023 19:26

Don't require Qt 6.4.0 (works with 6.2.4)

0bf1936

Reset broken LLCM_CCACHE_BUILD change

6061b12

Remove thread pool

380e991

Prevents implementing thread priority on Linux.

Nekotekina force-pushed the typei branch 2 times, most recently from 3ad8195 to 51b8e64 Compare August 2, 2023 17:05

Linux: use futex_waitv syscall for atomic waiting

ba3a16d

In order to make this possible, some unnecessary features were removed.

Nekotekina force-pushed the typei branch from 51b8e64 to ba3a16d Compare August 2, 2023 17:24

Nekotekina merged commit d34287b into RPCS3:master Aug 2, 2023
5 checks passed

oltolm reviewed Aug 2, 2023

View reviewed changes

MsDarkLow mentioned this pull request Aug 2, 2023

[Regression] Gran Turismo 5 and possibly other games crashing since #14403 #14413

Closed

kd-11 mentioned this pull request Aug 3, 2023

Build fails on Linux because the struct futex_waitv in Utilites/sync.h is redefined in linux/futex.h #14417

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Test] Big atomic cleanup and futex_waitv support for Linux #14403

[Test] Big atomic cleanup and futex_waitv support for Linux #14403

Nekotekina commented Jul 31, 2023

Megamouse Jul 31, 2023 •

edited

Nekotekina Jul 31, 2023

Megamouse Jul 31, 2023

Nekotekina Jul 31, 2023

Megamouse Jul 31, 2023

Megamouse Jul 31, 2023

Nekotekina Aug 1, 2023

Megamouse Aug 1, 2023

elad335 Aug 1, 2023 •

edited

solarmystic commented Aug 1, 2023 •

edited

Nekotekina commented Aug 1, 2023

solarmystic commented Aug 1, 2023

solarmystic commented Aug 1, 2023

Nekotekina commented Aug 2, 2023

oltolm Aug 2, 2023

Nekotekina Aug 4, 2023

cipherxof commented Aug 19, 2023

cipherxof commented Aug 20, 2023 •

edited

Nekotekina commented Aug 20, 2023

cipherxof commented Aug 20, 2023

Nekotekina commented Aug 20, 2023

cipherxof commented Aug 20, 2023

[Test] Big atomic cleanup and futex_waitv support for Linux #14403

[Test] Big atomic cleanup and futex_waitv support for Linux #14403

Conversation

Nekotekina commented Jul 31, 2023

Megamouse Jul 31, 2023 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

elad335 Aug 1, 2023 • edited

Choose a reason for hiding this comment

solarmystic commented Aug 1, 2023 • edited

Nekotekina commented Aug 1, 2023

solarmystic commented Aug 1, 2023

solarmystic commented Aug 1, 2023

Nekotekina commented Aug 2, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cipherxof commented Aug 19, 2023

cipherxof commented Aug 20, 2023 • edited

Nekotekina commented Aug 20, 2023

cipherxof commented Aug 20, 2023

Nekotekina commented Aug 20, 2023

cipherxof commented Aug 20, 2023

Megamouse Jul 31, 2023 •

edited

elad335 Aug 1, 2023 •

edited

solarmystic commented Aug 1, 2023 •

edited

cipherxof commented Aug 20, 2023 •

edited