Improve readyqueue perf, rapid fire, small improvements #651

unknownbrackets · 2013-02-11T07:20:22Z

This does the following:

Adds very basic rapid fire when holding shift (e.g. hold shift+tab+z.) Windows keyboard only.
Adds an atomic check variable around MoveEvents(), since threadsafe events are rare. I had trouble verifying this improved perf, but @KentuckyCompass's fps counter shows a reproducible improvement of maybe 2%.
Moves threadReadyQueue back to a std::vector. It did show up in profile, after all. I opted to create a simple wrapper class.
Avoids some slow runtime type casts / other minor stuff.
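
For readers following along, here is a minimal sketch of what a std::vector wrapper with push/pop/remove could look like. It is not the code from this PR: `ReadyQueue`, `ThreadID`, and the method names are hypothetical, and the real class may differ in ordering and priority handling.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a kernel thread ID; the real type is not shown here.
using ThreadID = std::uint32_t;

// Sketch of a ready queue backed by std::vector (assumed FIFO order).
class ReadyQueue {
public:
    // Appends a thread to the back of the queue.
    void push_back(ThreadID id) { ids_.push_back(id); }

    // Removes and returns the oldest entry. The caller must check empty() first.
    ThreadID pop_front() {
        assert(!ids_.empty());
        ThreadID id = ids_.front();
        ids_.erase(ids_.begin());
        return id;
    }

    // Removes the first occurrence of id. Returns false if it was not queued.
    bool remove(ThreadID id) {
        auto it = std::find(ids_.begin(), ids_.end(), id);
        if (it == ids_.end())
            return false;
        ids_.erase(it);
        return true;
    }

    bool empty() const { return ids_.empty(); }
    std::size_t size() const { return ids_.size(); }

private:
    std::vector<ThreadID> ids_;
};
```

Erasing from the front of a vector is linear, but for the handful of threads a ready queue usually holds, contiguous storage tends to beat node-based containers, which would be consistent with the profile result mentioned above.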

This significantly improves (mostly because of rapid fire, heh) the time it takes to get through the intro of Hexyz Force. Overall, in-game fps improves by ~5% on a Windows release build once past the intro.

sceKernelThread.cpp is kinda big, I'm wondering if breaking callbacks and maybe some things out into another file is realistic... I guess it's not hurting anyone, though...

-[Unknown]

raven02 · 2013-02-11T07:58:47Z

@unknownbrackets, should we expect Sol Trigger to work again with this commit? :)

unknownbrackets · 2013-02-11T08:01:11Z

No. I think I know why though. I think sceUtilityLoadModule() needs to eat cycles AND reschedule while doing so (e.g. because it's reading the module from umd/etc.) It's happening to some Tales games and a few others too.

I managed to get Tales of Destiny 2 farther but it still crashes, so I'm not sure I did it right. sceKernelCreateThread() will need to do it too, among a few others and probably IO funcs.

-[Unknown]

hrydgard · 2013-02-11T09:09:30Z

Agree that sceKernelThread is growing a bit too big, but it contains a lot of heavily interconnected functionality.

So we need to think carefully about which parts to break out and how their external interfaces should look.

hrydgard · 2013-02-11T09:11:30Z

Core/CoreTiming.cpp

@@ -66,6 +67,8 @@ struct BaseEvent
 Event *eventPool = 0;
 Event *eventTsPool = 0;
 int allocatedTsEvents = 0;
+// Optimization to skip MoveEvents when possible.
+volatile u32 hasTsEvents = false;


Shouldn't need to use volatile and atomic stores/loads as all the stuff touching this will always run on the main emulation thread. If it doesn't, we have other problems..

Well, the whole point was to make ts (threadsafe) events faster. Those are ones scheduled from off thread, like savestates. Actually, savestates are the only consumer currently.

So before, it kept calling MoveEvents(), creating a mutex (which isn't slow, but even so), and checking tsFirst and allocatedTsEvents. I wanted to cut that down to a single check, with no mutex and no function call. It's a small gain, but when fast forwarding it adds up, and Advance is called fairly often.

-[Unknown]
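
The idea described above can be sketched roughly as follows. This is only an illustration under assumptions: it uses C++11 `std::atomic` and `std::mutex` rather than the project's own atomic helpers, and `TsEventQueue`, `ScheduleThreadsafe`, and `lockedMoves` are hypothetical names. Events are plain ints to keep it short.

```cpp
#include <atomic>
#include <cassert>
#include <mutex>
#include <vector>

// Hypothetical, simplified model of the threadsafe event path.
struct TsEventQueue {
    std::mutex lock;
    std::vector<int> pending;    // events scheduled from other threads
    std::vector<int> mainQueue;  // events owned by the emulation thread
    std::atomic<bool> hasPending{false};
    int lockedMoves = 0;         // counts slow-path runs, for illustration only

    // May be called from any thread.
    void ScheduleThreadsafe(int ev) {
        std::lock_guard<std::mutex> guard(lock);
        pending.push_back(ev);
        // Set while holding the lock so MoveEvents() cannot miss the event.
        hasPending.store(true, std::memory_order_release);
    }

    // Called very often on the emulation thread.
    void Advance() {
        // Fast path: one atomic load, no mutex, no call into MoveEvents().
        if (!hasPending.load(std::memory_order_acquire))
            return;
        MoveEvents();
    }

    // Slow path: takes the lock and drains the pending events.
    void MoveEvents() {
        std::lock_guard<std::mutex> guard(lock);
        ++lockedMoves;
        mainQueue.insert(mainQueue.end(), pending.begin(), pending.end());
        pending.clear();
        hasPending.store(false, std::memory_order_release);
    }
};
```

Because the flag is only written while the mutex is held, a stale `false` on the fast path just delays the move until the next `Advance()` call; it does not lose the event.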

Oh right, sorry, it's me who's confused here. Never mind my comment.

unknownbrackets · 2013-02-11T09:15:44Z

Right... the only way really would be for sceKernelThread to expose more one way or another, unfortunately.

-[Unknown]

unknownbrackets · 2013-02-11T09:26:15Z

Windows/KeyboardDevice.cpp

@@ -26,8 +26,12 @@
 };

 int KeyboardDevice::UpdateState() {
+	bool alternate = GetAsyncKeyState(VK_SHIFT) != 0;
+	static int alternator = 0;


Oops, this should be unsigned.

-[Unknown]
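
A small portable sketch of why the counter wants to be unsigned, and how an alternator can drive rapid fire. This is not the PR's code: `ShouldReportPressed` and the `period` parameter are hypothetical, and the Windows key query is replaced by a plain `rapidFire` flag so the example runs anywhere.

```cpp
#include <cassert>

// Hypothetical helper: decides whether a held button is reported as pressed
// on this update.
//   held       - the button is physically down
//   rapidFire  - the rapid fire modifier (shift in this PR) is down
//   alternator - running counter, advanced once per rapid fire update
//   period     - assumed number of updates per on (or off) phase
bool ShouldReportPressed(bool held, bool rapidFire, unsigned &alternator,
                         unsigned period = 4) {
    assert(period > 0);
    if (!held)
        return false;
    if (!rapidFire)
        return true;
    // Unsigned arithmetic wraps around with defined behavior, whereas
    // overflowing a signed int is undefined behavior in C++.
    bool on = (alternator / period) % 2 == 0;
    ++alternator;
    return on;
}
```

With `period = 2`, a held button under rapid fire reports on, on, off, off, on, and so on, which the emulated game sees as repeated presses.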

xsacha · 2013-02-11T10:19:21Z

So those atomic functions aren't working on many ARM devices.
They only work #ifdef HAVE_GCC_INT_ATOMICS afaik.

hrydgard · 2013-02-11T10:30:21Z

Are there alternatives?

unknownbrackets · 2013-02-11T15:28:28Z

I think we could just use __sync_synchronize() potentially on those devices if that still exists?

Otherwise, we could just revert the change, although it did improve perf and would be nice to have.

-[Unknown]
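
A rough sketch of the `__sync_synchronize()` fallback floated above, assuming a GCC-compatible compiler. `FullBarrier`, `StoreFlag`, and `LoadFlag` are hypothetical names. Note the big assumption: a volatile variable plus barriers is a legacy pattern that relies on aligned 32-bit loads and stores being atomic on the target, which the C++ standard does not guarantee.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>

// Full memory barrier. __sync_synchronize() is a GCC/Clang builtin; other
// compilers fall back to a C++11 fence in this sketch.
inline void FullBarrier() {
#if defined(__GNUC__) || defined(__clang__)
    __sync_synchronize();
#else
    std::atomic_thread_fence(std::memory_order_seq_cst);
#endif
}

// Assumption: an aligned 32-bit store is a single access on the target CPU.
inline void StoreFlag(volatile std::uint32_t &flag, std::uint32_t value) {
    FullBarrier();  // earlier writes become visible before the flag changes
    flag = value;
    FullBarrier();  // the flag change is published before later accesses
}

// Assumption: an aligned 32-bit load is a single access on the target CPU.
inline std::uint32_t LoadFlag(const volatile std::uint32_t &flag) {
    std::uint32_t value = flag;
    FullBarrier();  // later reads are not reordered before the flag read
    return value;
}
```

A full barrier on every access is heavier than a proper atomic load, so whether this keeps the perf gain on ARM would need measuring.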

hrydgard reviewed Feb 11, 2013

unknownbrackets reviewed Feb 11, 2013

unknownbrackets added 7 commits February 11, 2013 01:27

Add very simple rapid fire for Windows keyboard.

e8e9f7f

Mostly to speed up debugging.

Move currentThread init to a better place.

fd1c686

Move running thread resched to __KernelNextThread.

9a5589a

Pretty sure this is needed, but apparently it breaks Sol Trigger.

Spend less time moving ts events in CoreTiming.

da5026e

Wake delayed threads directly, rather than looping.

f552cb3

This saves ~1% during fast forward on a release build.

Add a std::vector wrapper to do remove/pop/push.

6ca1cad

It showed up in a profile after all. Cut down more than 1%.

Minor perf gain in __KernelNextThread.

537fbe4

Just like .1% but was hoping Mr. Optimizer would do this for me.

hrydgard added a commit that referenced this pull request Feb 11, 2013

Merge pull request #651 from unknownbrackets/perf

480592c

Improve readyqueue perf, rapid fire, small improvements

hrydgard merged commit 480592c into hrydgard:master Feb 11, 2013

unknownbrackets deleted the perf branch February 11, 2013 09:34
