-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[OpenGL] Very slow effects [Grisaia no Kajitsu] #16307
Comments
Please create a frame dump |
@sum2012 I've already done that. |
This scene draws a series of 246 horizontal strips to produce a faded effect. The right way to do this would probably have been using vertex colors (they already used modulate) and simple alpha blending, but alas. The later draws use FIXED + FIXED blending. This should be translated to ONE + FIXED with a shader uniform... and at least for me, this is exactly what happens. It's definitely not efficient that it does 246 separate draw calls, each with a uniform update in between, but I'm surprised it's so significantly slow. -[Unknown] |
If you load up the GE frame dump yourself as if you loaded a game, does the slowdown reproduce? |
@hrydgard I can reproduce the slowdown when running the dump. But the result is a bit strange. I'm not sure. For some reason the dump runs very slowly even with the Vulkan backend. And the speed is exactly the same as with the OpenGL. It's weird, it shouldn't be like this. And also the speed is different. When I run the dump, the speed is about 16%. But in the game about 3-6%. |
The frame dump has a different performance problem. Even on PC it is slow. For some reason, it's marking the texture as "dirty" each time, so PPSSPP is recopying the 512x512x4 texture data for every single one of those 246 draws. That's 246 MB of copying per frame, and takes its toll. The texture data is identical so this is some kind of bug. But in any case, that it's faster (16%) may indicate something isn't being captured by the frame dump which is making the actual scene slower... -[Unknown] |
After #16321 is merged, try this frame dump: I suspect this one will be fast for you in both OpenGL and Vulkan. If that's the case, this might be a performance issue the frame dump didn't capture - there might be a readback or other operation, perhaps? Or something about how it's stalling. -[Unknown] |
@unknownbrackets But I'm pretty sure the dump didn't capture the issue and it's slow because of something else. It is definitely not right that the speed with OpenGL and Vulkan is exactly the same. |
In case you want to test it in-game, here are the save files. Then press the circle button to play the transition effect. |
By the way, I completely forgot to mention. I also tested on AMD graphics, and I did not notice any significant slowdown. So maybe the problem is specific to android, or maybe the desktop GPU is powerful enough to handle it. Not sure. |
Strange. The edited frame dump runs at 1800% (GLES) and 2000% (Vulkan) speed on my phone, though it's Adreno not Mali. That's with the latest git build. Unfortunately I don't have this particular game. But that it's fine in Vulkan (outside the frame dump) makes it hard to think of anything it could be outside graphics rendering or graphics driver. Nothing is really showing up in the debug statistics that takes that long either. -[Unknown] |
@unknownbrackets I'm really sorry for taking your time, I think I initially tested the version before these changes. I'm so sorry 🙇♂️ I confirm, now the edited dump runs without slowdown, 350% with OpenGL and 500% with Vulcan. The original dump is still slow. |
Just for comparison, the game runs at about 1000% and drops to 3-4% when the transition effects occur. |
Does the setting "Skip GPU Readbacks" change performance (or change the graphical results)? |
@hrydgard No change in performance, graphics look correct. |
I guess it has to be something different happening than what the framedump does, perhaps with stalls/flushing. Maybe the GL driver is building a new pipeline for each constant FIXED value from the shader uniform? Not sure why it wouldn't replicate in the frame dump if it was that simple, though... -[Unknown] |
I tested it now with software rendering (OpenGL). There is no slowdown. The game runs at 50% speed (90% if you hide the text window). During transition effect the speed increases to 95%, because the text disappears. |
Game or games this happens in
ULJM06233 - グリザイアの果実 -LE FRUIT DE LA GRISAIA
What area of the game
When the fade transition effect occurs, the game becomes very slow.
Speed seen in PPSSPP
4% (3/60)
GE frame capture and debug statistics
GE frame: ULJM06232_0001.zip
Platform
Android
Mobile phone model or graphics card
Mali-G31
PPSSPP version affected
v1.13.2-1705
Last working version
Graphics backend (3D API)
OpenGL / GLES
Any other notes or things you've tried
Runs very smoothly in Vulkan rendering mode.
Checklist
The text was updated successfully, but these errors were encountered: