SPU LLVM: improve performance in some situations #5882
Regression on [BCES01893] Gran Turismo 6 using SPU LLVM. (Log and SPU log attached.)
Updated. Please check whether the crash is gone.
Working fine now!
With SPU LLVM and approximate xfloat (not accurate), I saw no performance gain in Persona 5, WipEout HD, Red Dead Redemption, or Sly Cooper: Thieves in Time. I gained ~2 fps in Ni No Kuni with accurate xfloat ON. (Note: accurate xfloat is only needed to fix the color of Oliver's cape in Ni No Kuni. The issue is extremely minor and not worth turning accurate xfloat on for.)
I tested the sample you used in 3 different spots (zoomed in twice, and default). With accurate xfloat, performance is often double that of current master. Without accurate xfloat, however, I noticed a drop in fps in the 2nd spot; the other spots seem to be within the margin of error. These are the fps I got for each spot.
Tried this on Persona 5 and saw a minor performance boost in some places with SPU LLVM and accurate xfloat: 13 fps before, 18 fps after.
Gran Turismo 6 1.05 [BCUS98296]
Noticeable improvement (~10%) with accurate xfloat, but no improvement at all when it is off. I will add more tracks as I test.
Ninja Gaiden Sigma 2 improved from 51 fps to 58 fps (non-accurate xfloat).
I see almost a 10% speed gain in Journey (from 12.5 to 13.5 fps, non-accurate xfloat). Win10, i7 7700, GTX 1060.
The House of the Dead 4 doesn't boot anymore with this PR.
Confirmed. HOTD4 hangs on boot.
16b44d9 to f1c094f
Implement remaining instructions. Implement match_expr method. Implement helper methods.
Register information about register accesses.
50300ae to 0c1e554
Use a trick in check_state to improve LICM pass.
Refactor SPU analyser (block_info struct). Fill register use info (currently unused).
Bug in the analyser was created recently in RPCS3#5882.
I have noticed a huge stability improvement in P5, but I don't know whether it comes from #5895 or from this PR. Keep up the good work!
Huge performance improvement in Diablo 3: from 29 fps to 38 fps in the Stronghold area! Everywhere else it's always 60 fps!
For example, before (with accurate xfloat):
After: