OpcodeDecoding: Cache vertex sizes #11067

K0bin · 2022-09-15T14:25:26Z

I struggled a bit to find a place where to put the cache but I'm pretty happy with having it in the callback.
The hit rate came out at 97% in Mario Galaxy. (I hooked it up to the stats HUD for testing.)

Together with #11066 Mario Galaxy goes from 85 fps to 140 fps on the hub world. (5900X, downclocked to 2.2 GHz)

K0bin · 2022-09-15T21:54:01Z

The new version just uses the vertex size that's stored in the VertexLoader if that's available.
Works for 99% of all calls in Mario Galaxy.

Spotted by Pokechu22.

Source/Core/VideoCommon/OpcodeDecoding.cpp

Source/Core/VideoCommon/VertexLoaderManager.cpp

Nerboruto · 2022-09-17T12:47:38Z

could it bring some improvement in other games like fzero or metroid prime?

K0bin · 2022-09-17T12:48:46Z

I think @JMC47 tried it with those and it was between 2 and 5% faster.

K0bin · 2022-09-18T14:29:45Z

Ready for another round of reviews.

Source/Core/VideoCommon/VertexLoaderManager.cpp

Source/Core/VideoCommon/VertexLoaderManager.h

iwubcode

Untested but code wise LGTM. Great work @K0bin !!

Source/Core/VideoCommon/OpcodeDecoding.h

Source/Core/VideoCommon/VertexLoaderManager.h

K0bin · 2022-09-18T23:14:01Z

sigh

Either my local clang-format doesn't pick up the Dolphin config file or the CI bot has a different set of rules.

Pokechu22

Looks good to me. Most of my testing was on single core which didn't show much of an improvement (possibly because of other accuracy settings I have enabled), but I saw a lot larger of an improvement on dual-core. Thanks!

Regarding clang-format: I use MSVC's built-in formatting functionality, which picks up on the config file properly. I think there's also different behavior depending on the version of clang-format in use; I believe clang-format-9 is in use (and different versions behave differently with the same config).

theofficialgman · 2022-09-19T22:44:09Z

I've tested this and the preceeding PR
unfortunatly no FPS improvement seen here (nintendo switch, tegra-x1 quad core cortex A57 @ 2Ghz). 25FPS in that scene before and after the PRs

badkarma12 · 2022-09-20T06:05:14Z

Small boost to Pokemon Colosseum/xd on scenes with shadow pokemon aura and moves like sunny day.

K0bin mentioned this pull request Sep 15, 2022

Optimize GetVertexSize #11066

Merged

K0bin force-pushed the cache-vertex-size branch 3 times, most recently from eb39450 to dd3375e Compare September 15, 2022 21:52

VertexLoaderManager: Fix backwards preprocess check

a31e36a

Spotted by Pokechu22.

K0bin force-pushed the cache-vertex-size branch from dd3375e to e0d323b Compare September 15, 2022 21:57

Pokechu22 reviewed Sep 15, 2022

View reviewed changes

Source/Core/VideoCommon/OpcodeDecoding.cpp Outdated Show resolved Hide resolved

K0bin force-pushed the cache-vertex-size branch from e0d323b to ea20fb8 Compare September 15, 2022 23:22

iwubcode reviewed Sep 16, 2022

View reviewed changes

Source/Core/VideoCommon/VertexLoaderManager.cpp Outdated Show resolved Hide resolved

iwubcode reviewed Sep 16, 2022

View reviewed changes

Source/Core/VideoCommon/VertexLoaderManager.cpp Show resolved Hide resolved

iwubcode reviewed Sep 16, 2022

View reviewed changes

Source/Core/VideoCommon/VertexLoaderManager.cpp Outdated Show resolved Hide resolved

K0bin force-pushed the cache-vertex-size branch from ea20fb8 to 8b64298 Compare September 17, 2022 12:22

K0bin force-pushed the cache-vertex-size branch from 8b64298 to edf7dd8 Compare September 18, 2022 14:16

K0bin force-pushed the cache-vertex-size branch from edf7dd8 to ededfe1 Compare September 18, 2022 17:56

iwubcode reviewed Sep 18, 2022

View reviewed changes

Source/Core/VideoCommon/VertexLoaderManager.cpp Show resolved Hide resolved

iwubcode reviewed Sep 18, 2022

View reviewed changes

Source/Core/VideoCommon/VertexLoaderManager.h Outdated Show resolved Hide resolved

K0bin force-pushed the cache-vertex-size branch from ededfe1 to 3be90b6 Compare September 18, 2022 19:20

iwubcode approved these changes Sep 18, 2022

View reviewed changes

Pokechu22 reviewed Sep 18, 2022

View reviewed changes

Source/Core/VideoCommon/OpcodeDecoding.h Outdated Show resolved Hide resolved

K0bin force-pushed the cache-vertex-size branch 2 times, most recently from 907b5d7 to b603c91 Compare September 18, 2022 20:10

Pokechu22 reviewed Sep 18, 2022

View reviewed changes

Source/Core/VideoCommon/VertexLoaderManager.h Show resolved Hide resolved

Pokechu22 reviewed Sep 18, 2022

View reviewed changes

Source/Core/VideoCommon/VertexLoaderManager.h Show resolved Hide resolved

K0bin force-pushed the cache-vertex-size branch 2 times, most recently from a80f5a0 to eb55bd2 Compare September 18, 2022 23:06

K0bin added 2 commits September 19, 2022 01:14

VertexLoaderManager: Clean up and slightly speed up with templates

a6c6ec0

OpcodeDecoding: Get vertex size from the loader

2db74e7

K0bin force-pushed the cache-vertex-size branch from eb55bd2 to 2db74e7 Compare September 18, 2022 23:15

Pokechu22 approved these changes Sep 18, 2022

View reviewed changes

JMC47 merged commit 6f4f5b0 into dolphin-emu:master Sep 19, 2022

K0bin deleted the cache-vertex-size branch September 19, 2022 11:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OpcodeDecoding: Cache vertex sizes #11067

OpcodeDecoding: Cache vertex sizes #11067

K0bin commented Sep 15, 2022 •

edited

K0bin commented Sep 15, 2022

Nerboruto commented Sep 17, 2022

K0bin commented Sep 17, 2022

K0bin commented Sep 18, 2022

iwubcode left a comment

K0bin commented Sep 18, 2022

Pokechu22 left a comment •

edited

theofficialgman commented Sep 19, 2022

badkarma12 commented Sep 20, 2022

OpcodeDecoding: Cache vertex sizes #11067

OpcodeDecoding: Cache vertex sizes #11067

Conversation

K0bin commented Sep 15, 2022 • edited

K0bin commented Sep 15, 2022

Nerboruto commented Sep 17, 2022

K0bin commented Sep 17, 2022

K0bin commented Sep 18, 2022

iwubcode left a comment

Choose a reason for hiding this comment

K0bin commented Sep 18, 2022

Pokechu22 left a comment • edited

Choose a reason for hiding this comment

theofficialgman commented Sep 19, 2022

badkarma12 commented Sep 20, 2022

K0bin commented Sep 15, 2022 •

edited

Pokechu22 left a comment •

edited