New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
VideoBackend/OGL: Prefer KHR_shader_subgroup over NV_shader_thread. #11523
Conversation
|
Gave it a try on Windows 11, with a NVIDIA RTX 3060 (laptop version, 130W TGP, driver 528.24, with Optimus enabled) and a Intel UHD Graphics (Gen12, Tiger Lake-H, driver 31.0.101.4090), using Lord Crump's "PUNISH HIM" cutscene from Paper Mario TTYD prologue. All settings were kept at default, except for disabling Dual Core mode since TTYD is sensitive to it. Here are the results:
Haven't noticed anything wrong as far as rendering goes. Will test this on some old hardware soon (NVIDIA Fermi, Intel Haswell)... Edit: forgot to mention the CPU, it's a Core i7-11800H |
822c8d3
to
cd5e2d5
Compare
|
This might be useful, it contains XML reports from GLview, listing supported extensions and other related info of the GPUs I tested: |
|
@mbc07 Yes, this was very helpful. The XML files showed that the newer Intel driver supports everything we need and that this feature was enabled and so tested. It also shows that Fermi wasn't supported before and won't be afterwards. However, I'm a bit doubtful about the speedup on your NVIDIA RTX 3060, it should be neglegtable as we "just" switch the GLSL extension to generate the same intrinsic. |
I'll try retesting it later, this time with Optimus disabled. In my initial test I forgot to disable Optimus, so whatever was rendered on it passed through the iGPU before reaching the screen. The iGPU hasn't been particularly stable since the driver merge Intel did early January (Gen12 iGPUs and their Arc dGPUs now shares the same driver)... |
|
@degasus you were right. Retested the RTX 3060 with NVIDIA Optimus disabled and got 178 FPS with this PR and 176 FPS on master (min/max IR results are the same as before)... |
cd5e2d5
to
de4987a
Compare
de4987a
to
7bcf36f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Apart from the GlslVersion uglyness, it looks fine to me.
7bcf36f
to
53184e3
Compare
While the NV extension is totally fine, the KHR extension should be able to support more hardware. For NVIDIA, the hardware either supports both or neither, it just needs a driver from the last two years. For AMD, the drivers from late 2022-12 seems to bring support for the KHR extension. For Intel, the KHR is also supported for some years.
53184e3
to
4b2aa94
Compare



While the NV extension is totally fine, the KHR extension should be able to support more hardware.
For NVIDIA, the hardware either supports both or neither, it just needs a driver from the last two years.
For AMD, the drivers from late 2022-12 seems to bring support for the KHR extension.
For Intel, the KHR is also supported for some years.
This needs testing of BBOX on ALL GPU vendors, both behavior and performance.
And we need reporting if the two optional features are really supported on AMD.
Testing results: