Using a multicontext version of glad to prepare for multi GPU support. #10118

jhmueller-huawei · 2022-06-14T13:55:28Z

Refactoring all Vulkan calls to use the device context.

Signed-off-by: Joerg H. Mueller joerg.mueller@huawei.com

jhmueller-huawei · 2022-06-15T07:47:35Z

Let me add some information to the changes in this pull request:

For multi-GPU support we need to load Vulkan functions for each device in use which doesn't allow us to use a global context that is usually created by GLAD anymore.

Instead, multiple GladVulkanContext structs can be created. This patch does so once for the Vulkan Instance and once for each Device.

Since calling functions requires using the GladVulkanContext directly now, it didn't make sense to keep an interface class (FunctionLoader) and the implementation (GladFunctionLoader) separate anymore since there is no alternative interface and no viable option to implement one, so I moved everything into the FunctionLoader class (creating a FunctionLoader.cpp for it) and removed GladFunctionLoader.

I also removed the dynamic module handling code in FunctionLoader since on closer inspection that is exactly what GLAD is doing.

The biggest part of the diff is the auto-generated vulkan.h file from https://gen.glad.sh/ where now the mx flag was enabled additionally to allow multiple contexts.

Gems/Atom/RHI/Vulkan/Code/Include/Atom/RHI.Loader/FunctionLoader.h

Gems/Atom/RHI/Vulkan/Code/Source/RHI/NullDescriptorManager.cpp

Gems/Atom/RHI/Vulkan/Code/Source/RHI/Instance.h

moudgils

For the most part these changes look good to me. Are you able to test AtomSampleViewer and run the full test suite to ensure none of the existing functionality is broken?

martinwinter-huawei · 2022-06-29T13:56:25Z

Having updated both o3de and o3de-atom-sampleviewer, we get the same result both on the unmodified development branch as well as with this commit included, running scripts/_fulltestsuite_bv.luac, which runs through until we get a crash.
It crashes in MSAA_RPI_ExampleComponent trying to de-reference an iterator in Scene::RemoveRenderPipeline(AZ::Name const &).

We are testing on Linux and with Clang 13.0.1.

martinwinter-huawei · 2022-07-06T06:29:13Z

I added a bug report at the o3de-atom-sampleviewer repository to highlight this issue.

jhmueller-huawei · 2022-07-22T11:25:00Z

Any progress on this @moudgils? I just updated to the latest development branch. @kh-huawei just tried to run the test suite of AtomSampleViewer with it on Windows and everything worked.

moudgils · 2022-07-22T15:56:14Z

Any progress on this @moudgils? I just updated to the latest development branch. @kh-huawei just tried to run the test suite of AtomSampleViewer with it on Windows and everything worked.

Ahh. As long as the full test suite runs correctly for Vk we should be good to check this in. I will approve it and start AR for it.

moudgils · 2022-07-25T15:45:53Z

There are a few issues with this PR as flagged by the AR run. Please go ahead and address them so that the AR run passes - https://jenkins.build.o3de.org/job/O3DE/view/change-requests/job/PR-10118/

Refactoring all Vulkan calls to use the device context. Signed-off-by: Joerg H. Mueller <joerg.mueller@huawei.com>

jhmueller-huawei · 2022-07-26T12:33:04Z

There are a few issues with this PR as flagged by the AR run. Please go ahead and address them so that the AR run passes - https://jenkins.build.o3de.org/job/O3DE/view/change-requests/job/PR-10118/

The update should fix those issues. I assume you need to trigger another run, or does it run automatically?

moudgils · 2022-08-03T20:27:46Z

@jhmueller-huawei This PR is causing runtime crash for Android. Specifically the call to SetDebugUtilsObjectNameEXT is seg-faulting. Are you able to look into this? An engineer (internally) is looking into it but we may decide to revert this PR until the cause of the issue is identified/resolved.

jhmueller-huawei · 2022-08-04T09:04:33Z

Unfortunately, not really. I also don't have any idea why this call would crash, since it's just one of the many function calls that now go through the context instead of a global that is set by glad. It is of course a function provided by an extension, so maybe the issue is with the extension management in glad on Android.

…glad context (#11151) Signed-off-by: moraaar <moraaar@amazon.com> ## What does this PR do? Fixes `VK_EXTENSION_SUPPORTED` macro to check at runtime if a vulkan extension is supported or not. Glad vulkan checks each extension availability at runtime (when loaded) and saves it in global int variables `GLAD_VK_EXT_extensionname`. PR #10118 changed this so it checks if a macro `VK_EXT_...` is 1, which it always is, but it's not checking at runtime if the device/instance has support for the extension. So what's happening is that on android it now thinks extension `EXT_debug_utils` is available, which is not, and then it crashes when calling `CreateDebugUtilsMessengerEXT`. With the new glad vulkan header introduced in PR #10118 instead of checking `GLAD_VK_EXT_extensionname` it should be checking `context.EXT_extensionname` int instead. ## How was this PR tested? Run default level using vulkan rhi on pc and android.

…#9) Removed duplicated function glad loader code and using Atom_RHI_Vulkan.Glad.Static instead. Fixed compilation issues after glad vulkan header was updated to use multi-context in Vulkan gem (o3de/o3de#10118) Built with latest O3DE using AtomSampleViewer project (openxr branch). RHI VR Sample works with Quest 2 via link cable. Fixes #7 Signed-off-by: moraaar <moraaar@amazon.com>

…#9) Removed duplicated function glad loader code and using Atom_RHI_Vulkan.Glad.Static instead. Fixed compilation issues after glad vulkan header was updated to use multi-context in Vulkan gem (o3de/o3de#10118) Built with latest O3DE using AtomSampleViewer project (openxr branch). RHI VR Sample works with Quest 2 via link cable. Fixes o3de#7 Signed-off-by: moraaar <moraaar@amazon.com>

jhmueller-huawei requested review from a team as code owners June 14, 2022 13:55

gadams3 requested a review from a team June 14, 2022 14:34

rgba16f requested review from moudgils and jiaweig-amzn June 14, 2022 15:20

thefranke reviewed Jun 21, 2022

View reviewed changes

Gems/Atom/RHI/Vulkan/Code/Include/Atom/RHI.Loader/FunctionLoader.h Show resolved Hide resolved

thefranke reviewed Jun 21, 2022

View reviewed changes

Gems/Atom/RHI/Vulkan/Code/Source/RHI/NullDescriptorManager.cpp Show resolved Hide resolved

thefranke reviewed Jun 21, 2022

View reviewed changes

Gems/Atom/RHI/Vulkan/Code/Source/RHI/Instance.h Outdated Show resolved Hide resolved

lmbr-pip added the sig/graphics-audio Categorizes an issue or PR as relevant to SIG graphics-audio. label Jun 23, 2022

jhmueller-huawei force-pushed the glad_multi_context_refactor branch 2 times, most recently from 90765a6 to d32c516 Compare June 28, 2022 08:22

moudgils reviewed Jun 28, 2022

View reviewed changes

jhmueller-huawei force-pushed the glad_multi_context_refactor branch from d32c516 to 7f5ade6 Compare July 22, 2022 11:23

tjmgd self-requested a review July 22, 2022 11:52

tjmgd approved these changes Jul 22, 2022

View reviewed changes

moudgils approved these changes Jul 22, 2022

View reviewed changes

Using a multicontext version of glad to prepare for multi GPU support.

5b77bc6

Refactoring all Vulkan calls to use the device context. Signed-off-by: Joerg H. Mueller <joerg.mueller@huawei.com>

jhmueller-huawei force-pushed the glad_multi_context_refactor branch from 7f5ade6 to 5b77bc6 Compare July 26, 2022 08:29

thefranke approved these changes Jul 28, 2022

View reviewed changes

thefranke merged commit 3cd9628 into o3de:development Jul 28, 2022

moraaar mentioned this pull request Aug 2, 2022

OpenXRVk Gem fails to compile o3de/o3de-extras#7

Closed

moraaar mentioned this pull request Aug 8, 2022

Fix VK_EXTENSION_SUPPORTED macro by checking the runtime variable of glad context #11151

Merged

moraaar mentioned this pull request Aug 9, 2022

Fixed OpenXRVk compilation and using Vulkan Function Loader from Atom o3de/o3de-extras#9

Merged

jhmueller-huawei mentioned this pull request Feb 28, 2023

Reviewer Nomination: jhmueller-huawei o3de/sig-graphics-audio#123

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using a multicontext version of glad to prepare for multi GPU support. #10118

Using a multicontext version of glad to prepare for multi GPU support. #10118

jhmueller-huawei commented Jun 14, 2022

jhmueller-huawei commented Jun 15, 2022

moudgils left a comment

martinwinter-huawei commented Jun 29, 2022

martinwinter-huawei commented Jul 6, 2022

jhmueller-huawei commented Jul 22, 2022

moudgils commented Jul 22, 2022 •

edited

moudgils commented Jul 25, 2022

jhmueller-huawei commented Jul 26, 2022

moudgils commented Aug 3, 2022 •

edited

jhmueller-huawei commented Aug 4, 2022

Using a multicontext version of glad to prepare for multi GPU support. #10118

Using a multicontext version of glad to prepare for multi GPU support. #10118

Conversation

jhmueller-huawei commented Jun 14, 2022

jhmueller-huawei commented Jun 15, 2022

moudgils left a comment

Choose a reason for hiding this comment

martinwinter-huawei commented Jun 29, 2022

martinwinter-huawei commented Jul 6, 2022

jhmueller-huawei commented Jul 22, 2022

moudgils commented Jul 22, 2022 • edited

moudgils commented Jul 25, 2022

jhmueller-huawei commented Jul 26, 2022

moudgils commented Aug 3, 2022 • edited

jhmueller-huawei commented Aug 4, 2022

moudgils commented Jul 22, 2022 •

edited

moudgils commented Aug 3, 2022 •

edited