-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix CachingAllocator debug for non-async operations #45368
Fix CachingAllocator debug for non-async operations #45368
Conversation
type bugfix |
cms-bot internal usage |
-code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-45368/40797
Code check has found code style and quality issues which could be resolved by applying following patch(s)
|
I think it should fix the issue, yes. |
Actually, I think that the behaviour of The issue is how we handle pinned host memory:
The quick solution implemented here is to wait after the I agree a better solution is to use an immediate memory write: either |
b476030
to
2e5ed47
Compare
-code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-45368/40804
Code check has found code style and quality issues which could be resolved by applying following patch(s)
|
2e5ed47
to
645f6a6
Compare
@VinInn what do you think of this approach ? |
enable gpu |
please test |
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-45368/40805
|
+heterogeneous |
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-45368/40828
|
Pull request #45368 was updated. @AdrianoDee, @kskovpen, @miquork, @srimanob, @subirsarkar, @sunilUIET can you please check and sign again. |
+1 Size: This PR adds an extra 44KB to repository
Comparison SummarySummary:
GPU Comparison SummarySummary:
|
+Upgrade |
+pdmv |
This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @antoniovilela, @rappoccio, @sextonkennedy (and backports should be raised in the release meeting by the corresponding L2) |
+1 |
PR description:
@VinInn pointed out that filling memory asynchronously may be incorrect if the memory is later being set synchronously, without using a queue.
This is often the case with pinned host memory buffers, where the allocation and memset may be asynchronous, but the content is accessed directly using host-only operations.
This change makes the allocator wait for the memset to complete before returning the memory buffer to the user code.
It also adds a customisation function to activate memory filling, and uses it in the non-profling Alpaka workflows.
PR validation:
The new tests pass.
If this PR is a backport please specify the original PR and why you need to backport that PR. If this PR will be backported please specify to which release cycle the backport is meant for:
To be backported to 14.0.x to fix the same bug there.