btl/ofi: increase outstanding_rdma earlier for flush #11656

Merged
merged 1 commit into from
May 10, 2023

@wzamazon wzamazon commented May 8, 2023

Prior to this patch, outstanding_rdma was incremented after the RDMA request was sent. In a multi-threaded environment, this could cause mca_btl_ofi_flush() to return before an RDMA operation submitted by the caller had completed.

To address this, the patch moves the increment of outstanding_rdma to before the RDMA request is submitted, and decrements the counter if submission fails.
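
For readers who want to see the ordering in isolation, here is a minimal sketch of the pattern described above. It is not the actual btl/ofi code: `sketch_module_t`, `sketch_rdma_submit()`, `sketch_rdma_complete()`, `sketch_flush_done()` and the two `sketch_post_*` callbacks are hypothetical names made up for illustration, and the real counter and flush logic live in the OFI BTL. The only point shown is that the counter goes up before the request is posted and is rolled back if posting fails.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for the BTL state; not the real Open MPI struct. */
typedef struct {
    atomic_int outstanding_rdma;
} sketch_module_t;

/* Hypothetical transport call: returns 0 on success, nonzero on failure. */
typedef int (*sketch_post_fn)(void *ctx);

int sketch_rdma_submit(sketch_module_t *m, sketch_post_fn post, void *ctx)
{
    /* Count the operation BEFORE posting it, so a concurrent flush
     * can never observe zero while this request is in flight. */
    atomic_fetch_add(&m->outstanding_rdma, 1);

    int rc = post(ctx);
    if (rc != 0) {
        /* Nothing was posted, so no completion will arrive: undo the count. */
        atomic_fetch_sub(&m->outstanding_rdma, 1);
    }
    return rc;
}

/* Called once per completed operation (e.g. from a completion handler). */
void sketch_rdma_complete(sketch_module_t *m)
{
    int prev = atomic_fetch_sub(&m->outstanding_rdma, 1);
    assert(prev > 0);
    (void) prev;
}

/* A flush may return only once no counted operation remains. */
bool sketch_flush_done(sketch_module_t *m)
{
    return atomic_load(&m->outstanding_rdma) == 0;
}

/* Hypothetical post callbacks used to exercise both paths. */
int sketch_post_ok(void *ctx)   { (void) ctx; return 0; }
int sketch_post_fail(void *ctx) { (void) ctx; return -1; }
```

With the old ordering (post first, increment afterwards), another thread calling the flush check between those two steps could see a count of zero and return early. The failure-path decrement matters just as much: without it, a failed submission would leave the counter permanently nonzero and a flush would never finish.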

Signed-off-by: Wei Zhang <wzam@amazon.com>

@devreal devreal left a comment

LGTM 👍

@bwbarrett bwbarrett merged commit 0751f42 into open-mpi:main May 10, 2023