BTL/OFI: retry posting receive buffer #12561

Merged (1 commit) on May 21, 2024


hppritcha
Member

Under heavy load (observed at least with the HPE CXI provider), an attempt to post a receive buffer can return -FI_EAGAIN.

This PR uses the OFI_RETRY_UNTIL_DONE macro to retry posting the receive buffer whenever the fi_recv call returns -FI_EAGAIN.
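For readers unfamiliar with the pattern, the retry loop can be sketched roughly as follows. This is a standalone illustration, not the code from this PR: `SKETCH_RETRY_UNTIL_DONE`, `sketch_post_recv`, and `sketch_repost_recv` are hypothetical names, the macro is an assumed approximation of what a retry-until-done macro does rather than Open MPI's actual definition, and the stub stands in for `fi_recv` so the example builds without libfabric.

```c
#include <errno.h>
#include <stddef.h>
#include <sys/types.h>

/* libfabric defines FI_EAGAIN as EAGAIN; mirrored here only so the
 * sketch builds without the libfabric headers. */
#define SKETCH_FI_EAGAIN EAGAIN

/* Hypothetical retry macro: evaluate FUNC repeatedly until it returns
 * something other than -SKETCH_FI_EAGAIN, storing the result in RETURN.
 * A real implementation would likely also drive network progress
 * between attempts so the provider can free resources (assumption). */
#define SKETCH_RETRY_UNTIL_DONE(FUNC, RETURN)          \
    do {                                               \
        do {                                           \
            (RETURN) = (FUNC);                         \
        } while ((RETURN) == -SKETCH_FI_EAGAIN);       \
    } while (0)

/* State for the stub below: how many more times it reports "busy",
 * and how many times it has been called in total. */
static int busy_attempts_left = 3;
static int post_calls = 0;

/* Hypothetical stand-in for fi_recv(): reports -SKETCH_FI_EAGAIN a few
 * times, as a loaded provider might, then succeeds with 0. */
static ssize_t sketch_post_recv(void *buf, size_t len)
{
    (void) buf;
    (void) len;
    post_calls++;
    if (busy_attempts_left > 0) {
        busy_attempts_left--;
        return -SKETCH_FI_EAGAIN;
    }
    return 0;
}

/* Repost a receive buffer, retrying while the provider is busy.
 * Any other return value (success or a hard error) ends the loop
 * and is handed back to the caller. */
static ssize_t sketch_repost_recv(void *buf, size_t len)
{
    ssize_t rc;
    SKETCH_RETRY_UNTIL_DONE(sketch_post_recv(buf, len), rc);
    return rc;
}
```

Calling `sketch_repost_recv(buf, sizeof buf)` with the stub's initial state returns 0 after four calls to the stub: three busy responses followed by one success. Errors other than -FI_EAGAIN are not retried and still need to be handled by the caller.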

Signed-off-by: Howard Pritchard <hppritcha@gmail.com>
@hppritcha hppritcha merged commit 42c744e into open-mpi:main May 21, 2024
14 checks passed