PML/UCX: fixed hang on MPI_Finalize - v3.1.x #5248
|
@hoopoepg Is this a cherry pick from master? Or is this unique to v3.1? If it's a cherry pick from master, please be sure to use [...]. In general, we usually commit fixes to master first and then file PRs that are cherry picks from master to the release branches -- unless there's a fix specific to a release branch (and not relevant to master), of course. But then we specifically note that in the PR so that reviewers aren't surprised. |
fixes issue openucx/ucx#2656
added flush for worker object to complete all pending operations
Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
(cherry picked from commit 0a8261f)
|
@jsquyres Yep, sorry, I forgot to note that this is a cherry-pick. |
|
Ok, thanks. Does it need to go to v3.0.x as well? |
|
I think so; this seems to be a long-standing issue. |
|
yes |
- added debug output for worker flush failure
Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
(cherry picked from commit c3dbc86)
|
bot:mellanox:retest |
|
@bwbarrett @jsquyres Can you please merge this PR? |
fixes issue openucx/ucx#2656
added flush for worker object to complete all pending operations
cherry picked from #5227, commit 0a8261f
Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
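
For readers unfamiliar with the idea behind "flush for worker object to complete all pending operations", here is a toy model of it in Python. It is not Open MPI or UCX code, and it does not reproduce the patch: `ToyWorker`, `post`, `progress`, `flush`, and `finalize` are all hypothetical names invented for illustration. UCX does expose worker-level flush and progress calls (for example `ucp_worker_flush_nb` and `ucp_worker_progress`), but whether this patch uses exactly those is not shown in this conversation. The sketch only illustrates the general pattern: drive progress until nothing is pending before tearing the worker down, and report a flush that does not finish.

```python
from collections import deque


class ToyWorker:
    """Hypothetical stand-in for a communication worker; not the UCX API."""

    def __init__(self):
        self.pending = deque()   # operations posted but not yet completed
        self.completed = []      # operations finished by progress()

    def post(self, op):
        """Queue an operation; it completes only when progress() runs."""
        self.pending.append(op)

    def progress(self):
        """Complete at most one pending operation; return how many completed."""
        if not self.pending:
            return 0
        self.completed.append(self.pending.popleft())
        return 1

    def flush(self, max_iters=1000):
        """Drive progress until nothing is pending; False if the bound is hit."""
        for _ in range(max_iters):
            if not self.pending:
                return True
            self.progress()
        return not self.pending


def finalize(worker, flush_first=True):
    """Tear down the toy worker; return the operations left incomplete."""
    if flush_first and not worker.flush():
        # Assumption: report the failure rather than dropping work silently.
        print("worker flush failed; %d operations still pending"
              % len(worker.pending))
    leftover = list(worker.pending)
    worker.pending.clear()
    return leftover


if __name__ == "__main__":
    w = ToyWorker()
    for op in ("send-0", "send-1", "send-2"):
        w.post(op)
    # With a flush before teardown, no operation should be left over.
    print(finalize(w))
```

Calling `finalize(worker, flush_first=False)` on a worker with queued operations returns them as leftovers, which is the toy analogue of tearing down while work is still in flight.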