MPI hanging if MPI_Init_thread used on generoso with foss/2023a (OpenMPI 4.1.5 / libfabric 1.18.0) #18925
Comments
Also worth mentioning is that we're not seeing this with
Also solve the issue:
Test ideas from open-mpi/ompi#11295 (comment)
So, which option is the safest (in particular for
workaround for
@branfosj We can close this one now, right?
Yes. For reference, we've decided on
as this fixed the issue we were seeing and looked to be the change least likely to break anything else.
It is hanging in the PSM3 provider in libfabric. We are seeing this when testing on generoso in #18443, #18444, #18731.
Error:
If we control libfabric with FI_PROVIDER (such as FI_PROVIDER="udp,tcp" or FI_PROVIDER="psm2") then the example completes. It fails if we set FI_PROVIDER="psm3".
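For anyone wanting to repeat the provider comparison, here is a rough sketch of a helper that runs a command once per FI_PROVIDER value and uses a timeout to tell a hang from a normal exit. The function name `run_with_provider` and the 60 second default timeout are my own choices for illustration and are not from the issue. It assumes `mpirun` and the compiled example are available when used as in the comment at the bottom.

```python
import os
import subprocess


def run_with_provider(cmd, provider, timeout=60, log_level=None):
    """Run cmd with FI_PROVIDER set and classify the outcome.

    Returns "completed" (exit code 0), "failed" (non-zero exit code),
    or "hung" (no exit within `timeout` seconds).
    """
    env = dict(os.environ)
    env["FI_PROVIDER"] = provider        # restrict libfabric to these providers
    if log_level is not None:
        env["FI_LOG_LEVEL"] = log_level  # e.g. "debug" for verbose libfabric output
    try:
        # Output is inherited, so any libfabric logging goes to the terminal.
        result = subprocess.run(cmd, env=env, timeout=timeout)
    except subprocess.TimeoutExpired:
        # subprocess.run kills the launched process before raising.
        return "hung"
    return "completed" if result.returncode == 0 else "failed"


# Hypothetical usage, assuming mpirun and ./a.out exist on this machine:
#
#   for provider in ("udp,tcp", "psm2", "psm3"):
#       outcome = run_with_provider(["mpirun", "-n", "2", "./a.out"], provider)
#       print(f"FI_PROVIDER={provider}: {outcome}")
```

Note that on timeout only the launcher process itself is killed, so leftover MPI ranks may need cleaning up by hand.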
Running with FI_LOG_LEVEL=debug mpirun -n 2 ./a.out gives lots of output. I think the relevant line is
From the code: