You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@ltalirz , any reason we cannot use HPC-X (which is OMPI+UCX) here? The mt-init scripts are there in the HPC-X folder. (/opt/hpcx*). The UCX mt build is also here.
Hi @jithinjosepkl , sorry for the long delay on our side.
First, my counter question would be: what speaks against having the OpenMPI in /opt be compiled against UCX with thread support? Would you be open to a PR that fixes this?
Second, we tried using the OpenMPI from HPC-X but ran into a number of issues.
After fixing this, mpicc and mpifort from the mpi/hpcx module pick up the outdated gcc 4.8.5 from the CentOS image which fails to compile even a simple MPI program (this despite the fact that ompi_info tells that the OpenMPI bundled with HPC-X is built with gcc 9.2.0).
This can be fixed by explicitly loading module load gcc-9.2.0 mpi/hpcx (we suggest making gcc-9.2.0 a dependency of hpi/hpcx)
Even after this step, we still run into errors when using the hpcx openmpi as an external mpi library in spack, since the libtool .la files shipped with hpcx point to non-existent libraries in /hpc (that were used to compiled hpcx).
Given your comment in #102 we understand that we should switch to AlmaLinux 8.6.
We will open a new issue, should any of these issues persist there.
cc @matt-chan
I am unable to use the UCX messaging layer together with OpenMPI for an application that supports MPI + OpenMP parallelization strategies.
While the UCX pml component is found, initializing the component fails:
When I
export OMPI_MCA_pml_ucx_verbose=10
, I am notified thatThis happens even when I
export OMP_NUM_THREADS=1
(I guess this is independent of whether multiple threads are actually used).I read in openucx/ucx#5284 (comment) that I may need UCX to be built with the
--enable-mt
option.Would it be possible to have the UCX that ships with CentOS-HPC built with the
--enable-mt
option?Or is this already the case and I am barking up the wrong tree here?
The text was updated successfully, but these errors were encountered: