-
Notifications
You must be signed in to change notification settings - Fork 426
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to run on node without infiniband? #8952
Comments
If I use these settings
then I get these errors:
|
For the record, some related issues |
@cponder By default, OpenMPI disables UCX for non-rdma networks. |
If I explicitly disable the UCX here
I don't see any problems. But if I unset the variable instead
which I'd expect to exhibit the default behavior, still triggers UCX warnings
although the MPI operation completes.
the MPI operation completes with no warning message. |
@cponder when OMPI_MCA_pml is unset, OpenMPI would try to initialize UCX, which will print a warning that some of the transports specified by ICX_TLS are not available. Is it possible to avoid setting UCX_TLS? |
Yeah we'll just use
on these systems. But my closing question is that if this is the case
and there's no IB on the node, then I would expect this
to trigger the default behavior and not use UCX. But it still initializes the UCX in spite of this, right? |
Yes, it still initializes UCX to check if there are rdma networks. |
Ok, I'll go ahead and close. |
I'm using UCX 1.13.1 built inside a container that works fine on an IB cluster.
The problem is that I'm trying to use it on a node that doesn't have IB cards or a Mellanox driver installed, much less a backbone network.
I can run my using the setting
but is there a way to not have to disable the UCX? Or should disabling it be the preferred method?
The text was updated successfully, but these errors were encountered: