Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

5.0.3 "Both TCP interface include and exclude lists were specified" #12467

Open
jwagnerhki opened this issue Apr 15, 2024 · 3 comments
Open

5.0.3 "Both TCP interface include and exclude lists were specified" #12467

jwagnerhki opened this issue Apr 15, 2024 · 3 comments

Comments

@jwagnerhki
Copy link

Thank you for taking the time to submit an issue!

Background information

What version of Open MPI are you using? (e.g., v4.1.6, v5.0.1, git branch name and hash, etc.)

v5.0.3

Describe how Open MPI was installed (e.g., from a source/distribution tarball, from a git clone, from an operating system distribution package, etc.)

From source tarball

If you are building/installing from a git clone, please copy-n-paste the output from git submodule status.

Please describe the system on which you are running

  • Operating system/version: CentOS 7.7.1908
  • Computer hardware:
  • Network type: Ethernet, Infiniband

Details of the problem

Excluding an interface, without an include list, results in a bizarre error claiming an exclude and include list were specified:

$ mpirun -np 45 --hostfile test.machines --show-progress --mca mpi_yield_when_idle 0 --map-by seq --bind-to none --mca btl_base_verbose 100 --mca btl_tcp_if_exclude 10.10.0.0/16 --mca oob_tcp_if_exclude 10.10.0.0/16  /cluster/difx/runmpifxcorr.DiFX-255-ompi5 test.input
--------------------------------------------------------------------------
Both TCP interface include and exclude lists were specified:

  Include: 10.10.0.0/16
  Exclude: 10.10.0.0/16

Only one of these can be given.
--------------------------------------------------------------------------
@rhc54
Copy link
Contributor

rhc54 commented Apr 15, 2024

Change the names as follows:

--omca btl_tcp_if_exclude 10.10.0.0/16 --prtemca oob_tcp_if_exclude 10.10.0.0/16

The problem is that things get confused when you don't clearly specify the target of the params (i.e., the runtime vs the MPI layers). We try to work it out, but cannot always guess correctly.

@ggouaillardet
Copy link
Contributor

If that does not fix your issue, double check the config files and the environment, the included interfaces might be there.

Copy link

It looks like this issue is expecting a response, but hasn't gotten one yet. If there are no responses in the next 2 weeks, we'll assume that the issue has been abandoned and will close it.

@github-actions github-actions bot added the Stale label Apr 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants