mpiexec 4.1.3, how to avoid transfer PMIX_INSTALL_PREFIX to the child process of orted #13196

@gzm55

Background information

What version of Open MPI are you using? (e.g., v4.1.6, v5.0.1, git branch name and hash, etc.)

v4.1.4

Describe how Open MPI was installed (e.g., from a source/distribution tarball, from a git clone, from an operating system distribution package, etc.)

Installed from a conda package, then copied (xcopy) to the cluster.

Please describe the system on which you are running

  • Operating system/version: CentOS 7
  • Computer hardware: x86_64
  • Network type: localhost

Details of the problem

In our cluster, the MPI prefix differs between nodes. For example, on the master node the opal_prefix is /path/master/prefix, while on the other nodes it is /path/node-<id>/prefix.

The job is run like this on the master node:

export OPAL_PREFIX=/path/master/prefix
unset PMIX_INSTALL_PREFIX
mpiexec -n 1 --mca plm_rsh_agent /usr/bin/echo sh -c 'echo $PMIX_INSTALL_PREFIX'

Then, in another shell, I run the orted command:

export OPAL_PREFIX=/path/node-<id>/prefix
export PMIX_INSTALL_PREFIX=$OPAL_PREFIX
orted -mca ess "env" ...

The output in the master shell is always /path/master/prefix, not the node-specific path, even though I exported the correct value of PMIX_INSTALL_PREFIX before starting orted.

How can I prevent mpiexec from passing its PMIX_INSTALL_PREFIX on to the child processes of orted?
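
To make the goal concrete, here is a minimal sketch of a workaround I could fall back on: overriding the variable in a small wrapper just before the real program starts. This is only an assumption about what might work, not a confirmed Open MPI mechanism. The function name with_node_prefix and the path /tmp/example-node/prefix are hypothetical. Setting OPAL_PREFIX to the same value mirrors the exports I use before orted above.

```shell
# Hypothetical helper: run a command with the node-local prefix,
# ignoring whatever PMIX_INSTALL_PREFIX was inherited from the parent.
with_node_prefix() {
    node_prefix=$1
    shift
    # env changes the environment only for the command it launches,
    # so the calling shell keeps its own values.
    env OPAL_PREFIX="$node_prefix" PMIX_INSTALL_PREFIX="$node_prefix" "$@"
}

# Simulate the value inherited from mpiexec on the master node.
export PMIX_INSTALL_PREFIX=/path/master/prefix

# The child sees the node-local value instead of the inherited one.
with_node_prefix /tmp/example-node/prefix sh -c 'echo "$PMIX_INSTALL_PREFIX"'
```

In a real run the same logic would have to live in a script file present on every node and be passed to mpiexec as the program to launch. How each node would determine its own prefix is left open here. I would still prefer a way to stop the variable from being forwarded in the first place.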
