Skip to content

Conversation

@rhc54
Copy link
Contributor

@rhc54 rhc54 commented May 6, 2016

This was responsible for breaking things like map-by foo

Fixes #1623

…This was responsible for breaking things like map-by <foo>.

Remove debug, let orterun send terminate cmd to DVM

Recover the DVM support
@rhc54
Copy link
Contributor Author

rhc54 commented May 6, 2016

Spent a bunch of time tracking down this "error" from the Mellanox Jenkins, and I don't think the test is correct. The cmd line shown in fact works correctly:

$  mpirun -np 2 -mca mca_base_env_list 'XXX_A=1;XXX_B=2;XXX_C;XXX_D;XXX_E' env | grep XXX
OMPI_MCA_mca_base_env_list=XXX_A=1;XXX_B=2;XXX_C;XXX_D;XXX_E
XXX_E=5
XXX_D=4
XXX_C=3
OMPI_MCA_mca_base_env_list=XXX_A=1;XXX_B=2;XXX_C;XXX_D;XXX_E
XXX_E=5
XXX_D=4
XXX_C=3
$

So when you run that thru wc -l, you will get an answer of 8, not 6.

@rhc54
Copy link
Contributor Author

rhc54 commented May 6, 2016

Should note: the Jenkins test only exports valus for C, D, and E. Perhaps it was expecting that A and B would be set? If so, then I'm not sure where and how that was supposed to be done, and will have to look further at some point.

@rhc54 rhc54 merged commit 6311f86 into open-mpi:master May 6, 2016
@rhc54 rhc54 deleted the topic/repair branch May 6, 2016 21:39
@ggouaillardet
Copy link
Contributor

@rhc54 I previously noticed Jenkins script explicitly export A and B before invoking mpirun

@rhc54
Copy link
Contributor Author

rhc54 commented May 7, 2016

Sadly, there is no documentation anywhere on these tests, nor are they available to me - nor is the functionality particularly documented in the code. So I'm working my way thru them as best I can.

Progress is being made...

@ggouaillardet
Copy link
Contributor

I looks like there is a
set -x
in the Jenkins script, this is how I found it when reading the logs
I previously suggested all Jenkins scripts should be public (read only github repo looks fine) so anyone can understand what is tested and how. shall I add this to the agenda of the next telcon ?

@rhc54
Copy link
Contributor Author

rhc54 commented May 7, 2016

sure - it would definitely help

@jsquyres
Copy link
Member

jsquyres commented May 7, 2016

@jladd-mlnx @Di0gen can you please comment on this conversation?

@mike-dubman
Copy link
Member

@ggouaillardet - the script is public and here (also mindist.c test)

https://github.com/mellanox-hpc/jenkins_scripts

@ggouaillardet
Copy link
Contributor

thanks @miked-mellanox
@rhc54 here is what is broken

export XXX_A=1
echo XXX_B=2 > xxx.conf
mpirun -np 1 -am xxx.conf env | grep ^XXX

XXX_B is not exported

git bisect indicates 58dd41f is the guilty commit
according to the logs 2839484 might have been an attempt to fix it, but it did not fix this specific test case.

@rhc54
Copy link
Contributor Author

rhc54 commented May 9, 2016

Yes, I know - I pointed that out in an issue I filed, and requested help to decipher the AMCA stuff as it isn't very well commented 😄

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants