This repository was archived by the owner on Sep 30, 2022. It is now read-only.

@hjelmn
Member

@hjelmn hjelmn commented Aug 24, 2016

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>

(cherry picked from commit open-mpi/ompi@83062db)

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
@hjelmn
Member Author

hjelmn commented Aug 24, 2016

:bot:label:bug
:bot:milestone:v2.0.1
:bot:assign: @hppritcha

Missed this when transcribing the patch.

@jsquyres
Member

@hjelmn What bug does this solve?

@hjelmn
Member Author

hjelmn commented Aug 24, 2016

A bug I introduced. If MPI_THREAD_MULTIPLE is on, it deadlocks on the first connection. I couldn't test after transcribing the patch because none of our open Cray systems were up. I hit the bug immediately once I got to test.

@hjelmn
Member Author

hjelmn commented Aug 24, 2016

Also, this does not require an rc2 IMO.

@lanl-ompi
Contributor

Test FAILed.

@jsquyres
Member

@hppritcha Assuming you can review, I'm ok with this.

@hjelmn
Member Author

hjelmn commented Aug 24, 2016

LANL jenkins blew up.

:bot:lanl:retest

@lanl-ompi
Contributor

Test FAILed.

@hjelmn
Member Author

hjelmn commented Aug 24, 2016

@hppritcha LANL jenkins looks like it is busted.

@mellanox-github

Test FAILed.
See http://bgate.mellanox.com/jenkins/job/gh-ompi-release-pr/2122/ for details.

@hjelmn
Member Author

hjelmn commented Aug 24, 2016

And now the Mellanox Jenkins is belly-up.

00:48:21.114 --2016-08-24 20:56:33--  (try: 2)  http://www.mcs.anl.gov/~thakur/thread-tests/thread-tests-1.1.tar.gz
00:48:21.114 Connecting to www.mcs.anl.gov (www.mcs.anl.gov)|140.221.6.95|:80... failed: Connection timed out.
00:50:28.370 Retrying.

@hjelmn
Member Author

hjelmn commented Aug 24, 2016

bot:mellanox:retest
bot:lanl:retest

@lanl-ompi
Contributor

Test FAILed.

@mellanox-github

Test PASSed.
See http://bgate.mellanox.com/jenkins/job/gh-ompi-release-pr/2124/ for details.

@hppritcha
Member

bot:lanl:retest

@lanl-ompi
Contributor

Test FAILed.

@hppritcha
Member

We're hitting UH network problems of some sort.
@edgargabriel if you have the chance could you see if the cluster at UH is having external connectivity issues?

@hppritcha
Member

Let's wait for CI, modulo the mess-up at UH, then merge.
👍

@edgargabriel
Member

I can log in to the node without issues, but I know that there were some network issues over the last two days.

@jsquyres
Member

bot:ibm:retest

@hppritcha
Member

@edgargabriel says UH is messed up while school starts up, so we'll ignore the dlopen and distcheck tests for this PR.

@edgargabriel
Member

Not objecting to the decision, but I am still a bit surprised that it 'hangs', since both of the PRs I filed today got through that point without major issues.

@hppritcha
Member

bot:lanl:retest
@jsquyres I vote for merging and seeing how MTT goes over the weekend.

@jsquyres jsquyres merged commit 17bbb5c into open-mpi:v2.x Aug 26, 2016