-
Notifications
You must be signed in to change notification settings - Fork 68
v2.x missing commits #1237
v2.x missing commits #1237
Conversation
The opal_mem_hooks_release_hook does not have const on the pointer (though it probably should). This commit eliminates a warning by casting away the const until opal_mem_hooks_release_hook is updated to use const. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov> (cherry picked from open-mpi/ompi@5612998) Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
This commit moves the patcher framework initialization to the memory/patcher component. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov> (cherry picked from open-mpi/ompi@41f00b7) Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
Thanks to Paul Hargrove for reporting. Signed-off-by: Jeff Squyres <jsquyres@cisco.com> (cherry picked from open-mpi/ompi@acbd2c6) Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
Fix warnings introduced by request rework. Signed-off-by: Nathan Hjelm <hjelmn@me.com> (cherry picked from open-mpi/ompi@b001184) Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
|
This PR fixes open-mpi/ompi#1794 and open-mpi/ompi#1795. |
|
Test FAILed. |
|
@hjelmn Looks like this caused a legit segv at the Mellanox jenkins: @jladd-mlnx @Di0gen Could we get a corefile backtrace, perchance? Thanks! |
|
Test PASSed. |
|
Test FAILed. |
|
@jladd-mlnx Both this PR and #1238 are failing with this stack trace in the thread-tests-1.1/overlap test: But this stack trace implies that there's nothing going on with requests (i.e., this possibly isn't related to the request rework). Also, #1238 is a one-sided thing, not a request thing -- but it is also failing in MPI_Barrier. Was there a change in the hcoll stack on the jenkins machine recently, perchance? (it looks like MXM in the stack might be a red herring -- looks like it's just the segv handler, because it was an openib BTL run...?) |
|
@jsquyres @hjelmn Looks like a race condition to me. I ran it five times with no issues, and on the sixth it segfaulted. This is with Yalla. Now it's hanging. I can give access if you'd like. |
|
@jladd-mlnx Can you bisect? |
|
@artpol84 Could you please have Boris take a look. Just ssh to jenkins01 on Bgate and copy paste the command line. |
|
@jsquyres This one probably should go in as well. Want to get an MTT run in tonight. |
|
@hppritcha I'm ok with this one going in, too. |
|
Test FAILed. |
|
Test FAILed. |
Missing commits from patcher and request rework code.
Fixes open-mpi/ompi#1794
Fixes open-mpi/ompi#1795
:bot🏷️bug
:bot:milestone:v2.0.0
:bot:assign: @jsquyres