-
Notifications
You must be signed in to change notification settings - Fork 914
Do not mark a -host specification as "slots given" unless it actually specifies the #slots #2730
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
… specifies the #slots. This allows the system to dynamically detect the number of available processors and set the #slots accordingly. Signed-off-by: Ralph Castain <rhc@open-mpi.org>
Do we need to change any documentation for this change in behavior? |
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
Good point - I have updated the man page |
@jsquyres what do you think about this kind of change in behavior within a release stream? |
Here's the spread sheet that we worked on at the Dallas F2F 2/16: https://docs.google.com/spreadsheets/d/1poOwNKtYxnDnpF7-D15lmcFVtLRNrmf4_hu3Obtu95M/edit |
@hppritcha Hate to tell you, but that spreadsheet isn't accurate - at least, that isn't the current behavior. |
@hppritcha @rhc54 imho, this specific case is kind of undocumented/unspecified. one one hand, we specify a slot-list, and on the other hand, we specify a host. for example, what if the slot-list contains 12 slots but we specify a different number of slots with the in this very specific case, e.g. |
I've said this before, but I'll reiterate here - I am no longer supporting the -host/-hostfile code. We keep coming up with every imaginable corner-case, and the complexity of trying to handle all of them - while preserving existing behavior - is insane. Clear evidence: the current master behavior no longer mirrors @jsquyres spreadsheet. This patch makes the branch follow the master and fixes the user's problem, but the result isn't the behavior in the spreadsheet. Someone else can figure out what they want to do. |
bot:mellanox:retest |
2 similar comments
bot:mellanox:retest |
bot:mellanox:retest |
Here's what we have at the moment across several releases for the
where nslots is the number of cores on the host foo |
Discussed at the devel F2F and decided to keep the behavior of 1.10.x series (modulo the new --host foo:X). So closing this PR. |
For reference: https://github.com/open-mpi/ompi/wiki/Meeting-2017-01 |
Food for thought: It would be nice if we had a system regularly running a handful of |
This allows the system to dynamically detect the number of available processors and set the #slots accordingly.
Signed-off-by: Ralph Castain rhc@open-mpi.org