Contributor

@rhc54 rhc54 commented Dec 29, 2016

There are only five places in the non-daemon code paths where opal_hwloc_topology is currently referenced:

  • shared memory BTLs (sm, smcuda). I have added a code path to these components that uses
    the location string, if available, instead of the topology itself, so they no longer
    need to instantiate the topology.

  • openib BTL. This uses the distance matrix. I have not yet found a way to replace
    that reference, so this component will instantiate the topology.

  • usnic BTL. Uses the distance matrix.

  • treematch TOPO component. Runs a complex tree-based algorithm, so it will instantiate
    the topology.

  • ess base functions. If a process is direct-launched and not bound at launch, this
    code attempts to bind it, so processes in this scenario will instantiate the
    topology.
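
To illustrate the idea behind the shared memory change, here is a minimal sketch of answering a locality question from a string alone, without a topology object. The colon-separated format (`NM0:SK0:CR2`), the level prefixes, and the function names `loc_has_token` and `peers_share_level` are all assumptions made for this example; they are not taken from the sm or smcuda code.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical location-string format: colon-separated tokens such as
 * "NM0:SK0:CR2" (NUMA node 0, socket 0, core 2). Illustration only. */

/* Return true if `loc` contains the exact token `tok`. */
static bool loc_has_token(const char *loc, const char *tok)
{
    assert(loc != NULL && tok != NULL);
    size_t tlen = strlen(tok);
    if (tlen == 0) {
        return false;
    }
    const char *p = loc;
    while (*p != '\0') {
        const char *end = strchr(p, ':');
        size_t len = (end != NULL) ? (size_t)(end - p) : strlen(p);
        if (len == tlen && strncmp(p, tok, tlen) == 0) {
            return true;
        }
        if (end == NULL) {
            break;
        }
        p = end + 1;
    }
    return false;
}

/* Return true if both strings contain an identical token that starts with
 * `prefix` (for example "SK" to ask whether two peers share a socket). */
static bool peers_share_level(const char *mine, const char *peer,
                              const char *prefix)
{
    assert(mine != NULL && peer != NULL && prefix != NULL);
    size_t plen = strlen(prefix);
    const char *p = mine;
    while (*p != '\0') {
        const char *end = strchr(p, ':');
        size_t len = (end != NULL) ? (size_t)(end - p) : strlen(p);
        /* Only consider tokens of the form <prefix><index>. */
        if (len > plen && len < 32 && strncmp(p, prefix, plen) == 0) {
            char tok[32];
            memcpy(tok, p, len);
            tok[len] = '\0';
            if (loc_has_token(peer, tok)) {
                return true;
            }
        }
        if (end == NULL) {
            break;
        }
        p = end + 1;
    }
    return false;
}
```

With these helpers, `peers_share_level("NM0:SK0:CR2", "NM0:SK0:CR3", "SK")` reports that the two peers share a socket, and the same call with `"CR"` reports that they do not share a core. The comparison touches only two short strings, which is the kind of saving the location-string path is after.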

Note that instantiating the topology on complex chips such as KNL can consume
megabytes of memory.
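
The memory cost is why components that can avoid the topology should never trigger its creation. A generic way to get that behavior is on-demand instantiation behind an accessor, sketched below. Everything here is hypothetical (`demo_topology_t`, `demo_get_topology`, the stub loader); a real implementation would build the hwloc topology inside the loader, and this sketch is not thread-safe.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Stand-in for a large topology object; hypothetical, for illustration. */
typedef struct {
    int num_objects;
} demo_topology_t;

static demo_topology_t *demo_topology = NULL; /* NULL until first use */
static int demo_load_count = 0;               /* counts expensive loads */

/* Stub for the expensive step. Must only run when nothing is cached. */
static demo_topology_t *demo_load_topology(void)
{
    assert(demo_topology == NULL);
    demo_topology_t *t = malloc(sizeof(*t));
    if (t != NULL) {
        t->num_objects = 0;
        demo_load_count++;
    }
    return t;
}

/* Accessor: callers that need the topology go through here, so the
 * cost is paid once, and only by processes that actually ask for it. */
static demo_topology_t *demo_get_topology(void)
{
    if (demo_topology == NULL) {
        demo_topology = demo_load_topology();
    }
    return demo_topology;
}

/* Release the cached object; a later call to the accessor reloads it. */
static void demo_release_topology(void)
{
    free(demo_topology);
    demo_topology = NULL;
}
```

A component that can work from the location string never calls `demo_get_topology()`, so `demo_load_count` stays at zero in that process. A component that needs the distance matrix calls it once and reuses the cached pointer.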

Signed-off-by: Ralph Castain <rhc@open-mpi.org>

…will be used.

Fix pernode binding policy

Properly handle the unbound case

Correct pointer usage

Do not free static error messages!

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
@rhc54 rhc54 merged commit a161628 into open-mpi:master Dec 29, 2016
@rhc54 rhc54 deleted the topic/topo branch December 29, 2016 19:52