NUMA setting and affinity configuration? #451

@biocyberman

One of our software packages (CST) requires that the job running it has the correct NUMA setting. I don't really know what that means, but here is the SLURM documentation on cpu-bind and NUMA:
https://slurm.schedmd.com/mc_support.html#srun_lowlevelmc

Since the SLURM documentation has not helped me with this so far, and I have forgotten where I read the argument for task affinity, I am asking here:

  1. Why is affinity needed?
  2. What are the consequences (e.g. resource utilization, runtime) if I disable this plugin?
  3. According to the SLURM man page (man srun), some cpu-bind modes are only supported when the whole node is allocated. That is very unlikely in our case, given how fat a DGX2 node is. So, can I get the correct NUMA setting, or something similar, when I allocate only part of a node?
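
To make question 3 concrete, here is a minimal sketch for checking which CPUs and NUMA nodes a job step actually ends up on. It assumes Linux and Python 3, and it reads the standard sysfs files `/sys/devices/system/node/node*/cpulist`. The helper names `parse_cpulist` and `numa_nodes_of` are made up for illustration and are not part of SLURM or CST.

```python
import os
from pathlib import Path


def parse_cpulist(text):
    """Expand a kernel cpulist string such as "0-3,8,10-11" into a set of ints."""
    cpus = set()
    for part in text.strip().split(","):
        if not part:
            continue
        if "-" in part:
            lo, hi = part.split("-")
            cpus.update(range(int(lo), int(hi) + 1))
        else:
            cpus.add(int(part))
    return cpus


def numa_nodes_of(cpus, sysfs_root="/sys/devices/system/node"):
    """Map each NUMA node id to the sorted subset of `cpus` located on it."""
    wanted = set(cpus)
    result = {}
    for node_dir in sorted(Path(sysfs_root).glob("node[0-9]*")):
        cpulist = node_dir / "cpulist"
        if not cpulist.is_file():
            continue
        overlap = parse_cpulist(cpulist.read_text()) & wanted
        if overlap:
            # Directory names look like "node0", "node1", ...
            result[int(node_dir.name[4:])] = sorted(overlap)
    return result


if __name__ == "__main__":
    # CPUs this process is allowed to run on (Linux-only call).
    allowed = os.sched_getaffinity(0)
    print("allowed CPUs:", sorted(allowed))
    print("NUMA nodes touched:", numa_nodes_of(allowed))
```

Running this once directly on the node and once inside a job step started by srun with one of the `--cpu-bind` modes should show whether a partial allocation is confined to a single NUMA node or spread across several.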
