hpcbind: check for correct Slurm variable #6116

Merged (1 commit) on May 10, 2023

Conversation

rbberger (Contributor) commented May 9, 2023

hpcbind was checking for SLURM_LOCAL_ID, which does not exist. As a result, it always fell back to HPCBIND_QUEUE_RANK=0 (e.g., so that only GPU 0 was used).

The correct variable name is SLURM_LOCALID. See https://slurm.schedmd.com/srun.html#SECTION_OUTPUT-ENVIRONMENT-VARIABLES
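
To illustrate why the misspelled name went unnoticed, here is a minimal sketch of the lookup pattern. It is not the actual hpcbind source: the helper names `detect_local_rank` and `pick_gpu`, the round-robin GPU mapping, and the GPU count of 4 are all assumptions made for illustration. Only SLURM_LOCALID and the fallback to rank 0 come from the description above.

```shell
#!/bin/sh
# Illustrative sketch only; NOT the actual hpcbind source.

# Print the node-local rank of this task (hypothetical helper).
# srun exports SLURM_LOCALID, with no underscore between LOCAL and ID.
detect_local_rank() {
  if [ -n "${SLURM_LOCALID:-}" ]; then
    echo "${SLURM_LOCALID}"
  else
    # Fallback when the variable is unset. A misspelled variable name
    # always ends up here, so every task silently reports rank 0.
    echo 0
  fi
}

# Assumed round-robin mapping of local ranks onto GPUs (hypothetical helper).
pick_gpu() {
  local_rank=$1
  num_gpus=$2
  echo $(( local_rank % num_gpus ))
}

rank=$(detect_local_rank)
# The GPU count of 4 is an arbitrary example value.
echo "local rank ${rank} -> GPU $(pick_gpu "${rank}" 4)"
```

Because an unset variable expands to an empty string rather than raising an error, a typo in the name falls through to the default without any diagnostic. Under the assumed mapping, every task on a node would then select the same GPU.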

dalg24-jenkins (Collaborator) commented

Can one of the admins verify this patch?

dalg24 (Member) commented May 9, 2023

(no need to trigger tests on the ORNL Jenkins server for this patch)

dalg24 merged commit f15b5ab into kokkos:develop on May 10, 2023
27 checks passed
dalg24 added the Bug label (Broken / incorrect code; it could be Kokkos' responsibility, or others’ (e.g., Trilinos)) on May 10, 2023
dalg24 mentioned this pull request on May 10, 2023
Labels
Bug Broken / incorrect code; it could be Kokkos' responsibility, or others’ (e.g., Trilinos)