Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve progress engine binding #83

Merged
merged 3 commits into from
Nov 4, 2020
Merged

Improve progress engine binding #83

merged 3 commits into from
Nov 4, 2020

Conversation

ndryden
Copy link
Collaborator

@ndryden ndryden commented Oct 19, 2020

Requires #82.

Closes #45. Closes #64.

When using CUDA, this should now automatically detect cores close to the GPU device being used. Otherwise this uses similar behavior as before to identify the NUMA node.

(If you've built Aluminum with CUDA support but for some reason are not wanting to use CUDA, there isn't a good way to identify that, and we don't handle that case.)

This should result in more reliable progress engine binding without needing to mess with the AL_PROGRESS_CORE or AL_PROGRESS_RANKS_PER_NUMA_NODE environment variables (which no longer do anything).

@ndryden ndryden added the enhancement New feature or request label Oct 19, 2020
@ndryden ndryden requested a review from benson31 October 19, 2020 20:11
Copy link
Collaborator

@benson31 benson31 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For CUDA and as a general structure, this is a strict improvement. I'm having binding issues with HIP, but the code at least runs without crashing. I think this can merge while we investigate the HIP case separately.

@ndryden
Copy link
Collaborator Author

ndryden commented Nov 4, 2020

We can track the improved AMD support in #85.

@ndryden ndryden merged commit 87bc552 into master Nov 4, 2020
@ndryden ndryden deleted the pe-binding branch November 4, 2020 02:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Improve progress thread binding Could not get NUMA node
2 participants