Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable custom all-reduce for ROCm. #34

Closed
wants to merge 2 commits into from

Conversation

iotamudelta
Copy link

  • add csrc custom_all_reduce into compilation
  • add ops and pybindings
  • do not disable custom all-reduce on ROCm in config
  • port the custom all reduce source and test to HIP
  • remove volatile signatures on ROCm
  • optimized locking for ROCm
  • increase default #threads and decrease #blocks after testing

* add csrc custom_all_reduce into compilation
* add ops and pybindings
* do not disable custom all-reduce on ROCm in config
* port the custom all reduce source and test to HIP
* remove volatile signatures on ROCm
* optimized locking for ROCm
* increase default #threads and decrease #blocks after testing
csrc/custom_all_reduce.cu Outdated Show resolved Hide resolved
csrc/custom_all_reduce.cu Outdated Show resolved Hide resolved
@hongxiayang
Copy link

there is another pull request #27 for the same purpose.

@mawong-amd
Copy link

In light of #27 being merged, am closing this PR. If there are other components that you'd like to bring in here, please open a new PR with those changes.

@mawong-amd mawong-amd closed this Jun 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
4 participants