Skip to content

Conversation

@jessebenson
Copy link
Member

Description:
CUDA reduction code went through substantial changes after the initial ROCM code was developed, and while MIOpen was still in early development.

This change updates the ROCM-specific reduction code to more closely match the current CUDA code. MIOpen is missing some functionality (no double support, no support for reduce Avg, no support for reduce L1/L2 norm), so there are still code differences.

@jessebenson jessebenson requested a review from a team as a code owner January 19, 2021 22:01
@jessebenson jessebenson force-pushed the jesseb/rocm-reduction branch from 45c066e to 32d02ab Compare January 26, 2021 17:11
@jessebenson jessebenson force-pushed the jesseb/rocm-reduction branch from 32d02ab to c96489f Compare February 3, 2021 19:32
weixingzhang
weixingzhang previously approved these changes Feb 4, 2021
@jessebenson jessebenson merged commit d914e29 into master Feb 4, 2021
@jessebenson jessebenson deleted the jesseb/rocm-reduction branch February 4, 2021 23:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants