Skip to content

Conversation

@miaoneng
Copy link
Contributor

Use ROCm dialect of __shfl_* intrinsic functions, which isn't automatically converted during hipify.

The unit test test_matmul.py passes most of the time, occasionally it fails as error exceeds default atol, which can happen on both CPU and GPU. Benchmark passes without issue.

@codecov-commenter
Copy link

codecov-commenter commented Nov 28, 2022

Codecov Report

Merging #296 (2803daa) into master (1bf1276) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master     #296   +/-   ##
=======================================
  Coverage   71.29%   71.29%           
=======================================
  Files          25       25           
  Lines        1066     1066           
=======================================
  Hits          760      760           
  Misses        306      306           

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@rusty1s rusty1s added the enhancement New feature or request label Nov 28, 2022
@rusty1s rusty1s changed the title Use macro for __shfl_* functions for ROCm Use macro for __shfl_* functions for ROCm Nov 28, 2022
@rusty1s rusty1s merged commit c86d777 into rusty1s:master Nov 28, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants