#5641: Fix HIP & CUDA MDRange reduce for sizeof(value_type) < sizeof(int) #5745
55fdd75 to 04594ce
I'm going to temporarily push a test change to confirm that it fails the HIP build, and then force-push it back.
HIP builds with just the assertion but no added test triggering it passed:
The added assertion does trigger with a test modified to call an offending However, it also reminds me that the guardrail may be necessary for all possible memory space template arguments of
9e40d39 to 45e33d9
Since I was typo-fixing anyway, I ran ispell over all comments and strings in that file. Everything else is fine.
Why only for LAnd? I bet all other reductions (Sum, BAnd, LOr, etc.) would fail for bool too.
I've only guarded against what's been reported to fail. If you can point me at a workable set of modules to test things on kokkos-dev-2 or wherever, I can try it out.
It looks like That said, I've reproduced the (a?) failure for
I just tested with That suggests the bug is in some sort of sizing or alignment.
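To make the "sizing" suspicion concrete, here is a minimal host-only sketch of one way a reduction can go wrong when sizeof(value_type) < sizeof(int). It is not the Kokkos implementation: `word_type`, `words_truncated`, `words_rounded_up`, and `copy_through_words` are hypothetical names, and the assumption that values are moved through scratch space in int-sized words is mine, made only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

// Hypothetical names throughout; nothing here is taken from Kokkos.
// Assumption: reduction values are copied through scratch space word by word.
using word_type = int;

// Truncating division: yields 0 words for any type smaller than word_type.
template <class T>
constexpr std::size_t words_truncated() {
  return sizeof(T) / sizeof(word_type);
}

// Rounding up: always reserves at least one word per value.
template <class T>
constexpr std::size_t words_rounded_up() {
  return (sizeof(T) + sizeof(word_type) - 1) / sizeof(word_type);
}

// Copy `src` over `initial` through word buffers, moving only `n_words` words.
template <class T>
T copy_through_words(const T& src, std::size_t n_words, T initial) {
  static_assert(sizeof(T) <= 4 * sizeof(word_type), "sketch buffer too small");
  assert(n_words <= 4);
  word_type src_words[4] = {};
  word_type dst_words[4] = {};
  std::memcpy(src_words, &src, sizeof(T));
  std::memcpy(dst_words, &initial, sizeof(T));
  // With n_words == 0 this loop never runs, so the result is never written.
  for (std::size_t i = 0; i < n_words; ++i) dst_words[i] = src_words[i];
  T out;
  std::memcpy(&out, dst_words, sizeof(T));
  return out;
}
```

With the truncating count, a bool result is never copied and the caller still sees the initial value; with the rounded-up count, the value arrives. Whether this is the actual cause of the failure discussed here is not established by this sketch.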
45e33d9 to bc4bd8a
Scope has broadened substantially
632ba08 to e6543da
I adapted the changes in #5333 from I'm committing the sloppy mess now because it's late and I want to get it out. I'll make tests for Team reductions and move them to a sensible place and maybe fix them tomorrow.
e6543da to 00f951a
The test passes on Serial but fails on HIP before the fix, and passes with the fix. I'll run clang-format, then this is ready.
LOL, OpenMPTarget fails this test too!
368a328 to ef07610
Per @rgayatri23 I'm going to skip the test of OpenMPTarget for now.
c636371 to 05116a0
9d71460 to 9e994a0
Rebased on
Fixes #5641