-
Notifications
You must be signed in to change notification settings - Fork 407
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement parallel_scan with ThreadVectorRange and Reducer #3861
Conversation
70d6815
to
6b3d55f
Compare
This doesn't include the HIP and/or task backend
6b3d55f
to
ae5c3cc
Compare
@Char-Aznable Can you please check if this works for you? |
@masterleinad Thank you for working on this. This works great as far as the tests go. |
No, I didn't benchmark anything but I wouldn't expect a big difference in performance between the old and the new implementation. |
A rebased version of #3602 also including a SYCL implementation and adding the test.