-
Notifications
You must be signed in to change notification settings - Fork 407
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reductions into device side view #1788
Comments
Both the |
This feels like it should be marked "defect" and not "enhancement"; I'm not sure I understand why this doesn't/isn't supposed to work now. Also, the For instance, something like auto stream_reducer = Kokkos::Sum<int64_t,Kokkos::DefaultStreamExecutionSpace>();
Kokkos::parallel_reduce(N, KOKKOS_LAMBDA (const int& i, int64_t& lsum) {
lsum += 1;
},stream_reducer);
// wraps cudaStreamAddCallback:
stream_reducer.when_ready([&](int64_t& result) { reducer_result = result; });
// wraps cudaStreamSynchronize:
Kokkos::fence(stream); |
No the fence is not meant for memory. The fence is meant for timing purposes to figure out whether the call was asynchronous. |
I marked it enhancement because we
|
Pull request #1825 |
This doesn't work right now, but we should make this work. Would also allow asynchronous reductions.
Test case:
The text was updated successfully, but these errors were encountered: