-
Notifications
You must be signed in to change notification settings - Fork 407
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Triple nested parallelism still fails on bowman #1093
Comments
This is definitely our big white whale ... |
Suspect fixed by #1240 |
This failed again on KNL .... |
Our arch-enemy has returned... |
Looking into this once more. Main issue is with pthreads. I couldn't reproduce it with OpenMP yet. This code also fails (very occasionally, like 1 in a 1000 or so): Kokkos::parallel_reduce(
team_policy(nrows / chunk_size, team_size, vector_length),
KOKKOS_LAMBDA(const member_type &teamMember, double &update) {
const int row_start = teamMember.league_rank() * chunk_size;
const int row_end = row_start + chunk_size;
Kokkos::parallel_for(
Kokkos::TeamThreadRange(teamMember, row_start, row_end),
[&](const int i) {
ScalarType sum_i = 0.0;
#if 1
for(unsigned int j=0; j<ncols; j++) sum_i+=A(i,j)*x(j);
#else
Kokkos::parallel_reduce(
Kokkos::ThreadVectorRange(teamMember, ncols),
[&](const int j, ScalarType &innerUpdate) {
innerUpdate += A(i, j) * x(j);
},
sum_i);
#endif
//Kokkos::atomic_add(&control(),sum_i);
Kokkos::atomic_add(&count(),1l);
Kokkos::single(Kokkos::PerThread(teamMember),
[&]() { update += y(i) * sum_i; });
});
},
result);
Kokkos::fence(); When this fails |
Recent failure with Pthreads on Bowman, intel/compilers/17.2.174 Failure output:
|
https://jenkins-son.sandia.gov/job/Kokkos_SLURM_inner_test/6885/console
The text was updated successfully, but these errors were encountered: