parallel_for with TeamPolicy::team_size_recommended with launch bounds not working -- reported by Daniel Holladay #1283
Labels
Bug
Broken / incorrect code; it could be Kokkos' responsibility, or others’ (e.g., Trilinos)
Milestone
On the CUDA backend, I need to make use of launch bounds to get rid of compile time errors regarding regcount. After this change, when the
team_policy
is constructed:The
parallel_for
with thisteam_policy
does not execute. It does execute with this policy:The text was updated successfully, but these errors were encountered: