Lock constant memory in Cuda/HIP kernel launch with a mutex #4525

masterleinad · 2021-11-11T21:25:40Z

The constant memory used for kernel launches is shared between all Cuda/HIP execution space instances. We already guard the access to make sure that the last kernel using the constant memory has finished before copying the next one. This doesn't prevent (different) kernels submitted from independent threads to be copied to the constant memory in a conflict way. Note that we already have a mutex for the case that the same kernel is submitted from different threads.

This pull request adds another mutex that guards access to the constant memory globally.

Lock constant memory in Cuda/HIP kernel launch with a mutex

3d6b036

masterleinad force-pushed the fix_constant_memory_launch_cuda_hip branch from 369193b to 3d6b036 Compare November 11, 2021 22:30

masterleinad requested review from crtrott and Rombur November 11, 2021 22:56

masterleinad marked this pull request as ready for review November 11, 2021 22:57

crtrott approved these changes Nov 18, 2021

View reviewed changes

crtrott merged commit 1151436 into kokkos:develop Nov 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lock constant memory in Cuda/HIP kernel launch with a mutex #4525

Lock constant memory in Cuda/HIP kernel launch with a mutex #4525

masterleinad commented Nov 11, 2021

Lock constant memory in Cuda/HIP kernel launch with a mutex #4525

Lock constant memory in Cuda/HIP kernel launch with a mutex #4525

Conversation

masterleinad commented Nov 11, 2021