[HIP] Lock access to scratch memory when using Teams #3916

Rombur · 2021-04-01T18:18:12Z

There is potentially a problem if multiple threads launch a parallel_for or a parallel_reduce kernel on the same stream and use Teams. One parallel_for or parallel_reduce may try to reallocate the scratch memory while it is still being used somewhere else. This PR uses a mutex to ensure that only one Team parallel_for or parallel_reduce is running for a given instance at a time. I am open to suggestions if someone has a better solution.

Note that CUDA has the same problem.
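As a rough illustration of the approach described above, the sketch below holds a per-instance mutex for the whole lifetime of a kernel driver object, so that a second driver cannot resize the scratch buffer while the first one still uses it. This is not the Kokkos implementation: `ScratchSpace`, `TeamKernel`, `resize`, and `holds_scratch_lock` are hypothetical names, and host `malloc`/`free` stand in for the device allocation.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <mutex>

// Hypothetical stand-in for the scratch memory owned by one execution
// space instance. Not Kokkos code.
struct ScratchSpace {
  std::mutex mutex;      // guards ptr and size
  void* ptr = nullptr;
  std::size_t size = 0;

  // Grow the buffer if the request exceeds the current size. Old contents
  // are discarded. The caller must already hold `mutex`.
  void* resize(std::size_t requested) {
    if (requested > size) {
      std::free(ptr);
      ptr = std::malloc(requested);
      assert(ptr != nullptr);
      size = requested;
    }
    return ptr;
  }

  ~ScratchSpace() { std::free(ptr); }
};

// Hypothetical kernel driver. The lock is acquired in the constructor and
// released in the destructor, mirroring the comment in the diff.
class TeamKernel {
 public:
  TeamKernel(ScratchSpace& space, std::size_t scratch_bytes)
      : m_scratch_lock(space.mutex),
        m_scratch_ptr(space.resize(scratch_bytes)) {}

  void* scratch() const { return m_scratch_ptr; }
  bool holds_scratch_lock() const { return m_scratch_lock.owns_lock(); }

 private:
  // Declared before m_scratch_ptr so that the mutex is locked before
  // resize() runs (members are initialized in declaration order).
  std::unique_lock<std::mutex> m_scratch_lock;
  void* m_scratch_ptr;
};
```

In this sketch a second `TeamKernel` constructed on the same `ScratchSpace` from another thread blocks until the first one is destroyed, which serializes the kernels rather than letting them share the buffer.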

dalg24 · 2021-04-01T19:09:12Z

core/src/HIP/Kokkos_HIP_Parallel_Team.hpp

@@ -433,6 +433,9 @@ class ParallelFor<FunctorType, Kokkos::TeamPolicy<Properties...>,
  int m_shmem_size;
  void* m_scratch_ptr[2];
  int m_scratch_size[2];
+  // Only let one ParallelFor/Reduce modify the team scratch memory. The
+  // constructor acquires the mutex which is released in the destructor.
+  std::unique_lock<std::mutex> m_scratch_lock;


std::lock_guard is non-copyable. This means you implicitly deleted the copy constructor and copy assignment. Was it intentional? Did you make sure it plays well with the kernel launching?

Rombur · 2021-04-01

> This means you implicitly deleted the copy constructor and copy assignment.

True. At the same time, if you use the copy constructor as it currently stands, you copy the pointers without copying the data they point to. I also don't see what your use case for copying would be. Note that I am using std::unique_lock, not std::lock_guard. Unlike std::lock_guard, std::unique_lock is movable.

> Did you make sure it plays well with the kernel launching?

I am not sure what you mean by that, but all the tests pass on Tulip and on the CI.

dalg24 · 2021-04-01

I missed that it was std::unique_lock and not std::lock_guard. I can't remember how we pass the driver around in the kernel launching. This is not a trick question; I was just curious whether you had considered it.
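The copy and move properties discussed in this thread can be checked at compile time. The sketch below is only an illustration: `Driver` is a hypothetical stand-in for a class holding a `std::unique_lock` member, not the actual ParallelFor specialization, and `lock_follows_move` is a helper introduced here.

```cpp
#include <cassert>
#include <mutex>
#include <type_traits>
#include <utility>

// std::lock_guard can be neither copied nor moved.
static_assert(!std::is_copy_constructible<std::lock_guard<std::mutex>>::value, "");
static_assert(!std::is_move_constructible<std::lock_guard<std::mutex>>::value, "");

// std::unique_lock cannot be copied, but it can be moved.
static_assert(!std::is_copy_constructible<std::unique_lock<std::mutex>>::value, "");
static_assert(std::is_move_constructible<std::unique_lock<std::mutex>>::value, "");

// Hypothetical driver-like class: a unique_lock member makes the implicit
// copy operations deleted while the implicit move operations remain.
struct Driver {
  std::unique_lock<std::mutex> lock;
  explicit Driver(std::mutex& m) : lock(m) {}
};

static_assert(!std::is_copy_constructible<Driver>::value, "");
static_assert(!std::is_copy_assignable<Driver>::value, "");
static_assert(std::is_move_constructible<Driver>::value, "");

// Moving a Driver transfers ownership of the lock to the new object.
inline bool lock_follows_move(std::mutex& m) {
  Driver a(m);
  Driver b(std::move(a));
  assert(b.lock.owns_lock());
  return !a.lock.owns_lock() && b.lock.owns_lock();
}
```

Any code path that copies such an object would fail to compile, whereas passing it by reference or moving it keeps working.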

masterleinad

Looks OK to me! It would be good if we finally have some tests for the cases we try to cover here, though.

Lock access to scratch memory when using Teams

9274703

dalg24 reviewed Apr 1, 2021

dalg24 requested review from dhollman and crtrott April 1, 2021 19:09

masterleinad approved these changes Apr 6, 2021

crtrott approved these changes Apr 7, 2021

dalg24 merged commit 5bd55d2 into kokkos:develop Apr 7, 2021

Rombur deleted the multithreading_2 branch June 8, 2021 12:37

This was referenced Oct 15, 2021

SYCL fix thread-safety #4408

Merged

Replace std::unique_lock by std::lock_guard in HIP and SYCL #4416

Merged
