Update SYCL UniqueToken avoiding bitset #4748

masterleinad · 2022-02-02T21:55:38Z

This corresponds to #4741. I see similar speed-ups.

ajpowelsnl · 2022-02-07T18:38:07Z

Hi @masterleinad -- A couple of questions: 1) will this PR be in the 3.6 release? 2) if it will be in 3.6, is it still a blocker?

masterleinad · 2022-02-07T21:42:28Z

@ajpowelsnl Is there a reason to remove the BlocksPromotion label here if #4741 still has it?

masterleinad · 2022-02-07T21:43:10Z

Hi @masterleinad -- A couple of questions: 1) will this PR be in the 3.6 release? 2) if it will be in 3.6, is it still a blocker?

I think we should treat it the same as #4741.

ajpowelsnl · 2022-02-07T22:02:49Z

Hi @masterleinad -- A couple of questions: 1) will this PR be in the 3.6 release? 2) if it will be in 3.6, is it still a blocker?

I think we should treat it the same as #4741.

Yes, I just spoke with Christian about it, and the unique token stuff will go in Kokkos 3.6 (including the work in this PR). Thanks for responding!

masterleinad · 2022-02-11T14:24:45Z

Retest this please.

masterleinad · 2022-02-11T15:57:36Z

Retest this please.

masterleinad · 2022-02-11T19:02:25Z

OpenMPTarget failing with an internal compiler error is clearly unrelated,

dalg24

Will approve once we clarify the question about the UniqueToken<SYCL, UniqueTokenScope::Instance> constructors.

core/src/SYCL/Kokkos_SYCL_Instance.cpp

dalg24 · 2022-02-12T03:28:25Z

core/src/SYCL/Kokkos_SYCL_UniqueToken.hpp

+    int idx = (blockIdx[0] * (blockDim[0] * blockDim[1]) +
+               threadIdx[1] * blockDim[0] + threadIdx[0]) %
+              size();


I would prefer if you split the expression in two and had idx = idx % size(); like we do in CUDA

dalg24 · 2022-02-12T03:32:48Z

core/src/SYCL/Kokkos_SYCL_UniqueToken.hpp

+  UniqueToken(size_type max_size,
+              execution_space const& arg = execution_space())
+      : UniqueToken<SYCL, UniqueTokenScope::Global>(max_size, arg) {}


Should this one be declared explicit too? ( I know we didn't in CUDA)
Also why is it OK to only have two constructors when we had 4 in CUDA?

Cuda has

explicit UniqueToken() : UniqueToken<Cuda, UniqueTokenScope::Global>( Kokkos::Cuda().concurrency()) {} explicit UniqueToken(execution_space const& arg) : UniqueToken<Cuda, UniqueTokenScope::Global>( Kokkos::Cuda().concurrency(), arg) {} UniqueToken(size_type max_size) : UniqueToken<Cuda, UniqueTokenScope::Global>(max_size) {} UniqueToken(size_type max_size, execution_space const& arg = execution_space()) : UniqueToken<Cuda, UniqueTokenScope::Global>(max_size, arg) {}

In the current form, the first two Cuda constructors are the same as the first one here and the last two Cuda constructors are the same as the second one here. In fact, the third constructor in Cuda (and HIP) should just be removed since trying to use is would be ambiguous taking the last constructor into account.

Of course, we could argue about not using default arguments to be able to only mark the constructors taking one argument as explicit. In the end, that only matters for copy initialization with an initializer list. I agree, though, that we should make all specializations for UniqueToken have the same interface.

masterleinad · 2022-02-12T21:50:59Z

Again, only OpenMPTarget is failing with an internal compiler error.

core/src/SYCL/Kokkos_SYCL_UniqueToken.hpp

…tructor

masterleinad · 2022-02-14T18:25:22Z

Retest this please.

masterleinad added the Blocks Promotion Overview issue for release-blocking bugs label Feb 2, 2022

ajpowelsnl assigned masterleinad Feb 7, 2022

ajpowelsnl added this to In progress in Kokkos Release 3.7 -- 2022 Target Date via automation Feb 7, 2022

ajpowelsnl removed the Blocks Promotion Overview issue for release-blocking bugs label Feb 7, 2022

masterleinad added the Blocks Promotion Overview issue for release-blocking bugs label Feb 9, 2022

masterleinad removed this from In progress in Kokkos Release 3.7 -- 2022 Target Date Feb 9, 2022

masterleinad added this to In progress in Kokkos Release 3.6 via automation Feb 9, 2022

masterleinad moved this from In progress to Awaiting Feedback in Kokkos Release 3.6 Feb 9, 2022

masterleinad force-pushed the sycl-unique-token-improvement branch from 067857a to cab7f79 Compare February 10, 2022 16:28

dalg24 reviewed Feb 12, 2022

View reviewed changes

dalg24 requested a review from crtrott February 12, 2022 03:33

masterleinad added 7 commits February 12, 2022 10:37

Update SYCL UniqueToken avoiding bitset

a7b4dc0

Use given execution space for View initialization

0bc7c7f

Remove unused m_scratchConcurrentBitset

73b0c43

Update the SYCL UniqueToken implementation to use a static lock array

3983126

Avoid unused parameter warning

c4622d7

Address Damien's comments

f1724be

Unify imterfaces between Cuda, HIP and SYCL

38a2deb

masterleinad force-pushed the sycl-unique-token-improvement branch from bd82bee to 38a2deb Compare February 12, 2022 15:44

crtrott requested changes Feb 14, 2022

View reviewed changes

core/src/SYCL/Kokkos_SYCL_UniqueToken.hpp Outdated Show resolved Hide resolved

Don't provide execution space in SYCL UniqueTokenScope::INstance cons…

1ebc774

…tructor

masterleinad requested a review from crtrott February 14, 2022 16:06

crtrott approved these changes Feb 14, 2022

View reviewed changes

masterleinad requested a review from dalg24 February 14, 2022 17:33

dalg24 approved these changes Feb 14, 2022

View reviewed changes

dalg24 merged commit dd709bd into kokkos:develop Feb 15, 2022

Kokkos Release 3.6 automation moved this from Awaiting Feedback to Done Feb 15, 2022

dalg24 removed the Blocks Promotion Overview issue for release-blocking bugs label Feb 15, 2022

masterleinad deleted the sycl-unique-token-improvement branch February 15, 2022 15:15

ajpowelsnl added the InDevelop Enhancement, fix, etc. has been merged into the develop branch; label Feb 15, 2022

masterleinad mentioned this pull request Aug 1, 2022

Avoid allocating memory for UniqueToken #5300

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update SYCL UniqueToken avoiding bitset #4748

Update SYCL UniqueToken avoiding bitset #4748

masterleinad commented Feb 2, 2022 •

edited

ajpowelsnl commented Feb 7, 2022

masterleinad commented Feb 7, 2022

masterleinad commented Feb 7, 2022

ajpowelsnl commented Feb 7, 2022

masterleinad commented Feb 11, 2022

masterleinad commented Feb 11, 2022

masterleinad commented Feb 11, 2022

dalg24 left a comment

dalg24 Feb 12, 2022

dalg24 Feb 12, 2022

masterleinad Feb 12, 2022

masterleinad commented Feb 12, 2022

masterleinad commented Feb 14, 2022

Update SYCL UniqueToken avoiding bitset #4748

Update SYCL UniqueToken avoiding bitset #4748

Conversation

masterleinad commented Feb 2, 2022 • edited

ajpowelsnl commented Feb 7, 2022

masterleinad commented Feb 7, 2022

masterleinad commented Feb 7, 2022

ajpowelsnl commented Feb 7, 2022

masterleinad commented Feb 11, 2022

masterleinad commented Feb 11, 2022

masterleinad commented Feb 11, 2022

dalg24 left a comment

Choose a reason for hiding this comment

dalg24 Feb 12, 2022

Choose a reason for hiding this comment

dalg24 Feb 12, 2022

Choose a reason for hiding this comment

masterleinad Feb 12, 2022

Choose a reason for hiding this comment

masterleinad commented Feb 12, 2022

masterleinad commented Feb 14, 2022

masterleinad commented Feb 2, 2022 •

edited