Improve SYCL reduction performance: RangePolicy #6264

masterleinad · 2023-07-06T14:12:33Z

Part of #6035.
This limits the number of workgroups, effectively processing multiple work items per thread, and increases the maximum workgroup size for Intel GPUs. Technically, sycl::info::kernel_device_specific::work_group_size should be the maximum usable value for the workgroup size but it turns out I could still choose sycl::info::device::max_work_group_size and got better performance.

Also, fix matching use_shuffle_based_algorithm(is the reference not a pointer) with ReducerType::static_value_size() (0 for array reductions).

masterleinad · 2023-07-06T22:55:13Z

Only HIP-ROCm-5.2-C++20 is timing out. Everything else is passing.

core/src/SYCL/Kokkos_SYCL_Parallel_Reduce.hpp

Co-authored-by: Damien L-G <dalg24+github@gmail.com>

masterleinad · 2023-07-17T17:48:34Z

SYCL CI is passing.

dalg24 · 2023-07-17T18:13:30Z

Unrelated failure to launch one CUDA build

Improve SYCL reduction performance: RangePolicy

66133d9

masterleinad marked this pull request as ready for review July 6, 2023 18:35

masterleinad mentioned this pull request Jul 12, 2023

Split Kokkos_SYCL_Parallel* #6267

Merged

masterleinad requested a review from nliber July 12, 2023 19:05

nliber approved these changes Jul 12, 2023

View reviewed changes

dalg24 reviewed Jul 13, 2023

View reviewed changes

core/src/SYCL/Kokkos_SYCL_Parallel_Reduce.hpp Show resolved Hide resolved

core/src/SYCL/Kokkos_SYCL_Parallel_Reduce.hpp Outdated Show resolved Hide resolved

core/src/SYCL/Kokkos_SYCL_Parallel_Reduce.hpp Outdated Show resolved Hide resolved

Improve comments on workgroup size deduction

c9573a6

masterleinad force-pushed the improve_reduction_performance_sycl_1 branch from 9ac49e9 to c9573a6 Compare July 13, 2023 19:31

dalg24 approved these changes Jul 17, 2023

View reviewed changes

core/src/SYCL/Kokkos_SYCL_Parallel_Reduce.hpp Outdated Show resolved Hide resolved

Fix typo

ab2e4f5

Co-authored-by: Damien L-G <dalg24+github@gmail.com>

dalg24 merged commit 933d23b into kokkos:develop Jul 17, 2023
27 of 28 checks passed

masterleinad mentioned this pull request Jul 27, 2023

Improve SYCL reduction performance #6035

Closed

crtrott mentioned this pull request Aug 25, 2023

CHANGELOG: 4.2.0 #6197

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve SYCL reduction performance: RangePolicy #6264

Improve SYCL reduction performance: RangePolicy #6264

masterleinad commented Jul 6, 2023 •

edited

masterleinad commented Jul 6, 2023 •

edited

masterleinad commented Jul 17, 2023

dalg24 commented Jul 17, 2023

Improve SYCL reduction performance: RangePolicy #6264

Improve SYCL reduction performance: RangePolicy #6264

Conversation

masterleinad commented Jul 6, 2023 • edited

masterleinad commented Jul 6, 2023 • edited

masterleinad commented Jul 17, 2023

dalg24 commented Jul 17, 2023

masterleinad commented Jul 6, 2023 •

edited

masterleinad commented Jul 6, 2023 •

edited