Initial implementation of gfx942 #6358
Conversation
Change-Id: Id31ca3ba5356d021cade2abc3e3f51f9f3b4d211
Change-Id: I1454bb0b91518bfcf7a04506e40b98387cdf8ed9
Change-Id: Id9c03fe451d1d28a3c23a77f161a2600f016c7e4
Co-authored-by: Daniel Arndt <arndtd@ornl.gov>
Co-authored-by: Damien L-G <dalg24+github@gmail.com>
Can you say a word about the thread fences that are being added?
Technically, this was a violation of the memory model: there was no guarantee that the writes of the intermediate reduction values became visible before the last block read them in the second stage. It never bit us because the reordering was quite unlikely on the hardware we currently run on, but that may not always hold. I've tested correctness and performance of several LAMMPS benchmarks, a regular dot product, and the yAx tutorial example on MI-250, and saw essentially no impact from unconditionally including the fence.
Change-Id: Ibd028fddeedf8e0fdda50b72625ab62cee6fa71e
CUDA failure unrelated.
No description provided.