
Initial HIP backend support #3

Merged
merged 20 commits into develop on Jan 3, 2024

Conversation


@rcarson3 rcarson3 commented Oct 1, 2020

Draft proposal to add HIP support. This PR is currently just a way to easily keep track of what's changed while adding the new HIP backend.

@rcarson3 rcarson3 added the WIP label Oct 1, 2020

rcarson3 commented Jan 4, 2022

Just so I don't forget: before merging, address #10 in here as well.

Comment on lines 51 to 54
#if (defined(__CUDA_ARCH__) && (__CUDA_ARCH__ > 0)) || defined(__HIP_DEVICE_COMPILE__)
#define __cuda_device_only__
#else
#define __cuda_host_only__
#endif
Member Author

So, we should probably move to something more generic here, like __gpu_device_only__ and __gpu_host_only__, and maybe add the snls name somewhere in there as well, just to avoid name clashing with other people's macros.

Member Author

Along similar lines, it would also be good to update the device forall portion of the code.

@rcarson3 rcarson3 marked this pull request as ready for review June 1, 2022 21:07
Lots of different parts of the library referenced either CUDA or HIP.
Since we're seeing more and more GPU vendors come online, it made more sense to generalize things to be called gpu where possible.
As part of this work, I renamed a number of the macros so that they would use SNLS in the name to avoid name clashing with other codes.
@rcarson3 rcarson3 removed the WIP label Aug 17, 2023

rcarson3 commented Dec 6, 2023

Note: the GPU single point test still has some CUDA-specific stuff in it. I'm leaving it in there for now, given it's being completely ripped out in #14.

cmake/thirdpartylibraries/FindRAJA.cmake (outdated, resolved)
src/SNLS_TrDLDenseG_Batch.h (resolved)
src/SNLS_gpu_portability.h (resolved)

rcarson3 commented Jan 3, 2024

@gberg617 made a few small updates:

1. Include Alan's bug fixes related to the device class, so they wouldn't sit in the other branch/PR for too long.
2. Simplify some of the CUDA/HIP kernel logic in places, so we only have one forall call rather than two different ones for the two execution types.
3. Some minor fixes to tests that I noticed when testing this on vernal with the RAJA Portability Suite enabled, which surfaced compilation cases I hadn't run across before.

@rcarson3 rcarson3 merged commit 1064940 into develop Jan 3, 2024
3 participants