Remove volatile from atomics #1672

adayton1 · 2024-06-14T18:47:45Z

Summary

This PR is a refactoring. It does the following:
- Modifies hip and cuda generic atomic compare and swap algorithms to use atomic loads instead of relying on volatile
- Re-implements atomic loads in terms of builtin atomics for cuda and hip (so that the generic compare and swap functions can use it)
- Removes volatile qualifier in atomic function signatures
- Uses cuda::atomic_ref in newer versions of CUDA to back atomicLoad/Store
- Uses atomicAdd as a fallback for atomicSub in CUDA
- Refactors CUDA atomics to reduce duplicate code (now more closely matches the Hip atomic implementations)
- Removes checks where __CUDA_ARCH__ is less than 350 since RAJA requires that as the minimum supported architecture anyway
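
For readers unfamiliar with the pattern, the first and fifth bullets can be sketched on the host with `std::atomic`. This is only an analogue under stated assumptions, not the RAJA code: the helper names `atomic_apply_cas` and `atomic_sub_via_add` are hypothetical, and `std::atomic` stands in for the CUDA/HIP intrinsics. The point is that the compare-and-swap loop is seeded by an atomic load rather than a read through a `volatile` pointer, and that subtraction can be expressed as addition of the negated operand.

```cpp
#include <atomic>
#include <cassert>
#include <type_traits>

// Hypothetical helper: apply `op` atomically using only an atomic load and
// compare-and-swap. Returns the value observed before the update.
template <typename T, typename Op>
T atomic_apply_cas(std::atomic<T>& target, Op op) {
  // Seed the loop with an atomic load (no volatile read involved).
  T expected = target.load();
  // On failure, compare_exchange_weak refreshes `expected` with the current
  // value, so the next iteration recomputes op(expected) from fresh data.
  while (!target.compare_exchange_weak(expected, op(expected))) {
  }
  return expected;
}

// Hypothetical helper: subtract by adding the negated operand. Restricted to
// unsigned types here so that the negation is well-defined modular arithmetic.
template <typename T>
T atomic_sub_via_add(std::atomic<T>& target, T value) {
  static_assert(std::is_unsigned<T>::value,
                "sketch only covers unsigned integer types");
  return target.fetch_add(static_cast<T>(T(0) - value));
}
```

Both helpers return the previous value, matching the convention of the CUDA and HIP atomic functions.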

adayton1 · 2024-06-15T00:46:23Z

@MrBurmark, would you be willing to look at the changes to the HIP atomics? The previous implementation of atomicCAS relied on volatile, so I changed it to use an atomic load. To avoid a circular dependency, I changed the implementation of atomicLoad to use the intrinsic if available; otherwise it falls back to atomicOr(address, 0). Does that make sense, or would it be better to fall back to atomicCAS(address, 0, 0) or something else? I think atomicOr will be better than atomicAdd, but I'm not sure whether atomicCAS can avoid the write in some cases. I also modified atomicExchange to use reinterpret casting instead of atomicCAS in a loop, then made atomicStore use atomicExchange if the intrinsic is not available.
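
The fallbacks described above can be illustrated on the host. This is a hedged sketch, not the HIP implementation: the function names are hypothetical, `std::atomic` stands in for the device intrinsics, and `std::memcpy` replaces the reinterpret cast so that the host code has no aliasing problems.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>
#include <cstring>

// Load fallback: OR-ing with zero leaves the stored value unchanged and
// returns it, so it behaves as an atomic read.
template <typename T>
T load_via_or(std::atomic<T>& target) {
  return target.fetch_or(T(0));
}

// Store fallback: an exchange whose result is discarded.
template <typename T>
void store_via_exchange(std::atomic<T>& target, T value) {
  (void)target.exchange(value);
}

// Exchange for a float held in 32-bit integer storage: move the bit pattern
// into an integer, exchange the integer, and convert the old bits back.
float exchange_float_bits(std::atomic<std::uint32_t>& storage, float value) {
  static_assert(sizeof(float) == sizeof(std::uint32_t),
                "sketch assumes a 32-bit float");
  std::uint32_t bits;
  std::memcpy(&bits, &value, sizeof(bits));
  const std::uint32_t old_bits = storage.exchange(bits);
  float old_value;
  std::memcpy(&old_value, &old_bits, sizeof(old_value));
  return old_value;
}
```

Because the store is built on exchange and the load on OR, neither depends on compare-and-swap, which is what breaks the circular dependency mentioned in the comment.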

If these changes make sense, then I will do basically the same thing in CUDA and then we can fully get rid of volatile.

MrBurmark · 2024-06-17T15:16:04Z

It's best to avoid using atomicCAS(0,0) where possible, as it's slower than doing something like atomicOr(0) or atomicAdd(0). Between atomicOr(0) and atomicAdd(0), I don't know which is faster; maybe #1624 could provide some insight?
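
For reference, the compare-and-swap and add idioms compared here return the same value and differ only in cost. A minimal host-side sketch follows, with hypothetical function names and `std::atomic` standing in for the device intrinsics; it says nothing about relative speed on a GPU.

```cpp
#include <atomic>
#include <cassert>

// Read via compare-and-swap with expected == desired == 0. If the stored
// value is 0 it is rewritten as 0; otherwise the exchange fails and
// `expected` receives the current value. Either way `expected` ends up
// holding what was stored.
template <typename T>
T read_via_cas(std::atomic<T>& target) {
  T expected = T(0);
  target.compare_exchange_strong(expected, T(0));
  return expected;
}

// Read via fetch-and-add of zero: the stored value is unchanged and returned.
template <typename T>
T read_via_add(std::atomic<T>& target) {
  return target.fetch_add(T(0));
}
```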


adayton1 · 2024-06-18T20:20:32Z

I'll be out most of this afternoon and all day tomorrow, so if the tests come back passing, feel free to merge when you think it is ready.


rhornung67

Thanks @adayton1


MrBurmark · 2024-06-18T22:43:05Z

This is looking pretty good. Did you double-check which types are available on which hardware for CUDA and HIP? It looks like the CUDA and HIP backends are more similar now, but they still differ a bit in which types are supported on which hardware.

adayton1 · 2024-06-20T13:51:40Z

> This is looking pretty good. Did you double-check which types are available on which hardware for CUDA and HIP? It looks like the CUDA and HIP backends are more similar now, but they still differ a bit in which types are supported on which hardware.

Yeah, I double-checked the types. The main differences are that HIP doesn't provide an atomicInc or atomicDec, and CUDA supports an additional type for atomicMin and atomicMax.
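
As background on what a wrapping increment does, here is a host-side sketch of the semantics the CUDA Programming Guide documents for atomicInc: store `(old >= limit) ? 0 : old + 1` and return `old`. Emulating it with a compare-and-swap loop is an assumption made for illustration; the thread does not say how RAJA handles the operation on a backend without the native function, and `atomic_inc_wrap` is a hypothetical name.

```cpp
#include <atomic>
#include <cassert>

// Wrapping increment: counts up to `limit`, then wraps back to zero.
// Returns the value observed before the update.
unsigned atomic_inc_wrap(std::atomic<unsigned>& target, unsigned limit) {
  unsigned old_value = target.load();
  // compare_exchange_weak refreshes `old_value` on failure, so the new value
  // is recomputed from the latest observation on every retry.
  while (!target.compare_exchange_weak(
      old_value, old_value >= limit ? 0u : old_value + 1u)) {
  }
  return old_value;
}
```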

adayton1 · 2024-06-20T18:00:06Z

I keep hitting unrelated errors in the CI:

[info: cloning spack develop branch from github]
[exe: git clone --single-branch --depth=1 -b develop-2024-05-26 https://github.com/spack/spack.git spack]
Cloning into 'spack'...
error: RPC failed; curl 18 transfer closed with outstanding read data remaining
error: 3939 bytes of body are still expected
fatal: the remote end hung up unexpectedly
fatal: early EOF
fatal: index-pack failed
[spack python: /bin/sh: /dev/shm/lassen53-1939417/spack/bin/spack: No such file or directory]
[Checking for concretizer options...]
[disabling config scope (except defaults) in: /dev/shm/lassen53-1939417/spack/lib/spack/spack/config.py]
Traceback (most recent call last):
  File "./scripts/uberenv/uberenv.py", line 1389, in <module>
    sys.exit(main())
  File "./scripts/uberenv/uberenv.py", line 1335, in main
    env.patch()
  File "./scripts/uberenv/uberenv.py", line 892, in patch
    self.disable_spack_config_scopes()
  File "./scripts/uberenv/uberenv.py", line 862, in disable_spack_config_scopes
    cfg_script = open(spack_lib_config).read()
FileNotFoundError: [Errno 2] No such file or directory: '/dev/shm/lassen53-1939417/spack/lib/spack/spack/config.py'

adayton1 added 22 commits June 10, 2024 18:25

Remove volatile from seq and omp atomics

f7d323d

Remove volatile from atomic builtins

2608091

Remove volatile from atomic_auto

07cea77

Merge branch 'develop' into feature/dayton8/no_volatile_atomics

243b8b7

Remove volatile from desul atomics

37a74b2

Remove volatile from pattern

17a1c8f

Resolve merge conflicts

e85c1c4

Fixes after merge

c31a1f4

Implement hip_atomicLoad in terms of builtins

7b5aea1

Simplify

2c205c8

Fix after bad merge

4f4e7bd

Fix static_assert

815f609

Clearer message

ba675ed

Merge in develop

8a9fc20

Attempt to fix build errors

9c47bdb

Fixes and remove volatile

1ac0409

Remove more volatile qualifiers

148aa5a

Clean up

be32127

Fix build error

aae088b

Make captures explicit

b073052

Use consistent variable names

b351cfb

Implement backup atomicStore in terms of atomicExchange

e55d2ae

adayton1 added 6 commits June 17, 2024 09:47

Remove more uses of volatile

f88fe9c

CUDA atomic cleanup

1540532

Remove volatile from cuda atomics

7cc74ab

Reimplement atomicStore in CUDA

50568e5

Clean up cuda atomics

ee460dd

Clean up cuda atomics

a79953a

MrBurmark reviewed Jun 18, 2024

include/RAJA/policy/cuda/atomic.hpp (outdated, resolved)

adayton1 added 8 commits June 18, 2024 10:15

Add define for cuda atomic_ref

2e02e3d

Remove unnecessary __device__ specifier

00bf942

Fix undefined behavior where possible

07d5950

Avoid undefined behavior in hip atomics

8a94c55

Add shortcircuit cas for cuda

b4922fd

Add shortcuiting to hip atomics

b291bdf

Move duplicate definition into util header

8742ae1

Use shared implemenation of enable_if helpers

ea4a245

adayton1 requested a review from MrBurmark June 18, 2024 20:18

rhornung67 reviewed Jun 18, 2024

include/RAJA/policy/cuda/atomic.hpp (five reviews in total, each on an outdated, resolved thread)

adayton1 added 2 commits June 18, 2024 13:28

Address review comments

10d308d

Qualify list

d4d0ca7

adayton1 requested a review from rhornung67 June 18, 2024 20:40

rhornung67 approved these changes Jun 18, 2024


MrBurmark reviewed Jun 18, 2024

include/RAJA/policy/cuda/atomic.hpp (outdated, resolved)

adayton1 and others added 2 commits June 20, 2024 08:43

Improve short-circuiting

3e2472b

Merge branch 'develop' into feature/dayton8/no_volatile_atomics

22c318c

adayton1 requested a review from MrBurmark June 20, 2024 15:44

MrBurmark approved these changes Jun 20, 2024


adayton1 merged commit 70e57de into develop Jun 20, 2024
24 checks passed

adayton1 deleted the feature/dayton8/no_volatile_atomics branch June 20, 2024 18:48
