Closed
Description
ref: https://llvm.org/docs/LangRef.html#atomicrmw-instruction
does it make sense to support half atomic?
cuda has half/half2 atomic API and has corresponding ptx
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#atomicadd