[LIBCLC] Add support for more generic atomic operations #7391

jchlanda · 2022-11-15T08:46:41Z

The diffs are quite hard to follow, but in an essence this patch brings:

a new entry, implementing a generic address space for multiple __CLC_NVVM_ATOMIC_XYZ_IMPL, where XYZ stands for CAS, INDEC, LOAD, MAX, MIN, STORE and SUB,
fixes the name of mangled function that the IMPL uses,
the rest is just formatting to 80 chars.

This patch supersedes: #5849 but it requires the fixes to the remangler from: #7220

Fixes: #7658

jchlanda · 2022-11-15T08:47:04Z

Changing it to a draft, till #7220 is merged.

jchlanda · 2022-12-12T12:27:58Z

/verify with intel/llvm-test-suite#1446

jchlanda · 2022-12-12T12:30:35Z

/verify with intel/llvm-test-suite#1446

romanovvlad · 2022-12-12T12:39:03Z

@bader Could you please help with reviewing?

jchlanda · 2022-12-12T14:36:18Z

/verify with intel/llvm-test-suite#1446

libclc/ptx-nvidiacl/libspirv/atomic/atomic_cmpxchg.cl

Co-authored-by: Alexey Bader <alexey.bader@intel.com>

jchlanda · 2022-12-13T08:15:16Z

/verify with intel/llvm-test-suite#1446

jchlanda · 2022-12-13T09:12:53Z

/verify with intel/llvm-test-suite#1446

jchlanda · 2022-12-13T09:14:44Z

Build failure was caused by the discrepancy in the macros, as the use/def were added over 2 commits (d2eb42f and b35194f)

jchlanda · 2022-12-13T13:54:34Z

/verify with intel/llvm-test-suite#1446

jchlanda · 2022-12-14T06:42:25Z

/verify with intel/llvm-test-suite#1446

steffenlarsen

LGTM!

steffenlarsen · 2022-12-14T09:33:43Z

Verification failures unrelated:

OpenCL:

Timed Out Tests (2):
#7740
SYCL :: Basic/code_location_e2e.cpp

#7741
SYCL :: HostInteropTask/host-task-failure.cpp

Failed Tests (2):
#7742
SYCL :: KernelAndProgram/multiple-kernel-linking.cpp

Also mentioned in #7745
SYCL :: XPTI/kernel/basic.cpp

Level Zero:

Timed Out Tests (2):
#7741
SYCL :: HostInteropTask/host-task-failure.cpp

Failed Tests (19):
#7745
SYCL :: Basic/group_async_copy.cpp

#7744
SYCL :: DeviceLib/imf_fp16_trivial_test.cpp
SYCL :: DeviceLib/imf_fp32_test.cpp
SYCL :: DeviceLib/imf_half_type_cast.cpp

#7743
SYCL :: Reduction/reduction_big_data.cpp
SYCL :: Reduction/reduction_nd_N_vars.cpp
SYCL :: Reduction/reduction_nd_conditional.cpp
SYCL :: Reduction/reduction_nd_dw.cpp
SYCL :: Reduction/reduction_nd_ext_half.cpp
SYCL :: Reduction/reduction_nd_lambda.cpp
SYCL :: Reduction/reduction_range_1d_dw.cpp
SYCL :: Reduction/reduction_range_1d_dw_64bit.cpp
SYCL :: Reduction/reduction_range_1d_rw.cpp
SYCL :: Reduction/reduction_range_2d_dw.cpp
SYCL :: Reduction/reduction_range_2d_rw.cpp
SYCL :: Reduction/reduction_range_3d_dw.cpp
SYCL :: Reduction/reduction_range_N_vars.cpp
SYCL :: Reduction/reduction_range_lambda.cpp
SYCL :: Reduction/reduction_usm.cpp

Counterpart of intel/llvm#7391 This patch supersedes #929

steffenlarsen · 2022-12-14T11:42:07Z

@jchlanda - It looks like some of the newly enabled test-suite tests are failing for CUDA still, specifically due to AtomicStore. See for example https://github.com/intel/llvm/actions/runs/3693963954/jobs/6254968423.

jchlanda · 2022-12-14T11:44:02Z

Interesting, it does look like a generic pointer in those stores, will have a look.

jchlanda · 2022-12-14T14:46:03Z

@jchlanda - It looks like some of the newly enabled test-suite tests are failing for CUDA still, specifically due to AtomicStore. See for example https://github.com/intel/llvm/actions/runs/3693963954/jobs/6254968423.

@steffenlarsen I think I know what's going on here. Seems like there are two problems:

hard coded mangled name in libclc is wrong, I've looked in disassembled file and can see instances of _Z19_spirv_AtomicStoreP ... Scope::MemorySemanticsMask ... that :: is most likely a type substitution gone wrong in the name.
looks like I've got my check-libclc wrong and it only runs remangler against clc-nvptx64--nvidiacl.bc, whereas we should be probably running it against everything that is generated, i.e.:

builtins.link.clc-nvptx64--nvidiacl.bc     
builtins.link.libspirv-nvptx64--.bc        
builtins.link.libspirv-nvptx64--nvidiacl.bc
builtins.opt.clc-nvptx64--nvidiacl.bc      
builtins.opt.libspirv-nvptx64--.bc
builtins.opt.libspirv-nvptx64--nvidiacl.bc 
clc-nvptx64--nvidiacl.bc                   
libspirv-nvptx64--.bc                      
libspirv-nvptx64--nvidiacl.bc

which is why it wasn't picked up earlier, as clc-nvptx64--nvidiacl.bc does not contain all the functions.

I'm not sure what is the best course of action here, either reverting the patch, or XFAILING those 4 tests, as I'm unlikely to get the fix today.

steffenlarsen · 2022-12-14T14:50:33Z

Thank you, @jchlanda ! Since they were just enabled, I think adding XFAIL back to those tests is the best course of action. Your patch seems to still work for a good chunk of them so we may as well keep this functionality, unless you can think of a reason why it may be disruptive?

Counterpart of intel#7391 This patch supersedes intel/llvm-test-suite#929

jchlanda requested a review from a team as a code owner November 15, 2022 08:46

jchlanda requested a review from romanovvlad November 15, 2022 08:46

jchlanda marked this pull request as draft November 15, 2022 08:46

[LIBCLC] Add support for more generic atomic operations

e1eeb3d

jchlanda mentioned this pull request Dec 12, 2022

[SYCL][CUDA] Enable generic atomic tests intel/llvm-test-suite#1446

Merged

jchlanda marked this pull request as ready for review December 12, 2022 12:27

jchlanda force-pushed the jakub/libclc_generic_atomics_2 branch from b50e560 to e1eeb3d Compare December 12, 2022 12:30

bader approved these changes Dec 12, 2022

View reviewed changes

libclc/ptx-nvidiacl/libspirv/atomic/atomic_cmpxchg.cl Outdated Show resolved Hide resolved

libclc/ptx-nvidiacl/libspirv/atomic/atomic_cmpxchg.cl Outdated Show resolved Hide resolved

jchlanda and others added 2 commits December 13, 2022 09:13

Update libclc/ptx-nvidiacl/libspirv/atomic/atomic_cmpxchg.cl

d2eb42f

Co-authored-by: Alexey Bader <alexey.bader@intel.com>

Update libclc/ptx-nvidiacl/libspirv/atomic/atomic_cmpxchg.cl

b35194f

Co-authored-by: Alexey Bader <alexey.bader@intel.com>

jchlanda mentioned this pull request Dec 13, 2022

Ptxas: unresolved extern function __spirv_AtomicLoad(long long const*...) #7658

Closed

steffenlarsen approved these changes Dec 14, 2022

View reviewed changes

steffenlarsen merged commit e99ead8 into intel:sycl Dec 14, 2022

steffenlarsen pushed a commit to intel/llvm-test-suite that referenced this pull request Dec 14, 2022

[SYCL][CUDA] Enable generic atomic tests (#1446)

a60d6b7

Counterpart of intel/llvm#7391 This patch supersedes #929

jchlanda mentioned this pull request Dec 14, 2022

[SYCL][CUDA][libclc] Add support for generic AS in atomics #5849

Closed

This was referenced Dec 14, 2022

[LIBCLC] Fix atomic stores for NVPTX #7780

Merged

[LIBCLC] Correctly track remangler test dependencies #7829

Merged

This was referenced Jan 12, 2023

[CUDA] Add support for the generic address space #5215

Closed

[SYCL][CUDA] sycl::atomic_ref usage on CUDA backend produces linking error #5647

Closed

aelovikov-intel pushed a commit to aelovikov-intel/llvm that referenced this pull request Mar 27, 2023

[SYCL][CUDA] Enable generic atomic tests (intel/llvm-test-suite#1446)

0231da8

Counterpart of intel#7391 This patch supersedes intel/llvm-test-suite#929

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LIBCLC] Add support for more generic atomic operations #7391

[LIBCLC] Add support for more generic atomic operations #7391

jchlanda commented Nov 15, 2022 •

edited

Loading

jchlanda commented Nov 15, 2022

jchlanda commented Dec 12, 2022

jchlanda commented Dec 12, 2022

romanovvlad commented Dec 12, 2022

jchlanda commented Dec 12, 2022

jchlanda commented Dec 13, 2022

jchlanda commented Dec 13, 2022

jchlanda commented Dec 13, 2022

jchlanda commented Dec 13, 2022

jchlanda commented Dec 14, 2022

steffenlarsen left a comment

steffenlarsen commented Dec 14, 2022

steffenlarsen commented Dec 14, 2022

jchlanda commented Dec 14, 2022

jchlanda commented Dec 14, 2022

steffenlarsen commented Dec 14, 2022

[LIBCLC] Add support for more generic atomic operations #7391

[LIBCLC] Add support for more generic atomic operations #7391

Conversation

jchlanda commented Nov 15, 2022 • edited Loading

jchlanda commented Nov 15, 2022

jchlanda commented Dec 12, 2022

jchlanda commented Dec 12, 2022

romanovvlad commented Dec 12, 2022

jchlanda commented Dec 12, 2022

jchlanda commented Dec 13, 2022

jchlanda commented Dec 13, 2022

jchlanda commented Dec 13, 2022

jchlanda commented Dec 13, 2022

jchlanda commented Dec 14, 2022

steffenlarsen left a comment

Choose a reason for hiding this comment

steffenlarsen commented Dec 14, 2022

steffenlarsen commented Dec 14, 2022

jchlanda commented Dec 14, 2022

jchlanda commented Dec 14, 2022

steffenlarsen commented Dec 14, 2022

jchlanda commented Nov 15, 2022 •

edited

Loading