[SYCL][PI][CUDA] Update queries for atomic order and scope for CUDA #4853

t4c1 · 2021-10-29T12:37:05Z

Updates returns for atomics memory order and scope capabilities queries to make them in line with changes in #4820.

This includes adding the previously not existing option to query for atomic scope capabilities.

…abilities for CUDA devices

steffenlarsen

Looks good, though I have some small comments while this is blocked anyway.

Could you please add a case for the new descriptors in the other PI plugins, like with PI_DEVICE_INFO_ATOMIC_MEMORY_ORDER_CAPABILITIES? I don't expect full implementations, but for consistency it helps keep track of which ones aren't implemented in the corresponding plugins.

Also, would you mind changing this to a draft PR and adding "Draft: " or "[WIP]" (I think the former is more visible) to the title? This is just to prevent it from being prematurely merged by mistake.

sycl/include/CL/sycl/detail/pi.h

steffenlarsen · 2021-11-05T12:30:49Z

After reviewing #4820 I do not think having it merged is enough to unblock this PR. The reason is that, even though it introduces atomic operations with additional memory scopes (acq_rel, acquire, and release), these are still not supported by atomic load/store in LLVM's NVPTX implementation.

For more context see llvm/lib/Target/NVPTX/NVPTXISelDAGToDAG.cpp#L859 (and similar for store). This previously caused libclc to fail to build kernels that would even remotely consider using atomic load/store with anything stricter than "unordered" memory order (see 4876443.)

t4c1 · 2021-11-09T12:37:13Z

Could you please add a case for the new descriptors in the other PI plugins, like with PI_DEVICE_INFO_ATOMIC_MEMORY_ORDER_CAPABILITIES? I don't expect full implementations, but for consistency it helps keep track of which ones aren't implemented in the corresponding plugins.

That should be done now.

bader · 2021-11-17T04:50:26Z

@t4c1, please, update ABI tests.

t4c1 · 2021-11-17T12:43:01Z

The ABI tests should be fixed now.

bader

Approving to start testing.

Dismiss approve to avoid unintentional merge as this PR is labeled as a draft.

steffenlarsen

LGTM. Thanks for adding this!

bader · 2021-11-23T16:53:14Z

@t4c1, please, resolve merge conflicts.

# Conflicts: # sycl/include/CL/sycl/info/info_desc.hpp

bader · 2021-11-30T13:28:02Z

Blocked by: #4820

#4820 is merged. Is this PR unblocked now?

t4c1 · 2021-11-30T13:30:41Z

Given #4853 (comment), I would say no. In the meantime I also figured there are some atomics without direct ptx equivalents missing and will add them soon.

t4c1 · 2022-01-18T10:26:32Z

All the PRs blocking this have now been merged.

t4c1 · 2022-01-26T07:37:48Z

Does this need to wait for something else or can it be merged?

bader · 2022-01-26T07:59:24Z

@againull, @s-kanaev or @smaslov-intel are expected to approve Level Zero plug-in changes. Folks, could you take a look, please?

againull

Plugin changes look good to me.

bader · 2022-01-28T08:43:34Z

@t4c1, this change broke two tests in llvm-test-suite.

SYCL :: AtomicRef/atomic_memory_order_acq_rel.cpp
SYCL :: AtomicRef/atomic_memory_order_acq_rel_atomic64.cpp

Error message:

PI CUDA ERROR:
	Value:           719
	Name:            CUDA_ERROR_LAUNCH_FAILED
	Description:     unspecified launch failure
	Function:        cuda_piEnqueueMemBufferRead
	Source Location: llvm.src/sycl/plugins/cuda/pi_cuda.cpp:2430

atomic_memory_order_acq_rel_atomic64.cpp.tmp.out: llvm-test-suite/SYCL/AtomicRef/atomic_memory_order_acq_rel.h:39: void acq_rel_test(sycl::queue, size_t) [AtomicRef = sycl::ext::oneapi::atomic_ref, address_space = sycl::access::address_space::global_space, T = double]: Assertion `a == T(N)' failed.

error: command failed with exit status: -6

Could you take a look please?

t4c1 · 2022-01-28T09:11:34Z

This change just let these tests run for CUDA. What is broken is all the atomic_memory_order* tests. They are using the pattern:

            auto ld = aar.load();
            ld += 1;
            aar.store(ld);

and checking if the whole sequence of operations is atomic. Which it is not - only each of the operations (load/store) on its own is atomic. I am not sure what these tests are supposed to check ... maybe we can just remove them?

t4c1 · 2022-01-28T09:16:26Z

Also, as far as I know, no backend supported acquire release (or sequentially consistent) order before this PR was merged, so these tests were never actually run.

bader · 2022-01-28T10:01:26Z

Could you create a patch to llvm-test-suite with removing illegal checks and add @steffenlarsen to discuss this change, please?

Removes `atomic_memory_order*` tests, which are broken. They are using the pattern: ``` auto ld = aar.load(); ld += 1; aar.store(ld); ``` and checking if the whole sequence of operations is atomic. Which it is not - only each of the operations (load/store) on its own is atomic. Before intel/llvm#4853 was merged no backend supported acquire release or sequentially consistent memory orders, so these tests were never run before. This issue was first discussed here: intel/llvm#4853 (comment)

Removes `atomic_memory_order*` tests, which are broken. They are using the pattern: ``` auto ld = aar.load(); ld += 1; aar.store(ld); ``` and checking if the whole sequence of operations is atomic. Which it is not - only each of the operations (load/store) on its own is atomic. Before #4853 was merged no backend supported acquire release or sequentially consistent memory orders, so these tests were never run before. This issue was first discussed here: #4853 (comment)

…e#783) Removes `atomic_memory_order*` tests, which are broken. They are using the pattern: ``` auto ld = aar.load(); ld += 1; aar.store(ld); ``` and checking if the whole sequence of operations is atomic. Which it is not - only each of the operations (load/store) on its own is atomic. Before intel#4853 was merged no backend supported acquire release or sequentially consistent memory orders, so these tests were never run before. This issue was first discussed here: intel#4853 (comment)

[SYCL][PI][CUDA] Added querries for atomic memory order and scope cap…

4536f2e

…abilities for CUDA devices

t4c1 requested review from againull, smaslov-intel and a team as code owners October 29, 2021 12:37

steffenlarsen reviewed Nov 5, 2021

View reviewed changes

sycl/include/CL/sycl/detail/pi.h Outdated Show resolved Hide resolved

t4c1 changed the title ~~[SYCL][PI][CUDA] Update queries for atomic order and scope for CUDA~~ Draft: [SYCL][PI][CUDA] Update queries for atomic order and scope for CUDA Nov 8, 2021

dm-vodopyanov added the cuda CUDA back-end label Nov 8, 2021

dm-vodopyanov added this to In review in oneAPI DPC++ via automation Nov 8, 2021

added stups to other plugins

a6b860d

format

5e76253

t4c1 added 3 commits November 17, 2021 08:35

[SYCL][PI][CUDA] fixed linux ABI test

4325008

[SYCL][PI][CUDA] actually fixed the linux ABI test

7ddef49

[SYCL][PI][CUDA] fixed windows ABI test

b9cfe79

bader previously approved these changes Nov 17, 2021

View reviewed changes

t4c1 mentioned this pull request Nov 18, 2021

[SYCL] Added tests for atomics with various memory orders and scopes intel/llvm-test-suite#534

Merged

bader requested review from steffenlarsen and removed request for againull November 20, 2021 23:36

steffenlarsen previously approved these changes Nov 22, 2021

View reviewed changes

Merge branch 'sycl' into atomic_querries

e076073

# Conflicts: # sycl/include/CL/sycl/info/info_desc.hpp

t4c1 dismissed steffenlarsen’s stale review via e076073 November 24, 2021 08:10

bader marked this pull request as draft November 30, 2021 13:34

t4c1 added 2 commits December 23, 2021 11:15

Merge branch 'sycl' into atomic_querries

708abd4

Merge branch 'sycl' into atomic_querries

b9993b0

t4c1 marked this pull request as ready for review January 18, 2022 10:25

t4c1 requested a review from a team as a code owner January 18, 2022 10:25

t4c1 changed the title ~~Draft: [SYCL][PI][CUDA] Update queries for atomic order and scope for CUDA~~ [SYCL][PI][CUDA] Update queries for atomic order and scope for CUDA Jan 18, 2022

bader requested review from a team January 18, 2022 10:28

steffenlarsen previously approved these changes Jan 18, 2022

View reviewed changes

Merge branch 'sycl' into atomic_querries

99fa647

t4c1 dismissed steffenlarsen’s stale review via 99fa647 January 21, 2022 08:55

bader approved these changes Jan 26, 2022

View reviewed changes

againull approved these changes Jan 26, 2022

View reviewed changes

bader merged commit 43a4192 into intel:sycl Jan 26, 2022

oneAPI DPC++ automation moved this from In review to Closed Jan 26, 2022

t4c1 mentioned this pull request Jan 28, 2022

[SYCL] Remove broken atomic_memory_order* tests intel/llvm-test-suite#783

Merged

t4c1 deleted the atomic_querries branch March 15, 2022 08:51

ldrumm mentioned this pull request Jan 9, 2023

Implement AtomicFAddEXT for the CUDA BE #2853

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCL][PI][CUDA] Update queries for atomic order and scope for CUDA #4853

[SYCL][PI][CUDA] Update queries for atomic order and scope for CUDA #4853

t4c1 commented Oct 29, 2021 •

edited by bader

steffenlarsen left a comment

steffenlarsen commented Nov 5, 2021

t4c1 commented Nov 9, 2021

bader commented Nov 17, 2021

t4c1 commented Nov 17, 2021

bader left a comment

steffenlarsen left a comment

bader commented Nov 23, 2021

bader commented Nov 30, 2021

t4c1 commented Nov 30, 2021

t4c1 commented Jan 18, 2022

t4c1 commented Jan 26, 2022

bader commented Jan 26, 2022

againull left a comment

bader commented Jan 28, 2022

t4c1 commented Jan 28, 2022

t4c1 commented Jan 28, 2022

bader commented Jan 28, 2022

[SYCL][PI][CUDA] Update queries for atomic order and scope for CUDA #4853

[SYCL][PI][CUDA] Update queries for atomic order and scope for CUDA #4853

Conversation

t4c1 commented Oct 29, 2021 • edited by bader

steffenlarsen left a comment

Choose a reason for hiding this comment

steffenlarsen commented Nov 5, 2021

t4c1 commented Nov 9, 2021

bader commented Nov 17, 2021

t4c1 commented Nov 17, 2021

bader left a comment

Choose a reason for hiding this comment

steffenlarsen left a comment

Choose a reason for hiding this comment

bader commented Nov 23, 2021

bader commented Nov 30, 2021

t4c1 commented Nov 30, 2021

t4c1 commented Jan 18, 2022

t4c1 commented Jan 26, 2022

bader commented Jan 26, 2022

againull left a comment

Choose a reason for hiding this comment

bader commented Jan 28, 2022

t4c1 commented Jan 28, 2022

t4c1 commented Jan 28, 2022

bader commented Jan 28, 2022

t4c1 commented Oct 29, 2021 •

edited by bader