[LIBCLC][CUDA] Apply always_inline to all atomics #5710

jchlanda · 2022-03-02T14:49:50Z

Interestingly, the performance penalty here comes not from performing the call itself, but from clang being unable to optimise away the cases that the atomics define but that a given call site does not need.
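To illustrate the effect, here is a minimal sketch in plain C11. It is not the libclc source: `order_t`, `sketch_atomic_add` and `bump_relaxed` are hypothetical names, and the host-side `<stdatomic.h>` calls stand in for the device builtins. The assumption is that an atomic helper dispatches on an ordering argument that is a compile-time constant at each call site.

```c
#include <stdatomic.h>

/* Hypothetical ordering enum; stands in for the ordering argument
 * that an atomic helper receives from its caller. */
typedef enum {
  ORDER_RELAXED,
  ORDER_ACQUIRE,
  ORDER_RELEASE,
  ORDER_SEQ_CST
} order_t;

/* The helper defines every case. With always_inline the body is copied
 * into each call site, so a constant `order` lets the compiler fold the
 * switch and keep only the branch that is actually needed. Without
 * inlining, every caller pays for the full dispatch. */
static inline __attribute__((always_inline))
int sketch_atomic_add(atomic_int *p, int v, order_t order) {
  switch (order) {
  case ORDER_RELAXED:
    return atomic_fetch_add_explicit(p, v, memory_order_relaxed);
  case ORDER_ACQUIRE:
    return atomic_fetch_add_explicit(p, v, memory_order_acquire);
  case ORDER_RELEASE:
    return atomic_fetch_add_explicit(p, v, memory_order_release);
  default:
    return atomic_fetch_add_explicit(p, v, memory_order_seq_cst);
  }
}

/* Call site with a constant ordering: after inlining, only the relaxed
 * fetch-add should remain. Returns the value held before the add. */
int bump_relaxed(atomic_int *counter) {
  return sketch_atomic_add(counter, 1, ORDER_RELAXED);
}
```

Comparing the compiler output for `bump_relaxed` with and without the attribute is one way to check whether the unused cases are really dropped.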

jchlanda · 2022-03-03T10:52:12Z

Am I right in thinking that the test suite failures are unrelated?

bader · 2022-03-03T10:56:16Z

> Am I right in thinking that the test suite failures are unrelated?

I think so. AFAIK, libclc is not used by the OpenCL back-end.

[LIBCLC][CUDA] Apply always_inline to all atomics

fb8c9c2

jchlanda requested a review from bader as a code owner March 2, 2022 14:49

bader approved these changes Mar 2, 2022

bader merged commit dda743a into intel:sycl Mar 3, 2022
