Introduce KOKKOS_IMPL_ARCH_NVIDIA_GPU macro #5948
Looks OK to me.
Not quite sure why
…ackends where appropriate
c9fc6de to 4dcb294
Retriggering CI. I didn't change anything, just amended the last commit without modifying it.
Only one of the HIP CI runs is failing, with the usual errors.
I think we really should consider giving this macro the compute capability (CC) as its value, but that is not blocking for now.
I implemented your suggestion: the macro is now set in the code rather than with CMake, as you wanted.
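For readers following the exchange above, here is a minimal sketch of what "setting the macro in the code" with the compute capability as its value could look like. It is not the header from this PR. The mapping shown is an assumption, only three architectures are listed, and `KOKKOS_ARCH_AMPERE80` is defined by hand here so that the fragment compiles on its own (in a real build it would come from the build system). The names `targets_nvidia_gpu` and `nvidia_compute_capability` are hypothetical helpers for illustration.

```cpp
// Stand-in for a definition that the build system would normally provide.
// Defined by hand only so that this sketch compiles on its own.
#define KOKKOS_ARCH_AMPERE80

// Map the per-architecture macros to a single macro whose value is the
// compute capability written as major * 10 + minor (assumed encoding).
#if defined(KOKKOS_ARCH_VOLTA70)
#define KOKKOS_IMPL_ARCH_NVIDIA_GPU 70
#elif defined(KOKKOS_ARCH_AMPERE80)
#define KOKKOS_IMPL_ARCH_NVIDIA_GPU 80
#elif defined(KOKKOS_ARCH_HOPPER90)
#define KOKKOS_IMPL_ARCH_NVIDIA_GPU 90
#endif

// Hypothetical helpers that expose the result to ordinary C++ code.
#if defined(KOKKOS_IMPL_ARCH_NVIDIA_GPU)
constexpr bool targets_nvidia_gpu      = true;
constexpr int nvidia_compute_capability = KOKKOS_IMPL_ARCH_NVIDIA_GPU;
#else
constexpr bool targets_nvidia_gpu      = false;
constexpr int nvidia_compute_capability = 0;  // no NVIDIA GPU targeted
#endif
```

Keeping the mapping in a header means CMake and the generated makefiles only have to define the per-architecture macro, which matches the concern about build-system complexity raised in the PR description.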
84c7ca8 to f670cae
Should we initially introduce this as
I brought this up at the developer meeting last week and it did not seem to be a concern. I do not feel strongly either way.
Sorry, I missed that. It's fine by me as is. I just wanted to make sure it was considered.
Per dev meeting, make it an IMPL macro
Title changed: KOKKOS_ARCH_NVIDIA_GPU macro to KOKKOS_IMPL_ARCH_NVIDIA_GPU macro
Retest this please
Clang+CUDA build failed with
Looks unrelated, but rerunning to be safe.
Looks OK to me (apart from Bruno's questions).
…aGpuArchitectures.hpp
Co-authored-by: Phil Miller <unmobile+gh@gmail.com>
We already have KOKKOS_ARCH_INTEL_GPU. Such a macro is useful for backends like SYCL, OpenMPTarget, or OpenACC that may target GPUs from different vendors.
I am not sure whether we want it to be a public or a private macro. We could consider defining it to be the compute capability as an integer, but that means more complicated logic in the generated makefiles and in CMake.
This was proposed at the last developer meeting when discussing #5867.
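To illustrate the use case described above, the following sketch shows how code in a vendor-agnostic backend might branch on the vendor macros. It assumes the macro carries the compute capability as an integer. The enum `GpuVendor` and the functions `target_gpu_vendor` and `nvidia_cc_at_least` are hypothetical names invented for this example and are not part of Kokkos. The macro is defined by hand so that the fragment compiles standalone.

```cpp
// Defined by hand for this sketch; a real build would get it from Kokkos.
#define KOKKOS_IMPL_ARCH_NVIDIA_GPU 80

// Hypothetical vendor tag used only in this example.
enum class GpuVendor { None, Nvidia, Intel };

// Select the GPU vendor at compile time from the vendor-level macros,
// without enumerating every per-architecture macro.
constexpr GpuVendor target_gpu_vendor() {
#if defined(KOKKOS_IMPL_ARCH_NVIDIA_GPU)
  return GpuVendor::Nvidia;
#elif defined(KOKKOS_ARCH_INTEL_GPU)
  return GpuVendor::Intel;
#else
  return GpuVendor::None;
#endif
}

// With an integer value, a minimum compute capability can be checked
// with a single comparison.
constexpr bool nvidia_cc_at_least(int cc) {
#if defined(KOKKOS_IMPL_ARCH_NVIDIA_GPU)
  return KOKKOS_IMPL_ARCH_NVIDIA_GPU >= cc;
#else
  return false;
#endif
}
```

A backend could then guard an NVIDIA-specific code path with `target_gpu_vendor() == GpuVendor::Nvidia`, or directly with `#if defined(KOKKOS_IMPL_ARCH_NVIDIA_GPU)` in the preprocessor.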