Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Vc alpaka nbody #128

Merged

Conversation

bernhardmgruber
Copy link
Member

@bernhardmgruber bernhardmgruber commented Nov 17, 2020

Redesign the alpaka nbody to use Vc inside the kernel. The goal is to have a single source nbody version that performs optimal on GPU and CPU.

@bernhardmgruber bernhardmgruber changed the title [WIP] Vc alpaka nbody Vc alpaka nbody Nov 17, 2020
@bernhardmgruber bernhardmgruber force-pushed the vc_alpaka_nbody branch 2 times, most recently from 99e0030 to 337c472 Compare November 17, 2020 18:36
* use rsqrt and allow turning off via constant
* support AoSoA mapping as well
* allow choosing the vector length for the Vc kernel
* include inline in LLAMA_FN_HOST_ACC_INLINE for alpaka
* avoid Vc when compiling for CUDA
* keep Elems particles in registers during reduction
@bernhardmgruber bernhardmgruber merged commit 17c8fbe into alpaka-group:develop Apr 16, 2021
@bernhardmgruber bernhardmgruber deleted the vc_alpaka_nbody branch April 16, 2021 23:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants