Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] Enable sort_particles_for_deposition on AMD by default #4946

Open
wants to merge 4 commits into
base: development
Choose a base branch
from

Conversation

AlexanderSinn
Copy link
Member

@AlexanderSinn AlexanderSinn commented May 22, 2024

Recently, I tweaked amrex::SortParticlesForDeposition to work better on MI250X (AMReX-Codes/amrex#3925). In my testing with HiPACE++, it now makes the deposition faster compared to using SortParticlesByCell on MI250X, where previously SortParticlesForDeposition was worse for smaller numbers of particles per cell.

  • TODO: benchmark WarpX on AMD with the different sorting methods.

@AlexanderSinn AlexanderSinn changed the title Enable sort_particles_for_deposition on AMD Enable sort_particles_for_deposition on AMD by default May 22, 2024
@ax3l ax3l added Performance optimization backend: hip Specific to ROCm execution (GPUs) labels May 29, 2024
@ax3l ax3l requested review from ax3l, atmyers and n01r May 29, 2024 22:06
@RemiLehe RemiLehe changed the title Enable sort_particles_for_deposition on AMD by default [WIP] Enable sort_particles_for_deposition on AMD by default Jun 3, 2024
@RemiLehe
Copy link
Member

RemiLehe commented Jun 3, 2024

@n01r I think that you volunteered to perform benchmarks with this PR. Any update?

@n01r
Copy link
Member

n01r commented Jun 3, 2024

@RemiLehe, sorry no updates yet but yes I volunteered!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backend: hip Specific to ROCm execution (GPUs) Performance optimization
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants