Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fuse some Kokkos kernels to reduce launch latency for small systems #3758

Merged
merged 14 commits into from May 3, 2023

Conversation

stanmoore1
Copy link
Contributor

@stanmoore1 stanmoore1 commented Apr 28, 2023

Summary

Fuse some Kokkos kernels to reduce launch latency for small systems. Gives ~20% speedup for 1k atom LJ benchmark on a single V100 GPU.

Related Issue(s)

None

Author(s)

Stan Moore (SNL)

Licensing

By submitting this pull request, I agree, that my contribution will be included in LAMMPS and redistributed under either the GNU General Public License version 2 (GPL v2) or the GNU Lesser General Public License version 2.1 (LGPL v2.1).

Backward Compatibility

Yes

@stanmoore1 stanmoore1 requested a review from athomps May 1, 2023 19:18
@stanmoore1
Copy link
Contributor Author

Running this through Kokkos regression testing again.

@stanmoore1
Copy link
Contributor Author

Kokkos regression tests all pass now.

@stanmoore1 stanmoore1 assigned akohlmey and unassigned stanmoore1 May 1, 2023
@akohlmey akohlmey added this to the Stable Release Summer 2023 milestone May 2, 2023
@akohlmey akohlmey merged commit db38d99 into lammps:develop May 3, 2023
4 checks passed
@stanmoore1 stanmoore1 deleted the kk_fuse branch May 3, 2023 15:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Development

Successfully merging this pull request may close these issues.

None yet

2 participants