Port 12 benchmarks to Kokkos#68
Merged
Merged
Conversation
Port three SYCL benchmarks to Kokkos 3.7: - kalman-kokkos: Kalman filter on batched time series, one thread per series - libor-kokkos: LIBOR Monte Carlo with nogreek/greek kernels, stride loops - particle-diffusion-kokkos: Water molecule diffusion Monte Carlo All benchmarks build and run correctly on Kokkos OpenMP backend. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> Co-authored-by: kento <1034379+kento@users.noreply.github.com>
- particle-diffusion-kokkos: replace bitwise & with logical && in bounds check - libor-kokkos: add comment explaining intentional L_b -> L aliasing Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> Co-authored-by: kento <1034379+kento@users.noreply.github.com>
- mallocFree-kokkos: times Kokkos::View allocation/deallocation for sizes 64B–16MB (device and host memory spaces) - pitch-kokkos: compares pitched (64-byte aligned rows) vs simple 2D/3D sigmoid kernels using MDRangePolicy<Rank<2|3>> - matrixT-kokkos: 8 matrix-transpose variants using MDRangePolicy (OpenMP backend does not support team_size=256; MDRangePolicy faithfully replicates each variant's access pattern and passes PASS) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> Co-authored-by: kento <1034379+kento@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> Co-authored-by: kento <1034379+kento@users.noreply.github.com>
Successfully ported: epistasis, fpc, kalman, knn, libor, mallocFree,
matrixT, nms, particle-diffusion, pitch, scan, sheath
Skipped: langford (complex DFS with template recursion),
matern (complex 2D scratch memory with non-standard constants),
sad (requires external bitmap image files)
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: kento <1034379+kento@users.noreply.github.com>
Copilot created this pull request from a session on behalf of
kento
April 12, 2026 14:47
View session
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
HeCBench had 361 benchmarks lacking Kokkos implementations. This PR adds initial Kokkos ports for 12 of them.
New
*-kokkosdirectoriesEach adds
main.cpp+Makefileusing system Kokkos 3.7 (OpenMP backend):epistasisfpckalmanknnlibormallocFreematrixTnmsparticle-diffusionpitchscansheathPorting conventions
parallel_for(nd_range)→Kokkos::parallel_for/parallel_reduceKokkos::View, transfers viaKokkos::deep_copyKokkos::atomic_fetch_addKokkos::initialize/Kokkos::finalize-lkokkoscore -lkokkoscontainers -fopenmp -lpthread -ldl