This release contains several new kernels, plus substantial changes to many CUDA kernel variants to improve performance.
Please download the RAJAPerf-v0.5.0.tar.gz file below. The others will not work due to the way RAJAPerf uses git submodules.
Major changes include:
- Several new kernels in the polybench group.
- Update to RAJA v0.8.0 release.
- Exercise newer RAJA features in kernels, such as loop tiling, thread local memory, and GPU shared memory in CUDA variants.
- Build scripts have been updated to use newer compilers available on Livermore Computing platforms.