v1.2.0

@lukas-mazur lukas-mazur released this 12 Dec 15:20
c0a4a19

This release primarily includes improvements for AMD GPUs.

Changelog:

  • Communication streams have been moved to CommunicationBase
  • Added launch bounds for HIP kernel
  • Improved RHMC for AMD GPUs
  • Manual loop unrolling in Dslash for AMD GPUs
  • Updated profiling applications
  • Added more profiling applications for development:
    • axpy
    • triad
    • 7linkprof
    • cgprofiling