Skip to content

Releases: LatticeQCD/SIMULATeQCD

v1.2.0

12 Dec 15:20
c0a4a19
Compare
Choose a tag to compare

This release primarily includes improvements for AMD GPUs

Changelog:

  • Communication streams have been moved to CommunicationBase
  • Added launch bounds for HIP kernel
  • Improved RHMC for AMD GPUs
  • Manual loop unrolling in Dslash for AMD GPUs
  • Updated profiling applications
  • added more profiling applications for development:
    • axpy
    • triad
    • 7linkprof
    • cgprofiling

v1.1.0

23 Aug 21:21
a63e631
Compare
Choose a tag to compare

Changelog:

  • Simplified Haloloop
  • Added P2P support on AMD GPUs
  • Added Marker API for Profiling for NVIDIA and AMD GPUs
  • blocksize fix in runfunctor
  • CMakeList.txt update
  • Multi-RHS dslash improvements
  • Fixed clang warnings
  • Various bug fixes

v1.0.1

01 Jun 13:56
207164b
Compare
Choose a tag to compare

Improved performance for AMD GPUs.

v1.0.0

25 May 12:42
2a3652b
Compare
Choose a tag to compare

This is SIMULATeQCD version 1.0.0. The first release version.