Releases: LatticeQCD/SIMULATeQCD
Releases · LatticeQCD/SIMULATeQCD
v1.2.0
This release primarily includes improvements for AMD GPUs
Changelog:
- Communication streams have been moved to CommunicationBase
- Added launch bounds for HIP kernel
- Improved RHMC for AMD GPUs
- Manual loop unrolling in Dslash for AMD GPUs
- Updated profiling applications
- added more profiling applications for development:
- axpy
- triad
- 7linkprof
- cgprofiling
v1.1.0
Changelog:
- Simplified Haloloop
- Added P2P support on AMD GPUs
- Added Marker API for Profiling for NVIDIA and AMD GPUs
- blocksize fix in runfunctor
- CMakeList.txt update
- Multi-RHS dslash improvements
- Fixed clang warnings
- Various bug fixes
v1.0.1
Improved performance for AMD GPUs.
v1.0.0
This is SIMULATeQCD version 1.0.0. The first release version.