v1.1.0

@lukas-mazur lukas-mazur released this 23 Aug 21:21
· 90 commits to main since this release
a63e631

Changelog:

  • Simplified Haloloop
  • Added P2P support on AMD GPUs
  • Added Marker API for Profiling for NVIDIA and AMD GPUs
  • Fixed blocksize in runfunctor
  • Updated CMakeLists.txt
  • Multi-RHS dslash improvements
  • Fixed clang warnings
  • Various bug fixes