Skip to content

rocWMMA 0.7 for ROCm 5.2.0

Compare
Choose a tag to compare
@ROCmMathLibrariesBot ROCmMathLibrariesBot released this 28 Jun 18:46
1c4614a

Added

  • Added unit tests for DLRM kernels
  • Added GEMM sample
  • Added DLRM sample
  • Added SGEMV sample
  • Added unit tests for cooperative wmma load and stores
  • Added unit tests for IOBarrier.h
  • Added wmma load/ store tests for different matrix types (A, B and Accumulator)
  • Added more block sizes 1, 2, 4, 8 to test MmaSyncMultiTest
  • Added block sizes 4, 8 to test MmaSynMultiLdsTest
  • Added support for wmma load / store layouts with block dimension greater than 64
  • Added IOShape structure to define the attributes of mapping and layouts for all wmma matrix types
  • Added CI testing for rocWMMA

Changed

  • Renamed wmma to rocwmma in cmake, header files and documentation
  • Renamed library files
  • Modified Layout.h to use different matrix offset calculations (base offset, incremental offset and cumulative offset)
  • Opaque load/store continue to use incrementatl offsets as they fill the entire block
  • Cooperative load/store use cumulative offsets as they fill only small portions for the entire block
  • Increased Max split counts to 64 for cooperative load/store
  • Moved all the wmma definitions, API headers to rocwmma namespace
  • Modified wmma fill unit tests to validate all matrix types (A, B, Accumulator)