AOCL-BLAS 4.1

@sireeshasanga sireeshasanga released this 07 Aug 15:39
· 251 commits to master since this release

AOCL-BLAS 4.1 Release Highlights

  • New APIs and post-ops support in the aocl_gemm add-on, along with improved performance for its existing APIs
  • Dynamic dispatch and amdzen configuration support added to the aocl_gemm add-on
  • Dynamic dispatch feature enhancements
  • AVX-512-based optimizations for the AMD “Zen4” platform:
    - SGEMM, DGEMM, and ZGEMM
    - DTRSM, D/ZAXPY, ZGEMV, DDOTV, and D/ZSCALV
  • Improved support for OpenMP nested parallelism