rocBLAS 2.43.0 for ROCm 5.1.0

lawruble13 released this 30 Mar 17:30
490c414

Added

  • Option in the install script to set the number of jobs used for rocBLAS and Tensile compilation (-j, --jobs)
  • Option in the install script to build clients without using any Fortran (--clients_no_fortran)
  • rocblas_client_initialize function, which performs rocBLAS initialization for clients (benchmark/test) and reports the execution time
  • Tests for the output of reduction functions when given bad input
  • User-specified initialization (rand_int/trig_float/hpl) for matrices and vectors in rocblas-bench

Optimizations

  • Improved performance of trsm with side == left and n == 1
  • Improved performance of trsm with side == left and m <= 32, and with side == right and n <= 32

Changed

  • The syrkx and trmm internal APIs now use the rocblas_stride datatype for offset
  • For non-batched and batched gemm_ex functions, if the C matrix pointer equals the D matrix pointer (aliased), their respective type and leading dimension arguments must now match
  • Test client dependencies updated to GTest 1.11
  • Moved non-global false positives reported by cppcheck from file-based suppression to inline suppression. File-based suppression is now used only for global false positives.
  • Help menu messages in install.sh
  • For the ger function, typecast the 'lda' (offset) datatype to size_t during offset calculation to avoid overflow, and removed duplicate template functions
  • Changed the default initialization from rand_int to hpl for matrices and vectors in rocblas-bench
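
The gemm_ex aliasing rule in this list can be read as a precondition on the C and D arguments. The following is a minimal sketch of that check, not rocBLAS source code: the `datatype` enum, the `gemm_ex_args` struct, and the `aliasing_ok` function are hypothetical names introduced only for illustration.

```cpp
#include <cstdint>

// Hypothetical stand-in for a datatype enum; not the rocBLAS definition.
enum class datatype { f16, f32, f64 };

// Hypothetical bundle of the C- and D-related arguments of a gemm_ex-style call.
struct gemm_ex_args
{
    const void*  c;
    datatype     c_type;
    std::int64_t ldc;
    void*        d;
    datatype     d_type;
    std::int64_t ldd;
};

// Returns true when the arguments satisfy the aliasing rule:
// if C and D are the same pointer, their types and leading dimensions must match.
inline bool aliasing_ok(const gemm_ex_args& a)
{
    if(a.c != a.d)
        return true; // not aliased: this rule adds no constraint
    return a.c_type == a.d_type && a.ldc == a.ldd;
}
```

Under this reading, an in-place call (C and D aliased) with ldc != ldd or with differing types would be rejected, while calls with distinct C and D buffers are unaffected by the rule.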

Fixed

  • For the trmv function (non-transposed cases), avoid overflow in offset calculation
  • Fixed cppcheck errors/warnings
  • Fixed doxygen warnings
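
The offset overflow addressed for trmv (and for ger under Changed) is the kind that appears when an index is multiplied by a leading dimension in 32-bit arithmetic. The sketch below illustrates the general technique of widening to size_t before multiplying. It is not the rocBLAS implementation: `column_offset` is a hypothetical helper, and it assumes non-negative inputs and a 64-bit size_t.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical helper: element offset of column `col` in a column-major
// matrix with leading dimension `lda`.
// Assumptions: col >= 0, lda >= 0, and std::size_t is 64 bits wide.
inline std::size_t column_offset(std::int32_t col, std::int32_t lda)
{
    // Widen both operands before multiplying so the product is computed
    // in size_t arithmetic rather than overflowing a 32-bit int.
    return static_cast<std::size_t>(col) * static_cast<std::size_t>(lda);
}
```

For example, with col = 65536 and lda = 65536 the product is 2^32, which does not fit in a 32-bit signed integer but is represented exactly once the operands are widened.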