Skip to content

OpenBLAS 0.3.12 version

Compare
Choose a tag to compare
@martin-frbg martin-frbg released this 24 Oct 11:08
· 3028 commits to release-0.3.0 since this release
c5f280a

common:

  • Fixed missing BLAS/LAPACK functions (inadvertently dropped during
    the build system restructuring to support selective compilation)
  • Fixed argument conversion macro in LAPACKE_zgesvdq (LAPACK #458)

POWER:

  • Added optimized SCOPY/CCOPY kernels for POWER10
  • Increased and unified the default size of the GEMM buffer
  • Fixed building for POWER10 in DYNAMIC_ARCH mode
  • POWER10 compatibility test now checks binutils version as well
  • Cleaned up compiler warnings

x86_64:

  • corrected compiler version checks for AVX2 compatibility
  • added compiler option -mavx2 for building with flang
  • fixed direct SGEMM pathway for small matrix sizes (broken by
    the code refactoring in 0.3.11)
  • fixed unhandled partial register clobbers in several kernels
    for AXPY,DOT,GEMV_N and GEMV_T flagged by gcc10 tree-vectorizer

ARMV8:

  • improved Apple Vortex support to include cross-compiling

Download OpenBLAS

md5sums:
03bff4558fc701b7d0e689814055ecb2 OpenBLAS-0.3.12.zip
baf8c58c0ef6ebe0f9eb74a5c4acd662 OpenBLAS-0.3.12.tar.gz
4df4ebb7b5c4f1b5ec8fa58f48be6a51 OpenBLAS-0.3.12-x64.zip