Releases · ROCm/MIOpen

MIOpen now supports Navi21!! (via MIOpen PRs 973, 780, 764, 740, 739, 677, 660, 653, 493, 498)
Fixed a correctness issue with ImplicitGemm algorithm
Updated the performance data for new kernel versions
Improved MIOpen build time by splitting large kernel header files
Fixed an issue in reduction kernels for padded tensors
Various other bug fixes and performance improvements

Assets 2

17 May 19:31

JehandadKhan

2.11.0

516fb5b

MIOpen v2.11.0

Notes

This release contains various bug fixes and performance improvements.

Changes

Updates for Target ID features in ROCm stack
Correctness fix in Batchnorm kernels
Various bug fixes for MIOpenGEMM on the OpenCL backend
Various bug fixes in 3x3 assembly kernels

Assets 2

25 Mar 15:55

JehandadKhan

2.10.0

010ef4a

MIOpen v2.10.0

Notes

This release contains new reduction operations, Winograd algorithm performance improvements as well as bug fixes. Various host side performance improvements have been added as well.

Changes

Added a GPU reference kernel implementation for faster testing.
Add TargetID support for new AMD GPU architectures.
Implementation of four additional generic tensor reduction operations (AVG, AMAX, NORM1, NORM2).
Fixed a bug where Batchnorm would give incorrect results when the product of image height and image width is not a factor of four.
Various host side improvements for better find and tuning performance.
Added support for AMD Code Object V4.

Assets 2

02 Dec 02:15

daniellowell

2.9.0

a4af24a

MIOpen v2.9.0

Notes:

This release contains implicit GEMM algorithm performance updates and bug fixes. Additional performance improvements have been implement for batch normalization.

Changes:

Added new assembly implicit GEMM kernels
Added batch normalization optimizations
Fixed issue where miopen-hip backend install would not search for rocBLAS dependency
Removed missing tunings from previous release cycle
Removed deprecated implicit GEMM xDLOPs solvers
Removed incorrect error messages from implicit GEMM solvers
Disabled ConvAsmBwdWrW3x3 solver for stride > 1 cases
Disabled bidirectional multi-pass kernels due to stability issues

Assets 2

29 Oct 00:57

daniellowell

2.8.0

add232c

MIOpen v2.8.0

Notes:

This release provides additional bug fixes and support for embedded builds using MIOpen as a static library.

Changes:

Fixed workspace size calculation for GEMM group convolutions
Fixed performance regression for M/N
Fixed issue with faulty compiler option
Fixed typo in components dependency variable in CMakeLists.txt
Fixed issues with COMgr backed online compilation for HIP kernels
Added cmake flag for embedding system databases when building a static library
Added a way to disable building MIOpenDriver when building a static library
Added CC compiler detection in ROCm environment
Known issue: This release may show warnings for "obsolete configs" in the performance database. This can be fixed by rerunning tuning on a specific network; see tuning documentation

Assets 2

18 Sep 21:13

daniellowell

2.7.0

3aee82e

MIOpen v2.7.0

Notes:

This release contains a new reduction API; see API documentation for more information. Additional features for embedded builds have been added, and further support for 3D convolutional networks.

Changes:

Added additional tunings into performance database
Added general reduction API
Added cmake flag for embedding binary database into a static MIOpen build
Added cmake flag for embedding system find-db text files into static MIOpen build
Fixed issue with GEMM workspace size calculation for backwards data convolutions #381
Fixed issue with 3D pooling indexing #365

Assets 2

20 Aug 19:40

daniellowell

rocm-3.7.0

c16087a

MIOpen v2.6.0

Notes:

This release contains convolution performance improvements, improved multi-threading behavior, and improved stability for half precision convolutions. Initial iteration time has been reduced with the introduction of hybrid find mode. Builds for a static library have been refined for this release.

Changes:

Added MIOPEN_FIND_MODE=3 as the new default convolution Find mode; see documentation here for details
Added a more runtime-parameterized version of pooling to reduce the number of online compilations
Improved the performance of backwards spatial batch normalization for small images
Fixed issue with std::logic_error in SQLite deleter #306
Fixed issues with half precision stability for convolutions
Fixed issues with multi-threaded SQLite database accesses
Fixed issues with 3-D convolutions and incorrect parameters
Fixed various issues with implicit GEMM static assert failures
Removed inactive implicit GEMM convolution solvers
Removed SCGEMM convolutional algorithm from MIOpen

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Notes

Changes

Notes

Changes

Notes

Changes

Notes

Changes

Notes

Changes

Releases: ROCm/MIOpen

rocm-5.3.0

MIOpen v2.16.0

Notes

Changes

MIOpen v2.14.0

Notes

Changes

MIOpen v2.12.0

Notes

Changes

MIOpen v2.11.0

Notes

Changes

MIOpen v2.10.0

Notes

Changes

MIOpen v2.9.0

MIOpen v2.8.0

MIOpen v2.7.0

MIOpen v2.6.0