Releases · ROCm/Tensile

20 May 13:15

rocm-ci

rocm-6.4.1

be49885

Tensile 4.43.0 for ROCm 6.4.1 Latest

Latest

Tensile code for ROCm 6.4.1 did not change. The library was rebuilt for the updated ROCm 6.4.1 stack.

Assets 2

11 Apr 13:34

rocm-ci

rocm-6.4.0

be49885

Tensile 4.43.0 for ROCm 6.4.0

Added

Nightly builds with performance statistics
Cache asm capabilities for reuse
venv for Tensile create on Linux
Flag to keep build_tmp when running Tensile
Generalized profiling scripts
GFX1151 support
Single-threaded support in TensileCreateLibrary
Logic to remove temporary build artifacts

Changed

Updated Tensile documents (API reference, README.md, and comments)
Disabled asm-cache for tests
Used hipcc.bat as a compiler on Windows instead of the Perl script
Improved clarity of CHANGELOG.md
Enabled external CI
Improved Tensile documentation
Refactored kernel source and header creation
Refactored writeKernels in TensileCreateLibrary
Suppressed developer warnings (simplifying the Tensile output)
Used an explicit cast when invoking min is called
Used cache abbreviations to compute kernel names

Removed

OCL backend
Unsupported tests
Deep copy in TensileCreateLibrary

Optimized

Linearized asm register search to reduce build time

Resolved issues

Fixed Stream-K dynamic grid model
Fixed logic related to caching asm capabilities
Fixed accvgpr overflow
Fixed test failures in SLES containers when running TensileTests
Fixed a regression that prevents TensileCreateLibrary from completing when fallback logic is not available

Assets 2

06 Nov 19:55

rocm-ci

rocm-6.2.4

81ae953

Tensile 4.41.0 for ROCm 6.2.4

Tensile code for ROCm 6.2.4 did not change. The library was rebuilt for the updated ROCm 6.2.4 stack.

Assets 2

19 Feb 17:47

rocm-ci

rocm-6.3.3

aca95d1

Tensile 4.42.0 for ROCm 6.3.3

Tensile code for ROCm 6.3.3 did not change. The library was rebuilt for the updated ROCm 6.3.3 stack.

Assets 2

28 Jan 15:43

rocm-ci

rocm-6.3.2

aca95d1

Tensile 4.42.0 for ROCm 6.3.2

Tensile code for ROCm 6.3.2 did not change. The library was rebuilt for the updated ROCm 6.3.2 stack.

Assets 2

20 Dec 16:12

rocm-ci

rocm-6.3.1

aca95d1

Tensile 4.42.0 for ROCm 6.3.1

Tensile code for ROCm 6.3.1 did not change. The library was rebuilt for the updated ROCm 6.3.1 stack.

Assets 2

03 Dec 19:49

rocm-ci

rocm-6.3.0

aca95d1

Tensile 4.42.0 for ROCm 6.3.0

Additions

add contributor and developer guide
add testing and documentation for MasterSolutionLibrary.ArchitectureIndexMap and remapSolutionIndicesStartingFrom
add gfx12 support
add functions for writing master file
add tPrint and reconciles printing options
add Python unit test coverage report
add factor embed library logic into function and test
add clang++ as cxx-compiler option for windows
add logic to cope with different compilers
add generateManifest fxn and rename generateManifest to toFile and move to Utilities
add profiling CI job
add support for amdclang and use defaults
add architecture management functions to TensileCreateLibrary
add TensileCreateLibrary cli reference docs
add new documentation (sphinx prototype, build out skeleton)

Optimizations

add prediction model for optimal number of Stream-K tiles to run
use analytical grid size prediction model for Stream-K
remap XCC-based workgroup for Stream-K kernels
add two-tile algorithm with Stream-K after DP
add atomic 2-tile Stream-K and clean-up tuning parameters

Changes

improve rocBLAS build output by allowing warning suppression, ignoring only developer warnings, progress bar and quiet printing
reorder extensions for Windows in which function
remove deprecated flag from CI profiling job
update amdclang++ and asm directories
update duplicate marking tests with mocks
remove diagnostic print, and restore print ordering, and add missing print option
bump rocm-docs-core from 1.2.0 to 1.5.0 in /docs/sphinx
refactor kernel duplicate matching
refactor generateLogicDataAndSolutions
remove globals from prepAsm
restrict XCC mapping to gfx942
refactor argument parsing in TensileCreateLibrary
disable failing rhel9 tests
change line length for formatting to 100 characters
change YAML operations to use C libyaml backend
improve warning wording
remove deprecated package-library option
update clang support for Windows
update supportedCompiler fxn
use conditional choices and defaults
remove duplicate which function and minor cleanup
refactor sanity check in TensileCreateLibrary
factor client config logic from TensileCreateLibrary main into createClientConfig
use glob to find logic files in TensileCreateLibrary
use function to confirm supported compiler rather than raw logic
update verifyManifest in TensileCreateLibrary
update RTD configs
cleanup the CMake to prevent redundant work in client builds
update Stream-K debug settings

Fixes

fix Stream-K XCC configs for gfx942
update WMMA capability command for ISA 10+
fix progress bar character encoding error on Windows
fix solution redundancy removal
fix tuning imports for pyyaml
fix printing ASM capabilities for ROCm < 6.3
fix code objects by filtering kernels with build errors and unprocessed kernels
fix fully qualify std::get in contraction solutions
fix add -v flag and change system invocation
use conditional imports for new dependencies to fix yaml CSafe load and dump import, and to fix rich terminal print import
fix comments on scalarStaticDivideAndRemainder

Assets 2

12 Mar 18:30

rocm-ci

rocm-6.1.5

bf05992

Tensile 4.40.0 for ROCm 6.1.5

Tensile code for ROCm 6.1.5 did not change. The library was rebuilt for the updated ROCm 6.1.5 stack.

Assets 2

04 Jun 16:52

rocm-ci

rocm-6.1.2

bf05992

Tensile 4.40.0 for ROCm 6.1.2

Tensile code for ROCm 6.1.2 did not change. The library was rebuilt for the updated ROCm 6.1.2 stack.

Assets 2

08 May 17:59

rocm-ci

rocm-6.1.1

bf05992

Tensile 4.40.0 for ROCm 6.1.1

Tensile code for ROCm 6.1.1 did not change. The library was rebuilt for the updated ROCm 6.1.1 stack.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Added

Changed

Removed

Optimized

Resolved issues

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Additions

Optimizations

Changes

Fixes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Releases: ROCm/Tensile

Tensile 4.43.0 for ROCm 6.4.1

Uh oh!

Tensile 4.43.0 for ROCm 6.4.0

Added

Changed

Removed

Optimized

Resolved issues

Uh oh!

Tensile 4.41.0 for ROCm 6.2.4

Uh oh!

Tensile 4.42.0 for ROCm 6.3.3

Uh oh!

Tensile 4.42.0 for ROCm 6.3.2

Uh oh!

Tensile 4.42.0 for ROCm 6.3.1

Uh oh!

Tensile 4.42.0 for ROCm 6.3.0

Additions

Optimizations

Changes

Fixes

Uh oh!

Tensile 4.40.0 for ROCm 6.1.5

Uh oh!

Tensile 4.40.0 for ROCm 6.1.2

Uh oh!

Tensile 4.40.0 for ROCm 6.1.1

Uh oh!