Releases: ROCm/hipTensor
Releases · ROCm/hipTensor
hipTensor 1.3.0 for ROCm 6.2.2
hipTensor code for ROCm 6.2.2 did not change. The library was rebuilt for the updated ROCm 6.2.2 stack.
hipTensor 1.3.0 for ROCm 6.2.1
hipTensor code for ROCm 6.2.1 did not change. The library was rebuilt for the updated ROCm 6.2.1 stack.
hipTensor 1.3.0 for ROCm 6.2.0
Additions
- Added support for tensor permutation of ranks of 2, 3, 4, 5 and 6
- Added tests for tensor permutation of ranks of 2, 3, 4, 5 and 6
- Added support for tensor contraction of M6N6K6: M, N, K up to rank 6
- Added tests for tensor contraction of M6N6K6: M, N, K up to rank 6
- Added new test YAML parsing to support sequential parameters ordering
Changes
- Documentation updates for installation, programmer's guide and API reference
- Prefer amd-llvm-devel package before system LLVM library
- Preferred compilers changed to CC=amdclang CXX=amdclang++
- Updated actor-critic selection for new contraction kernel additions
Fixes
- Fixed LLVM parsing crash
- Fixed memory consumption issue in complex kernels
- Work-around implemented for compiler crash during debug build
- Allow random modes ordering for tensor contractions
rocm-6.1.2
ROCm release v6.1.2
rocm-6.1.1
ROCm release v6.1.1
hipTensor 1.2.0 for ROCm 6.1.0
Additions
- API support for permutation of rank 4 tensors: f16 and f32
- New datatype support in contractions of rank 4: f16, bf16, complex f32, complex f64
- Added scale and bilinear contraction samples and tests for new supported data types
- Added permutation samples and tests for f16, f32 types
Fixes
- Fixed bug in contraction calculation with data type f32
rocm-6.0.2
ROCm release v6.0.2
rocm-6.0.0
ROCm release v6.0.0
rocm-5.7.1
ROCm release v5.7.1
hipTensor 1.0.0 for ROCm 5.7.0
- Initial prototype enablement of hipTensor library supporting tensor operations
- Kernel selection support for Default and Actor-Critic algorithms
- API support for definition and contraction of rank 4 tensors
- API support for contextual logging and output redirection
- API support for kernel selection caching
- Datatype support for f32 and f64
- Architecture support for gfx908 and gfx90a