Skip to content

rocFFT 1.0.23 for ROCm 5.6.0

Compare
Choose a tag to compare
@rocm-ci rocm-ci released this 28 Jun 23:17
946a75d

Added

  • Implemented half-precision transforms, which can be requested by passing rocfft_precision_half to rocfft_plan_create.
  • Implemented a hierarchical solution map which saves how to decompose a problem and the kernels to be used.
  • Implemented a first version of offline-tuner to support tuning kernels for C2C/Z2Z problems.

Changed

  • Replaced std::complex with hipComplex data types for data generator.
  • FFT plan dimensions are now sorted to be row-major internally where possible, which produces better plans if the dimensions were accidentally specified in a different order (column-major, for example).
  • Added --precision argument to benchmark/test clients. --double is still accepted but is deprecated as a method to request a double-precision transform.

Fixed

  • Fixed over-allocation of LDS in some real-complex kernels, which was resulting in kernel launch failure.