Skip to content

rocFFT 1.0.22 for ROCm 5.5.0

Compare
Choose a tag to compare
@rocm-ci rocm-ci released this 01 May 21:04
e7d6273

Optimizations

  • Improved performance of 1D lengths < 2048 that use Bluestein's algorithm.
  • Reduced time for generating code during plan creation.
  • Optimized 3D R2C/C2R lengths 32, 84, 128.
  • Optimized batched small 1D R2C/C2R cases.

Added

  • Added gfx1101 to default AMDGPU_TARGETS.

Changed

  • Moved client programs to C++17.
  • Moved planar kernels and infrequently used Stockham kernels to be runtime-compiled.
  • Moved transpose, real-complex, Bluestein, and Stockham kernels to library kernel cache.

Fixed

  • Removed zero-length twiddle table allocations, which fixes errors from hipMallocManaged.
  • Fixed incorrect freeing of HIP stream handles during twiddle computation when multiple devices are present.