Skip to content

CUTLASS 4.5.2

Latest

Choose a tag to compare

@hwu36 hwu36 released this 16 Jun 03:04

CuTe DSL

  • New features

    • Python 3.14t is now supported with GIL enabled
  • Bug fixing and improvements

CUTLASS C++

  • Fix missing convert fucntion in EVT for fp4 kernels.
  • Avoid instantiate 2sm tma kernels where ctaN is none power of 64 when ctaN > 128 in profiler.