Skip to content

CUTLASS 4.2.2

Choose a tag to compare

@hwu36 hwu36 released this 21 Jun 16:44

CUTLASS C++

  • Make version.h NVRTC JIT compilation compatible.
  • Allow linking large cutlass library on 64bit platform.
  • Fix alignment-related miscalculation for pipeline stages of Blackwell blockscaled GEMM.
  • Fix for blockwise group gemm nosmem epilogues and no sfd with nosmem group gemm epilogues.