Skip to content

v1.1.0

Choose a tag to compare

@arjkesh arjkesh released this 03 Apr 10:27
c5f4d54

What's Changed

  • Igore vuln in pygments by @xjmxyt in #86
  • Fix random mhc test and benchmark failures & Add unsloth geglu and grouped_gemm kernels by @hannahli-nv in #84
  • bench: replace unfused reference_rms_norm with F.rms_norm as PyTorch baseline by @xjmxyt in #89
  • Add tileiras optional dependency for bundled compiler support by @hannahli-nv in #90
  • Add sparse MLA forward op in experimental by @Weili-0234 in #91
  • Bump CUDA base image from 13.1.0 to 13.2.0 by @hannahli-nv in #85
  • Bump version from 1.0.1 to 1.1.0 by @hannahli-nv in #94
  • Remove cuda-tile-experimental URL dep for PyPI compatibility by @hannahli-nv in #95

Full Changelog: v1.0.1...v1.1.0