v1.1.0
What's Changed
- Igore vuln in pygments by @xjmxyt in #86
- Fix random mhc test and benchmark failures & Add unsloth geglu and grouped_gemm kernels by @hannahli-nv in #84
- bench: replace unfused reference_rms_norm with F.rms_norm as PyTorch baseline by @xjmxyt in #89
- Add tileiras optional dependency for bundled compiler support by @hannahli-nv in #90
- Add sparse MLA forward op in experimental by @Weili-0234 in #91
- Bump CUDA base image from 13.1.0 to 13.2.0 by @hannahli-nv in #85
- Bump version from 1.0.1 to 1.1.0 by @hannahli-nv in #94
- Remove cuda-tile-experimental URL dep for PyPI compatibility by @hannahli-nv in #95
Full Changelog: v1.0.1...v1.1.0