Skip to content

V0.1.6: Add SCBench

Latest
Compare
Choose a tag to compare
@github-actions github-actions released this 17 Jun 09:29
d76b76e

What's Changed

Feature

  • [PreRelease]: SCBench by @iofu728 in #96
  • Feature(MInference): support transformers>=4.46.0, add chunk mlp by @iofu728 in #133, #113
  • Feature(MInference): update release cuda version by @iofu728 in #134
  • Fix(MInference): fix the search pattern feature by @iofu728 in #156

Ops

  • Feature(FlexPrefill): add flex-prefill by @liyucheng09 in #100
  • Feature(MInference): add xAttention by @iofu728 in #149
  • Feature(MInference): support SGLang and vLLM vertical_and_slash flash attention and index kernels by @iofu728 in #153

Model support

Bug Fix

New Contributors

Full Changelog: v0.1.5.post1...v0.1.6