v0.1.9a4

Pre-release

@Luosuu released this 05 May 18:21 · commit 5be7db9 · 78 commits to main since this release

What's Changed

  • [model] fix: qwen3_moe router double-softmax and idempotent _init_weights wrap by @TimYangst in #715
  • [ops, model] feat: dispatch Qwen3.5 linear-attention kernels via OpSlot by @TimYangst in #714
  • [ops, model] feat: forward temperature and return entropy from log-probs path by @Luosuu in #720
  • [ci, model] test: add bitwise logits-equal tests for transformers v4 models by @TimYangst in #721
  • [release] chore: release v0.1.9a4 by @Luosuu in #723

Full Changelog: v0.1.9a3...v0.1.9a4