Skip to content

v0.15.3

Choose a tag to compare

@Alex-Wengg Alex-Wengg released this 13 Jun 03:58
· 20 commits to main since this release
3c6e79f

What's Changed

  • feat(tts): M5 benchmark re-baseline + Kokoro ane-tail-gpu (M5 fix) + Supertonic int4 default + PocketTTS v2.1 ANE docs in #666
  • fix(kokoro): make the M5-safe routing the default (Kokoro works on M5 out of the box) in #671
  • chore(asr): remove experimental Parakeet CTC zh-CN Mandarin model in #675
  • Remove experimental Magpie multilingual TTS backend in #674
  • chore(asr): remove experimental Qwen3 ASR backend in #676
  • fix(download): retry transient per-file failures in downloadRepo by @JulianPscheid in #681
  • feat(tts/pocket): ANE placements — rank-4 split-KV models (.ane) + MLState pipeline (.aneState) by @Alex-Wengg in #679
  • fix(kokoro): route the Noise stage to GPU in the M5-safe preset (+~10% synth) in #677
  • feat(asr/eou): opt-in fused decoder+joint_decision path (+7-9% RTFx, WER neutral-or-better) in #680
  • feat(asr): expose per-token timings from Nemotron streaming ASR (English + multilingual) by @JulianPscheid in #673
  • Add TypeWhisper to the FluidAudio showcase by @SeoFood in #685
  • fix(asr): make SlidingWindowAsrConfig.default fit the model's 240k-sample input in #689
  • fix(asr): splice long-form chunk merges on SentencePiece word boundaries in #688
  • fix(tts): Misaki-lexicon-first English frontend for KokoroAne in #692
  • feat(asr): Parakeet Unified 0.6B backend (chunked-attention streaming + offline batch) in #693

New Contributors

Full Changelog: v0.15.2...v0.15.3