v0.15.3
What's Changed
- feat(tts): M5 benchmark re-baseline + Kokoro ane-tail-gpu (M5 fix) + Supertonic int4 default + PocketTTS v2.1 ANE docs in #666
- fix(kokoro): make the M5-safe routing the default (Kokoro works on M5 out of the box) in #671
- chore(asr): remove experimental Parakeet CTC zh-CN Mandarin model in #675
- Remove experimental Magpie multilingual TTS backend in #674
- chore(asr): remove experimental Qwen3 ASR backend in #676
- fix(download): retry transient per-file failures in downloadRepo by @JulianPscheid in #681
- feat(tts/pocket): ANE placements — rank-4 split-KV models (.ane) + MLState pipeline (.aneState) by @Alex-Wengg in #679
- fix(kokoro): route the Noise stage to GPU in the M5-safe preset (+~10% synth) in #677
- feat(asr/eou): opt-in fused decoder+joint_decision path (+7-9% RTFx, WER neutral-or-better) in #680
- feat(asr): expose per-token timings from Nemotron streaming ASR (English + multilingual) by @JulianPscheid in #673
- Add TypeWhisper to the FluidAudio showcase by @SeoFood in #685
- fix(asr): make SlidingWindowAsrConfig.default fit the model's 240k-sample input in #689
- fix(asr): splice long-form chunk merges on SentencePiece word boundaries in #688
- fix(tts): Misaki-lexicon-first English frontend for KokoroAne in #692
- feat(asr): Parakeet Unified 0.6B backend (chunked-attention streaming + offline batch) in #693
New Contributors
- @JulianPscheid made their first contribution in #681
- @SeoFood made their first contribution in #685
- @rcourtman made their first contribution in #684
Full Changelog: v0.15.2...v0.15.3