fix(mlx): point Qwen 0.6B at the published mlx-community repo #501
The 0.6B slot was aliased to the 1.7B repo as a temporary fallback because `mlx-community/Qwen3-TTS-12Hz-0.6B-Base-bf16` wasn't published when MLX support shipped. That conversion is live now, so use it: Apple Silicon users picking 0.6B get the actual 0.6B model (1.2 GB instead of 3.5 GB). Also drops the now-obsolete troubleshooting entry and updates the triage notes in PROJECT_STATUS.md.

Fixes #485.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
CodeRabbit review: no actionable comments were generated in the recent review. Review profile: CHILL. Files selected for processing: 5, of which 1 had no reviewable changes.
Walkthrough

This pull request corrects a bug where selecting the Qwen TTS 0.6B model incorrectly downloaded the 1.7B variant. The fix updates model repository IDs in the MLX backend configuration and removes stale fallback logic, with accompanying documentation and test updates.
Estimated code review effort: 2 (Simple), ~10 minutes.

Pre-merge checks: 4 passed, 1 failed (1 warning).
Summary
- On the MLX backend, Qwen TTS 0.6B now resolves to `mlx-community/Qwen3-TTS-12Hz-0.6B-Base-bf16` instead of falling back to the 1.7B repo.

Why

When the MLX backend shipped, `mlx-community` hadn't converted the 0.6B base yet, so the 0.6B slot was aliased to the 1.7B repo as a temporary fallback (`# 0.6B not available in MLX, falls back`). That conversion is now live (1.4k downloads, published 2026-01-22), so Apple Silicon users picking 0.6B should get the actual 0.6B weights (~1.2 GB instead of ~3.5 GB), with the corresponding speed/memory win.

Reported in #485; also covers the older #423 / #329 reports.

Qwen CustomVoice is unaffected — it runs via PyTorch on all platforms and both sizes already have dedicated repos.
Test plan
- Confirm that selecting 0.6B on Apple Silicon resolves to `mlx-community/Qwen3-TTS-12Hz-0.6B-Base-bf16` (not the 1.7B one) and the download size is ~1.2 GB.
- … `Qwen/...` repos.

Fixes #485.
🤖 Generated with Claude Code
Note
Medium Risk
Changes which HuggingFace repo is downloaded/loaded for Qwen 0.6B on the MLX backend, which can affect cache reuse and download behavior for existing users. Logic is small and localized but touches core model resolution paths.
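The cache-reuse concern can be illustrated with a small sketch. It assumes the app downloads through `huggingface_hub` with its default on-disk cache layout, which this PR does not state; `hub_cache_dir` is a hypothetical helper, not part of the project or the library. Under that layout each repo ID gets its own `models--<org>--<name>` folder, so a changed repo ID means a separate folder and a fresh download, while any previously cached 1.7B weights stay where they are.

```python
from pathlib import Path


def hub_cache_dir(repo_id: str, cache_root: Path) -> Path:
    """Return the folder the default Hugging Face hub cache layout uses for a model repo.

    Hypothetical helper for illustration only: it mirrors the
    `models--<org>--<name>` naming convention with plain string handling
    and does not call into huggingface_hub.
    """
    return cache_root / ("models--" + repo_id.replace("/", "--"))


# Assumed default cache root; it can be relocated via environment variables.
default_root = Path.home() / ".cache" / "huggingface" / "hub"

new_repo = "mlx-community/Qwen3-TTS-12Hz-0.6B-Base-bf16"
print(hub_cache_dir(new_repo, default_root))
```

Users who previously selected 0.6B on MLX would therefore see a new download the first time they load the model after this change.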
Overview
Fixes Apple Silicon (MLX) model resolution so selecting Qwen TTS 0.6B downloads/loads `mlx-community/Qwen3-TTS-12Hz-0.6B-Base-bf16` instead of incorrectly aliasing to the 1.7B repo.

Updates the MLX backend’s model-size mapping and the backend-aware model registry accordingly, adjusts the Qwen download test to remove the stale fallback note, and cleans up docs by removing/rewriting troubleshooting and project-status guidance that assumed the fallback behavior.
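A minimal sketch of what a size-to-repo mapping with the fallback removed might look like. All identifiers here (`MLX_QWEN_REPOS`, `resolve_mlx_repo`, `find_aliased_sizes`) are hypothetical, since the backend's real names are not shown in this PR, and the 1.7B repo ID is assumed by analogy with the 0.6B name rather than taken from the source.

```python
from typing import Dict, List

# Hypothetical mapping; the real backend's variable name is not shown in this PR.
MLX_QWEN_REPOS: Dict[str, str] = {
    "0.6B": "mlx-community/Qwen3-TTS-12Hz-0.6B-Base-bf16",
    # Assumed by analogy with the 0.6B repo name; not confirmed by this PR.
    "1.7B": "mlx-community/Qwen3-TTS-12Hz-1.7B-Base-bf16",
}


def resolve_mlx_repo(model_size: str) -> str:
    """Map a model size label to its repo ID, failing loudly on unknown sizes."""
    try:
        return MLX_QWEN_REPOS[model_size]
    except KeyError:
        known = ", ".join(sorted(MLX_QWEN_REPOS))
        raise ValueError(f"Unknown model size {model_size!r}; expected one of: {known}") from None


def find_aliased_sizes(mapping: Dict[str, str]) -> List[List[str]]:
    """Return groups of size labels that point at the same repo ID."""
    sizes_by_repo: Dict[str, List[str]] = {}
    for size, repo in mapping.items():
        sizes_by_repo.setdefault(repo, []).append(size)
    return [sorted(sizes) for sizes in sizes_by_repo.values() if len(sizes) > 1]
```

A check like `find_aliased_sizes` could serve as a regression guard in a test: it returns an empty list for the mapping above, and would flag the old fallback, where both sizes shared one repo ID.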
Reviewed by Cursor Bugbot for commit c70ec32.