v0.2.0 — local LLM polish (your dictation, cleaned up on-device)
The first release on the core-quality axis: v0.1.x made dictation pleasant to use and manage; v0.2.0 makes it read better — fully on-device.
Added
- 🧠 Local LLM polish: after transcription, an optional local Ollama model tidies the text before it's pasted.
  - light: faithful; fixes punctuation, typos, and obvious stumbles while keeping your wording.
  - concise: trims filler, repetition, and verbal tics for a much shorter result.
  - Default model glm4 (GLM-4-9B); 100% offline; opt-in (POLISH_ENABLED, off by default); a hard timeout falls back to the raw transcript, so a slow or absent Ollama never blocks dictation.
- 🤖 MCP set_polish(mode/model/enabled): flip light↔concise, swap models, or toggle polish on and off, right from Claude Desktop. POLISH_* added to the writable allowlist; polish state shows in status().
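
The timeout fallback described above can be sketched roughly as follows. This is an illustration, not the shipped ollama_polish.py: the prompt wording, the 8-second default, and the function names are assumptions. It relies only on Ollama's local HTTP endpoint (POST /api/generate with "stream": false). Note that urllib's timeout applies per socket operation, so it approximates a hard deadline rather than guaranteeing one.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint

# Hypothetical prompts; the kit's real wording is not shown in these notes.
PROMPTS = {
    "light": "Fix punctuation, typos and obvious stumbles. Keep the wording. Return only the text.",
    "concise": "Remove filler and repetition. Make the text much shorter. Return only the text.",
}


def ollama_generate(model, prompt, timeout):
    """Call Ollama's non-streaming generate endpoint and return the reply text."""
    body = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))["response"]


def polish(raw, mode="light", model="glm4", timeout=8.0, generate=ollama_generate):
    """Return the polished transcript, or the raw one if anything goes wrong."""
    if mode not in PROMPTS or not raw.strip():
        return raw
    try:
        cleaned = generate(model, PROMPTS[mode] + "\n\n" + raw, timeout).strip()
    except Exception:
        # Timeouts, refused connections and malformed replies all land here,
        # so dictation continues with the unpolished text.
        return raw
    return cleaned or raw
```

The `generate` parameter exists only so the fallback path can be exercised without a running Ollama; passing a function that raises shows the raw transcript coming back unchanged.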
Enable it
- brew install ollama && ollama pull glm4 (GLM-4-9B, ~5.5 GB; or qwen2.5:3b for lighter/faster).
- Set POLISH_ENABLED=true in .env and restart, or ask Claude Desktop "turn on polish, concise mode".
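
For readers curious how an opt-in flag like this is typically interpreted, here is a minimal sketch that parses .env-style text and treats polish as off unless POLISH_ENABLED is explicitly truthy. The accepted spellings ("1", "true", "yes", "on") and both helper names are assumptions, not the kit's actual parsing rules.

```python
TRUE_VALUES = {"1", "true", "yes", "on"}  # assumed spellings; the kit may accept others


def read_env(text):
    """Parse KEY=VALUE lines, skipping blanks, comments and malformed lines."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        settings[key.strip()] = value.strip().strip('"').strip("'")
    return settings


def polish_enabled(settings):
    """Polish is opt-in: a missing or unrecognised value means off."""
    return settings.get("POLISH_ENABLED", "false").lower() in TRUE_VALUES
```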
How it ships
Kit-owned helper src/transcription/ollama_polish.py (MIT) copied in by install.sh; the patcher only wires 4 thin main.py hooks. Verified byte-identical to a known-good install (24 edits + 3 shipped files) and idempotent. No upstream code redistributed.
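
What "idempotent" means for a patcher can be shown with a small sketch: each hook carries a marker comment, and a second run that finds the marked line leaves the file untouched. The marker text, the anchor strings, and `apply_hook` itself are hypothetical; the real patcher's mechanism is not described in these notes.

```python
MARKER = "# polish-hook"  # hypothetical tag used to recognise an already-applied hook


def apply_hook(source, anchor, hook_line):
    """Insert hook_line after the first line containing anchor, at most once."""
    tagged = f"{hook_line}  {MARKER}"
    lines = source.split("\n")
    if any(line.strip() == tagged for line in lines):
        return source  # already patched: re-running changes nothing
    for index, line in enumerate(lines):
        if anchor in line:
            # Reuse the anchor line's indentation so the hook stays in the same block.
            indent = line[: len(line) - len(line.lstrip())]
            lines.insert(index + 1, indent + tagged)
            return "\n".join(lines)
    raise ValueError(f"anchor not found: {anchor!r}")
```

Applying the same hook twice yields the same text as applying it once, which is the property that makes re-running an installer safe.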
macOS only. Not affiliated with or endorsed by upstream. See CREDITS.md.