Skip to content

v0.2.0 — local LLM polish (your dictation, cleaned up on-device)

Choose a tag to compare

@AlexFlanker AlexFlanker released this 02 Jun 06:52
· 1 commit to main since this release

The first release on the core-quality axis: v0.1.x made dictation pleasant to use and manage; v0.2.0 makes it read better — fully on-device.

Added

  • 🧠 Local LLM polish — after transcription, an optional local Ollama model tidies the text before it's pasted:
    • light — faithful: fixes punctuation, typos, obvious stumbles, keeps your wording.
    • concise — trims filler / repetition / 口头禅 for a much shorter result.
    • Default model glm4 (GLM-4-9B); 100% offline; opt-in (POLISH_ENABLED, off by default); a hard timeout falls back to the raw transcript, so a slow or absent Ollama never blocks dictation.
  • 🤖 MCP set_polish (mode / model / enabled) — flip light↔concise, swap models, or toggle it, right from Claude Desktop. POLISH_* added to the writable allowlist; polish state shows in status().

Enable it

  1. brew install ollama && ollama pull glm4 (GLM-4-9B, ~5.5 GB; or qwen2.5:3b for lighter/faster).
  2. POLISH_ENABLED=true in .env and restart — or ask Claude Desktop "turn on polish, concise mode".

How it ships

Kit-owned helper src/transcription/ollama_polish.py (MIT) copied in by install.sh; the patcher only wires 4 thin main.py hooks. Verified byte-identical to a known-good install (24 edits + 3 shipped files) and idempotent. No upstream code redistributed.

macOS only. Not affiliated with or endorsed by upstream. See CREDITS.md.