Detto 2.0.0
[2.0.0] — 2026-06-02
Initial public release of Detto, a complete rewrite of Tome. All processing on-device.
Added
- Dictation mode: hold-to-speak with configurable hotkeys (Control, Fn, Right Option, Right Command, Hyper Key, custom combos). Floating overlay with waveform visualization. Text injection at the cursor.
- Client briefing system: vault-aware context loading, attendee tracking, per-client transcript folders. Reads client metadata from your Obsidian vault.
- Vocabulary correction: 3-pass post-ASR correction (explicit mappings, case normalization, fuzzy matching). Bundled Canadian Politics pack (53 terms, 30 corrections); custom vocabulary file support via Settings.
- On-device transcript refinement: optional post-session cleanup via Llama 3.2 3B (MLX). Fixes proper nouns and grammar with five validation guards against hallucination.
- Channel-based speaker separation: dual-stream mic + system audio with per-stream VAD and ASR. Improved diarization on call recordings.
- Confidence scores: Parakeet ASR confidence captured per utterance and used to skip refinement on high-confidence segments.
Changed
- ASR engine: Parakeet-TDT v3 via GrembleVoice (25 European languages, multilingual auto-detection).
- License: MIT → Business Source License 1.1 (converts to MIT 2030-05-12). Existing Tome v1 forks retain their MIT copy.
- CI: signed and notarized DMG builds via GitHub Actions.
Removed
- Tome v1 codebase (complete rewrite).