Skip to content

v0.16.0

@perrette perrette tagged this 19 May 23:08
Highlights:
- OpenAI realtime streaming model gpt-realtime-whisper, surfaced
  under the existing openai backend (no separate --backend value).
  Streams partials live as you speak, identical UX to Vosk.
- New --type-direct flag (and Options → Keyboard backend → Type
  directly): bypass the clipboard and synthesise raw keystrokes —
  works in terminals and any other app without depending on a paste
  shortcut. Caveat: xkb-layout-dependent for non-ASCII.
- Auto-pick is now cloud-first: groq → openai → whisper → vosk.
- README + docs/ restructured: short landing page, in-depth content
  split into per-topic pages under docs/.
- VoskTranscriber migrated to StreamingSTTBackend (shared with
  desktop-ai-core>=0.2.0).
- Pasting-into-terminal callout: Ctrl+Shift+V workaround.

Fixes:
- Groq picked as default backend without explicit --model no longer
  raises 'Unknown backend' (latent bug exposed by the cloud-first
  auto-pick order).
Assets 2
Loading