Problem
Operator needs a fast, local STT path for capturing spoken "things to do" without leaving the current focus. Existing voice integrations (ElevenLabs/Gemini/Qwen) are cloud realtime voice assistants -- different latency budget, different privacy posture, different intent.
Proposal
Wire the modified OpenWhispr build (local Whisper STT) as a background listener that:
Out of scope here: the inbox UI itself, the keyword/hotkey UX. Those are #2.
Acceptance
Notes
Problem
Operator needs a fast, local STT path for capturing spoken "things to do" without leaving the current focus. Existing voice integrations (ElevenLabs/Gemini/Qwen) are cloud realtime voice assistants -- different latency budget, different privacy posture, different intent.
Proposal
Wire the modified OpenWhispr build (local Whisper STT) as a background listener that:
POST /api/inbox/things-to-do { text, source: "stt", capturedAt }Out of scope here: the inbox UI itself, the keyword/hotkey UX. Those are #2.
Acceptance
inbox_itemstable or similardocs/tooling/(operator-local)Notes