macOS menu bar dictation tool powered by Deepgram Nova-3. Press a hotkey, talk, and text gets typed into any app in real-time.
- Lives in the menu bar — no dock icon, no window clutter
- Real-time speech-to-text via Deepgram Nova-3 streaming
- Text injection into any focused text field (clipboard + paste)
- Floating pill widget with waveform visualizer
- Custom vocabulary (keyterms) for names, brands, technical terms
- Configurable hotkey, language (EN/RU/Multilingual), microphone
- Auto-save settings
# Clone
git clone https://github.com/iota-uz/speak.git
cd speak
# Install
npm install
# Get a Deepgram API key at https://console.deepgram.com
# You'll enter it in the app settings
# Run
npm startOn first launch, macOS will ask for microphone and accessibility permissions. Grant both.
- Click the tray icon to open settings
- Enter your Deepgram API key
- Set your preferred hotkey (default:
Cmd+Shift+Z) - Select microphone and language
- Press the hotkey to start dictating — a floating pill appears
- Speak — text appears in real-time in whatever app is focused
- Press the hotkey again to stop
Deepgram Nova-3 streaming: ~$0.36/hour. Realistic usage for a product manager is $3-7/month.
- macOS (uses macOS-specific accessibility APIs for text injection)
- Node.js 18+
- Deepgram API key