Say bye bye to WisprFlow π
Free & open-source voice dictation for Windows & macOS
Powered by Groq Whisper + Llama 3. No subscriptions, no cost.
WisprFlow is a great voice dictation tool, but it costs money. Voxel does the same thing for free.
If you use Claude Code in VS Code (or any AI coding extension), you lose the ability to talk via mic. Voxel brings it back. Hold a hotkey, speak, and your cleaned-up text gets pasted right into the chat, the terminal, or wherever your cursor is.
Windows has built-in voice typing (Win+H), but it produces messy, unedited text. You still have to go back and fix grammar, remove filler words, and add punctuation. That defeats the purpose.
Voxel fixes this by running your speech through two AI steps:
π€ Whisper (speech-to-text) - transcribes your voice accurately
β¨ Llama 3 (text cleanup) - removes filler words, fixes grammar, keeps your tone
The result: you speak naturally, and clean text appears. No editing needed.
| Voxel | WisprFlow | |
|---|---|---|
| Price | Free forever | $8/mo+ |
| Open source | Yes | No |
| AI cleanup | Yes (Groq Llama 3) | Yes |
| Works in any app | Yes | Yes |
| Customizable hotkey | Yes | Yes |
| Offline mode | Yes (local Whisper) | No |
| Dictation history | Yes | Yes |
| Custom voice commands | Yes | Yes |
| Multiple dictation modes | Yes (5 built-in) | Yes |
| Per-app auto profiles | Yes | Yes |
| Undo last dictation | Yes | No |
| Statistics dashboard | Yes | Yes |
| No account required | Just a free Groq API key | Requires account + payment |
| Windows + macOS | Yes | Yes |
- ποΈ Push-to-talk - hold hotkey, speak, release, get clean text
- β¨ AI text cleanup - removes "um", "uh", fixes grammar, adds punctuation
- π Works in any app - Chrome, VS Code, Slack, Word, Discord, Notepad
- β¨οΈ Customizable hotkey - set any combo you like
- π€ Mic selector - choose your recording device
- π Auto-detect language - or pick from 9 languages
- π Dictation History - every transcription saved, searchable
- π 5 Dictation Modes - Default, Professional Email, Casual, Code Comments, Technical
- π£οΈ Voice Commands - say "sign off" β insert your signature template
- β©οΈ Undo Last Dictation - made a mistake? Ctrl+Shift+Z to revert
- π Statistics Dashboard - track your usage and time saved
- πͺ Auto-switch profile per app - use "Professional" in Outlook, "Code" in VS Code
- πΎ Offline Mode - use local Whisper model, no internet needed
- π¨ Dark themed UI with indigo accents
- π Audio feedback (or mute if you prefer)
- π Clipboard fallback - copies text if no text field is focused
- π Always copy to clipboard option
- π Privacy-focused - your API key never leaves your machine
1. π’ Hold your hotkey (default: Ctrl+Shift+Space)
2. ποΈ Speak naturally - say "um" and "like" all you want
3. π΄ Release the hotkey
4. β
Clean, polished text is pasted into whatever app you're using
Works with any app - Chrome, VS Code, Word, Slack, Discord, Notepad, you name it.
Groq's free tier has a limit on audio length per request. If you hit it, just release the hotkey and press it again to start a new recording. It's seamless - you won't lose anything. For most dictation (emails, messages, code comments), you'll never hit the limit.
| Step | Action |
|---|---|
| 1 | Go to console.groq.com |
| 2 | Sign up for a free account (Google/GitHub sign-in works) |
| 3 | Go to API Keys in the sidebar |
| 4 | Click Create API Key |
| 5 | Copy the key (starts with gsk_) |
| 6 | Paste it into Voxel's settings when you first launch |
π‘ No credit card. No trial period. Just free.
Download Voxel_Setup.exe from π¦ Releases, run it, done.
git clone https://github.com/draxctrl/Voxel.git
cd Voxel
pip install -r requirements.txt
python -m src.mainDownload Voxel.dmg from π¦ Releases, open it, drag Voxel into your Applications folder, done.
git clone https://github.com/draxctrl/Voxel.git
cd Voxel/BudgetWhisper-mac
chmod +x setup_mac.sh run.sh
bash setup_mac.sh
bash run.sh
β οΈ macOS will ask for Microphone, Accessibility, and Input Monitoring permissions - say yes to all.
Default hotkey on Mac: Cmd+Shift+Space
Right-click the tray icon to access Settings, History, or Statistics. Inside Settings:
| Setting | Description |
|---|---|
| π API Key | Your Groq API key |
| β¨οΈ Hotkey | Customizable push-to-talk key combo |
| β©οΈ Undo hotkey | Key combo to undo last dictation |
| π Dictation Mode | 5 built-in cleanup styles + custom profiles |
| π£οΈ Voice Commands | Define trigger phrases that expand to templates |
| π€ Microphone | Choose your recording device |
| π Language | Auto-detect or pick from 9 languages |
| π Mute sounds | Disable the chime after recording |
| πͺ Auto-switch profile per app | Use different modes for different apps |
| πΎ Offline mode | Use local Whisper model instead of API |
| π Always copy to clipboard | Keep text on clipboard after paste |
| π¬ Show clipboard notice | Notification when text copied instead of pasted |
| Component | Technology |
|---|---|
| Language | Python 3.13+ |
| AI Backend | Groq API (free) - Whisper Large v3 + Llama 3.3 70B |
| Offline Transcription | faster-whisper (CTranslate2) |
| UI Framework | PyQt6 |
| Hotkey Listener | pynput |
| Audio | PyAudio |
| System Tray | pystray |
| Storage | SQLite (history) + JSON (config) |
| Packaging | PyInstaller + NSIS |