Skip to content

draxctrl/Voxel

Repository files navigation

Voxel - Your voice, clean text, any app

πŸŽ™οΈ Voxel

Hold a hotkey. Speak. Clean text appears wherever your cursor is.

Say bye bye to WisprFlow πŸ‘‹

Windows macOS Python License: MIT


Free & open-source voice dictation for Windows & macOS
Powered by Groq Whisper + Llama 3. No subscriptions, no cost.



πŸ€” Why Voxel exists

WisprFlow is a great voice dictation tool, but it costs money. Voxel does the same thing for free.

If you use Claude Code in VS Code (or any AI coding extension), you lose the ability to talk via mic. Voxel brings it back. Hold a hotkey, speak, and your cleaned-up text gets pasted right into the chat, the terminal, or wherever your cursor is.

Windows has built-in voice typing (Win+H), but it produces messy, unedited text. You still have to go back and fix grammar, remove filler words, and add punctuation. That defeats the purpose.

Voxel fixes this by running your speech through two AI steps:

🎀 Whisper (speech-to-text) - transcribes your voice accurately

✨ Llama 3 (text cleanup) - removes filler words, fixes grammar, keeps your tone

The result: you speak naturally, and clean text appears. No editing needed.

Voxel vs WisprFlow

Voxel WisprFlow
Price Free forever $8/mo+
Open source Yes No
AI cleanup Yes (Groq Llama 3) Yes
Works in any app Yes Yes
Customizable hotkey Yes Yes
Offline mode Yes (local Whisper) No
Dictation history Yes Yes
Custom voice commands Yes Yes
Multiple dictation modes Yes (5 built-in) Yes
Per-app auto profiles Yes Yes
Undo last dictation Yes No
Statistics dashboard Yes Yes
No account required Just a free Groq API key Requires account + payment
Windows + macOS Yes Yes

✨ Features

Core

  • πŸŽ™οΈ Push-to-talk - hold hotkey, speak, release, get clean text
  • ✨ AI text cleanup - removes "um", "uh", fixes grammar, adds punctuation
  • 🌐 Works in any app - Chrome, VS Code, Slack, Word, Discord, Notepad
  • ⌨️ Customizable hotkey - set any combo you like
  • 🎀 Mic selector - choose your recording device
  • 🌍 Auto-detect language - or pick from 9 languages

Smart Features

  • πŸ“š Dictation History - every transcription saved, searchable
  • 🎭 5 Dictation Modes - Default, Professional Email, Casual, Code Comments, Technical
  • πŸ—£οΈ Voice Commands - say "sign off" β†’ insert your signature template
  • ↩️ Undo Last Dictation - made a mistake? Ctrl+Shift+Z to revert
  • πŸ“Š Statistics Dashboard - track your usage and time saved
  • πŸͺ„ Auto-switch profile per app - use "Professional" in Outlook, "Code" in VS Code
  • πŸ’Ύ Offline Mode - use local Whisper model, no internet needed

Polish

  • 🎨 Dark themed UI with indigo accents
  • πŸ”Š Audio feedback (or mute if you prefer)
  • πŸ“‹ Clipboard fallback - copies text if no text field is focused
  • πŸ“‹ Always copy to clipboard option
  • πŸ”’ Privacy-focused - your API key never leaves your machine

⚑ How it works

1. 🟒 Hold your hotkey (default: Ctrl+Shift+Space)
2. πŸŽ™οΈ Speak naturally - say "um" and "like" all you want
3. πŸ”΄ Release the hotkey
4. βœ… Clean, polished text is pasted into whatever app you're using

Works with any app - Chrome, VS Code, Word, Slack, Discord, Notepad, you name it.


⏱️ Recording limit

Groq's free tier has a limit on audio length per request. If you hit it, just release the hotkey and press it again to start a new recording. It's seamless - you won't lose anything. For most dictation (emails, messages, code comments), you'll never hit the limit.


πŸ”‘ Getting a Groq API Key (free)

Step Action
1 Go to console.groq.com
2 Sign up for a free account (Google/GitHub sign-in works)
3 Go to API Keys in the sidebar
4 Click Create API Key
5 Copy the key (starts with gsk_)
6 Paste it into Voxel's settings when you first launch

πŸ’‘ No credit card. No trial period. Just free.


πŸͺŸ Install (Windows)

Option 1: Installer

Download Voxel_Setup.exe from πŸ“¦ Releases, run it, done.

Option 2: From source

git clone https://github.com/draxctrl/Voxel.git
cd Voxel
pip install -r requirements.txt
python -m src.main

🍎 Install (macOS)

Option 1: DMG

Download Voxel.dmg from πŸ“¦ Releases, open it, drag Voxel into your Applications folder, done.

Option 2: From source

git clone https://github.com/draxctrl/Voxel.git
cd Voxel/BudgetWhisper-mac
chmod +x setup_mac.sh run.sh
bash setup_mac.sh
bash run.sh

⚠️ macOS will ask for Microphone, Accessibility, and Input Monitoring permissions - say yes to all.

Default hotkey on Mac: Cmd+Shift+Space


βš™οΈ Settings

Right-click the tray icon to access Settings, History, or Statistics. Inside Settings:

Setting Description
πŸ”‘ API Key Your Groq API key
⌨️ Hotkey Customizable push-to-talk key combo
↩️ Undo hotkey Key combo to undo last dictation
🎭 Dictation Mode 5 built-in cleanup styles + custom profiles
πŸ—£οΈ Voice Commands Define trigger phrases that expand to templates
🎀 Microphone Choose your recording device
🌍 Language Auto-detect or pick from 9 languages
πŸ”‡ Mute sounds Disable the chime after recording
πŸͺ„ Auto-switch profile per app Use different modes for different apps
πŸ’Ύ Offline mode Use local Whisper model instead of API
πŸ“‹ Always copy to clipboard Keep text on clipboard after paste
πŸ’¬ Show clipboard notice Notification when text copied instead of pasted

πŸ› οΈ Tech stack

Component Technology
Language Python 3.13+
AI Backend Groq API (free) - Whisper Large v3 + Llama 3.3 70B
Offline Transcription faster-whisper (CTranslate2)
UI Framework PyQt6
Hotkey Listener pynput
Audio PyAudio
System Tray pystray
Storage SQLite (history) + JSON (config)
Packaging PyInstaller + NSIS

πŸ’œ Support this project

If Voxel saves you time, consider supporting development:

PayPal

MIT License

About

Free, open-source alternative to WisprFlow. Hold a hotkey, speak naturally, get clean text in any app. Powered by Groq. $0 forever.

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages