🎙️ Whisper Hotkey Transcribe

Instantly transcribe your voice to text with a single keypress using OpenAI's Whisper running locally on your machine. Press F9 to start recording, press F9 again to stop and get the transcription copied to your clipboard!

✨ Features

Core Features

One-key operation: Press F9 to start/stop recording
Instant transcription: Audio is processed locally using whisper.cpp
Visual feedback: Desktop notifications show recording status
CPU optimized: Runs efficiently without GPU (perfect for laptops)
Privacy first: Everything runs locally, no internet required

🎨 Enhanced Features (v2.0)

🖱️ Direct Text Injection: Auto-type transcribed text directly where your cursor is (no more copy-paste!)
🎤 Audio Level Check: Automatically detects if microphone is working before recording
📚 Custom Vocabulary: Teach Whisper your technical terms, names, and phrases
⚙️ Beautiful GUI: Modern web-based configuration interface for all settings
🌍 Multi-language Support: Auto-detect or choose from 99+ languages
🎛️ Flexible Output: Choose between clipboard or auto-type mode

New to these features? Check out FEATURES.md for a complete guide!

🖥️ Tested System

This setup was tested and runs perfectly on:

OS: Ubuntu 22.04.5 LTS
CPU: Intel Ultra 9 185H (22 threads)
RAM: 64GB
Desktop: GNOME 42.9

📋 Prerequisites

Ubuntu/Debian-based Linux distribution
GNOME desktop environment (for keybinding setup)
Basic development tools (git, make, gcc)
Audio recording tools (arecord from alsa-utils)
Clipboard tool (xclip or xsel)

🚀 Installation

Quick Install

# Clone this repository
git clone https://github.com/atkvishnu/whisper-hotkey-transcribe.git
cd whisper-hotkey-transcribe

# Run the installation script
chmod +x install.sh
./install.sh

During installation, you'll be asked to choose between:

faster-whisper (Python-based, faster transcription, recommended)
whisper.cpp (C++ based, original implementation)

Simply press Enter to use the recommended faster-whisper backend, or type 2 for whisper.cpp.

📖 Want to learn more about the backends? See BACKEND_SETUP.md for a detailed comparison.

Manual Installation

Install dependencies:

sudo apt update
sudo apt install build-essential git alsa-utils xclip libnotify-bin

Clone and build whisper.cpp:

cd ~/Projects
git clone https://github.com/ggerganov/whisper.cpp.git
cd whisper.cpp
make -j$(nproc)

Download a Whisper model:

# Download the base model (good balance of speed and accuracy)
bash ./models/download-ggml-model.sh base

Set up the transcription script:

# Copy the script from this repo
cp scripts/whisper-toggle.sh ~/Projects/whisper.cpp/
chmod +x ~/Projects/whisper.cpp/whisper-toggle.sh

Configure the F9 hotkey:

Method 1: Manual Configuration (Recommended)

Open Settings → Keyboard → View and Customize Shortcuts → Custom Shortcuts
Click the + button to add a new shortcut
Fill in the following:
- Name: Whisper Transcribe
- Command: /home/atkvishnu/Projects/whisper.cpp/whisper-toggle.sh
- Shortcut: Click "Set Shortcut" and press F9

Method 2: Command Line (May require logout)

# The install script attempts this automatically
gsettings set org.gnome.settings-daemon.plugins.media-keys custom-keybindings "['/org/gnome/settings-daemon/plugins/media-keys/custom-keybindings/custom0/']"
gsettings set org.gnome.settings-daemon.plugins.media-keys.custom-keybinding:/org/gnome/settings-daemon/plugins/media-keys/custom-keybindings/custom0/ name 'Whisper Transcribe'
gsettings set org.gnome.settings-daemon.plugins.media-keys.custom-keybinding:/org/gnome/settings-daemon/plugins/media-keys/custom-keybindings/custom0/ command '/home/atkvishnu/Projects/whisper.cpp/whisper-toggle.sh'
gsettings set org.gnome.settings-daemon.plugins.media-keys.custom-keybinding:/org/gnome/settings-daemon/plugins/media-keys/custom-keybindings/custom0/ binding 'F9'

🎯 Usage

Press F9 - You'll see a notification "Recording started... Press F9 again to stop"
Speak clearly into your microphone
Press F9 again - Recording stops, audio is transcribed
Check your clipboard (or text auto-types!) - The transcribed text is ready!

⚙️ Configuration

Using the GUI (Recommended)

Launch the beautiful configuration interface:

cd ~/Documents/Personal\ Code/whisper-hotkey-linux
./configure.sh

This opens a web-based GUI where you can:

Switch between clipboard and auto-type modes
Enable/disable audio level checking
Manage custom vocabulary entries
Select Whisper model and language
Adjust all settings with a modern interface

Manual Configuration

Edit the configuration file directly:

nano ~/.config/whisper-hotkey/config.conf

See FEATURES.md for detailed explanations of each setting.

Available Models

tiny - Fastest, lowest accuracy (39 MB)
base - Good balance (74 MB) - Recommended
small - Better accuracy (244 MB)
medium - High accuracy (769 MB)
large - Best accuracy (1550 MB)

📁 File Locations

Recordings: /tmp/whisper-recordings/recording_*.wav
Transcripts: /tmp/whisper-recordings/transcript_*.txt
PID file: /tmp/whisper-recording.pid

🐛 Troubleshooting

F9 key not working

Log out and log back in after setting up the keybinding
Or restart GNOME Shell: Alt+F2, type 'r', press Enter

"Library not found" errors

The script includes the necessary library paths. If you still get errors:

export LD_LIBRARY_PATH=/path/to/whisper.cpp/build/src:/path/to/whisper.cpp/build/ggml/src:$LD_LIBRARY_PATH

No audio recording

Check your microphone:

# List recording devices
arecord -l

# Test recording
arecord -d 5 test.wav
aplay test.wav

Transcription accuracy issues

Speak clearly and avoid background noise
Try a larger model for better accuracy
Ensure audio levels are appropriate

🗑️ Uninstallation

To completely remove Whisper Hotkey Transcribe:

cd ~/Documents/Personal\ Code/whisper-hotkey-linux
./uninstall.sh

The uninstaller will:

Remove both faster-whisper and whisper.cpp installations
Remove the F9 keyboard shortcut (GNOME only)
Clean up temporary files and recordings
Optionally remove configuration files (you'll be asked)

Note: Your custom vocabulary and settings will be preserved unless you choose to remove them.

🤝 Contributing

Feel free to open issues or submit pull requests! Some ideas for improvements:

Support for other desktop environments
Additional hotkey configurations
Integration with other applications
Support for multiple languages

📜 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

whisper.cpp - Georgi Gerganov's excellent C++ port of Whisper
OpenAI Whisper - The original Whisper model
Tested and developed on an Intel Ultra 9 185H system with 64GB RAM

Made with ❤️ for the open-source community. If you find this useful, please star the repository!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎙️ Whisper Hotkey Transcribe

✨ Features

Core Features

🎨 Enhanced Features (v2.0)

🖥️ Tested System

📋 Prerequisites

🚀 Installation

Quick Install

Manual Installation

🎯 Usage

⚙️ Configuration

Using the GUI (Recommended)

Manual Configuration

Available Models

📁 File Locations

🐛 Troubleshooting

F9 key not working

"Library not found" errors

No audio recording

Transcription accuracy issues

🗑️ Uninstallation

🤝 Contributing

📜 License

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
config		config
docs		docs
gui		gui
scripts		scripts
LICENSE		LICENSE
README.md		README.md
configure.sh		configure.sh
install.sh		install.sh
record.py		record.py
setrecord.sh		setrecord.sh
uninstall.sh		uninstall.sh
whisper-hotkey-config.desktop		whisper-hotkey-config.desktop

Folders and files

Latest commit

History

Repository files navigation

🎙️ Whisper Hotkey Transcribe

✨ Features

Core Features

🎨 Enhanced Features (v2.0)

🖥️ Tested System

📋 Prerequisites

🚀 Installation

Quick Install

Manual Installation

🎯 Usage

⚙️ Configuration

Using the GUI (Recommended)

Manual Configuration

Available Models

📁 File Locations

🐛 Troubleshooting

F9 key not working

"Library not found" errors

No audio recording

Transcription accuracy issues

🗑️ Uninstallation

🤝 Contributing

📜 License

🙏 Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages