HyperYap is a local voice-to-text application that bundles speech recognition, a terminal emulator, and hotkey automation into a single zero-config installer for Windows. No cloud, no internet required for transcription, no data collection. Install once, use everywhere.
- What You Get
- Install
- Default Hotkeys
- Requirements
- How It Works
- Configuration
- Build from Source
- Contributing
- Attribution
- License
One installer, one tray icon, everything just works:
- Local speech-to-text powered by NVIDIA Parakeet TDT 0.6B v3. No cloud, no internet after first install.
- Hotkey remapping turns CapsLock and mouse side buttons into recording triggers. Mouse Forward becomes Enter.
- Smart paste detects terminals and auto-saves clipboard images as PNGs, pasting the file path instead of garbled data.
- BenjaminTerm included. Hacker-styled WezTerm terminal with smart clipboard, 86 dark themes, and borderless mode.
- Auto-boot on login. Preset configs. Zero setup after install.
Everything is preconfigured. You do not need to set up shortcuts, change settings, or configure anything after install.
Download hyperyap_1.0.0_x64-setup.exe from the Releases page and run it. On first launch, HyperYap will:
- Download the NVIDIA Parakeet speech model (~440MB)
- Set up the hotkey engine and register autostart
- Apply all preset settings (toggle-to-talk, F13, English)
This installs everything in one shot, including BenjaminTerm and the speech model:
irm https://raw.githubusercontent.com/avalonreset/hyperyap/main/install.ps1 | iexOr clone and run locally:
git clone https://github.com/avalonreset/hyperyap.git
cd hyperyap
powershell -ExecutionPolicy Bypass -File install.ps1The PowerShell installer also removes old MURmure installations if present.
Run the installer again over an existing install. HyperYap upgrades in-place without deleting your speech model or requiring a re-download. Settings are reset to the recommended defaults on each upgrade.
| Key | Action |
|---|---|
F13 / CapsLock / Mouse Back |
Start/stop recording |
| Mouse Forward | Enter |
Ctrl+Shift+Space |
Paste last transcript |
Ctrl+Alt+Space |
LLM-assisted recording |
Ctrl+Shift+X |
Command mode |
Escape |
Cancel recording |
These hotkeys are built in. You can pause and resume them from the tray icon's right-click menu.
HyperYap disables CapsLock and repurposes it as a speech-to-text key. Press CapsLock to start recording, press it again to stop. Your transcription is pasted into whatever window is focused. CapsLock is permanently set to off so you never accidentally activate it.
HyperYap's hotkey engine is terminal-aware. It detects which application is focused and adapts Ctrl+V behavior accordingly:
In regular applications (browsers, editors, chat apps), Ctrl+V works exactly as it normally does. HyperYap does not interfere.
In supported terminals, HyperYap intercepts Ctrl+V and adds clipboard image intelligence. If your clipboard contains a screenshot or image, HyperYap automatically saves it as a timestamped PNG in ~/screenshots/ and replaces the clipboard with the file path before pasting. This means you can take a screenshot, switch to your terminal, press Ctrl+V, and get a usable file path instead of garbled binary data. Text clipboard contents paste normally.
This is especially useful for vibe coding workflows where you screenshot errors, UI mockups, or terminal output and need to reference them by path in a command or prompt.
Supported terminals:
| Terminal | Process |
|---|---|
| BenjaminTerm | benjaminterm-gui.exe |
| WezTerm | wezterm-gui.exe |
| Windows Terminal | windowsterminal.exe |
| PowerShell | powershell.exe |
| PowerShell 7+ | pwsh.exe |
| Command Prompt | cmd.exe |
| Alacritty | alacritty.exe |
| ConEmu | conemu.exe / conemu64.exe |
| Hyper | hyper.exe |
| Git Bash (mintty) | mintty.exe |
| Tabby | tabby.exe |
| Warp | warp.exe |
| MobaXterm | mobaxterm.exe |
- Windows 10+ (Windows only)
- A microphone
- ~700MB disk space (voice model)
- Internet connection for first launch (model download)
BenjaminTerm is installed by the PowerShell installer, or can be downloaded separately from its repo.
- Press the hotkey (F13, CapsLock, or Mouse Back) to start recording
- Speak naturally into your microphone
- Press the hotkey again to stop recording
- HyperYap transcribes locally using the Parakeet TDT model
- The transcription is automatically pasted into the active window
All processing happens on your machine. Audio never leaves your computer. The speech model runs entirely offline after the initial download.
HyperYap works out of the box with zero configuration. All settings can be changed from the app's Settings page.
| Setting | Default | Description |
|---|---|---|
| Record mode | Toggle-to-talk | Press once to start, press again to stop. Can be changed to push-to-talk. |
| Record shortcut | F13 | Configurable to any key or key combination |
| Language | English | Supports multiple languages via the Parakeet model |
| Overlay | Bottom of screen | Recording indicator position. Can be set to top, bottom, or hidden. |
| LLM Connect | Disabled | Post-process transcriptions with a local LLM (Ollama) or remote API |
| HTTP API | Disabled | Local API on localhost for external tool integration |
| Sound feedback | Enabled | Audio cues when recording starts and stops |
| Copy to clipboard | Disabled | Optionally keep transcriptions in the clipboard |
Settings are stored in %APPDATA%/com.avalonreset.hyperyap/settings.json. Each install or upgrade resets settings to the recommended defaults. If you need to preserve custom settings across upgrades, back up this file before updating.
All in-app hotkeys can be remapped from the Settings page. Mouse button and CapsLock remapping are handled by the bundled hotkey engine, which runs alongside the main app and can be toggled from the tray menu.
pnpm install
# Build the hotkey engine first
cd hotkeys && cargo build --release && cd ..
pnpm tauri dev # development
pnpm tauri build # production buildRequires: Node.js 18+, Rust, pnpm, Tauri prerequisites
Download the Parakeet model and extract to resources/parakeet-tdt-0.6b-v3-int8/.
Contributions are welcome. See CONTRIBUTING.md for development setup, PR workflow, and coding guidelines.
Please read the Code of Conduct before contributing.
HyperYap's voice engine is a modified version of MURmure by Kieirra. Full credit to the original author for building an excellent local speech-to-text application.
BenjaminTerm is a custom distribution of WezTerm by Wez Furlong.
Powered by NVIDIA's Parakeet TDT 0.6B v3 speech recognition model.
The voice engine is licensed under AGPL-3.0. BenjaminTerm is licensed under MIT. See NOTICE for full attribution details.
