Vox

Open-source voice-to-text app with local Whisper transcription and AI-powered correction.

Hold a keyboard shortcut, speak, and Vox transcribes your voice locally using whisper.cpp, optionally corrects it with AI, and pastes the text into your active app.

⚠️ Platform Support: Vox currently runs on macOS (Apple Silicon and Intel). Support for Windows and Linux is planned for future releases.

Quick Start

Download the latest version from the releases page and drag Vox.app to your Applications folder.

First Launch

When you first launch Vox, you'll need to:

1. Download a Whisper model: go to Settings > Local Model and download at least one speech recognition model. The "small" model (labeled Recommended) is a good starting point.
2. Grant permissions. Vox needs:
   - Microphone: required for voice recording
   - Accessibility: required for keyboard shortcuts and auto-paste
3. Configure shortcuts (optional): customize keyboard shortcuts in Settings > Shortcuts.
4. Enable AI improvements (optional): configure an LLM provider in Settings > AI Improvements.

Vox will guide you through this setup process with visual indicators showing what's incomplete.

Once configured, hold Alt+Space (Alt is the Option key on Mac keyboards) to start recording, and release to stop.

Features

Local transcription — Powered by whisper.cpp, audio stays on your device
AI correction — Removes filler words and fixes grammar (optional)
Hold or toggle modes — Press-and-hold or toggle recording on/off
Auto-paste — Text appears in your focused app via Cmd+V
Multiple models — Choose speed vs accuracy (tiny to large)
Multiple LLM providers — OpenAI-compatible or AWS Bedrock
Menu bar app — Runs quietly in the background

Requirements

macOS (Apple Silicon or Intel)
LLM provider (optional) — for text correction:
- OpenAI-compatible endpoint with API key
- Or AWS Bedrock credentials with model access

Configuration

Whisper Models

Download at least one model from the Whisper tab:

Model	Size	Speed	Accuracy
tiny	~75 MB	Fastest	Lower
base	~140 MB	Fast	Decent
small	~460 MB	Good	Good
medium	~1.5 GB	Slow	Better
large	~3 GB	Slowest	Best
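
As a rough illustration of the speed/accuracy/size trade-off in the table, the sketch below picks the most accurate model that fits a download budget. The sizes are the approximate figures from the table; the `pickModel` helper and its types are hypothetical and not part of Vox.

```typescript
// Approximate download sizes from the table above, in megabytes,
// ordered from least to most accurate.
type WhisperModel = { name: string; sizeMb: number };

const MODELS: readonly WhisperModel[] = [
  { name: "tiny", sizeMb: 75 },
  { name: "base", sizeMb: 140 },
  { name: "small", sizeMb: 460 },
  { name: "medium", sizeMb: 1500 },
  { name: "large", sizeMb: 3000 },
];

// Hypothetical helper: returns the most accurate model whose download
// fits within budgetMb, or undefined if even "tiny" is too large.
function pickModel(budgetMb: number): WhisperModel | undefined {
  let best: WhisperModel | undefined;
  for (const model of MODELS) {
    if (model.sizeMb <= budgetMb) best = model;
  }
  return best;
}
```

With a 500 MB budget this selects "small", which matches the recommended starting point.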

LLM Provider

Foundry (OpenAI-compatible)

Endpoint URL
API key
Model name (e.g., gpt-4o)
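
For orientation, OpenAI-compatible endpoints generally accept a chat-style request body containing a model name and a list of messages. The sketch below builds such a body for a correction request. The prompt text, the `temperature` choice, and the `buildCorrectionRequest` helper are assumptions made for illustration; the prompt Vox actually sends is not documented here.

```typescript
type ChatMessage = { role: "system" | "user"; content: string };
type ChatRequest = { model: string; messages: ChatMessage[]; temperature: number };

// Hypothetical prompt, written for this example only.
const SYSTEM_PROMPT =
  "Remove filler words and fix grammar in the user's dictated text. " +
  "Return only the corrected text.";

// Builds the JSON body for a chat-completions style request.
// Sending it (URL, Authorization header) is left out on purpose.
function buildCorrectionRequest(model: string, transcript: string): ChatRequest {
  return {
    model,
    temperature: 0, // assumption: favor deterministic corrections
    messages: [
      { role: "system", content: SYSTEM_PROMPT },
      { role: "user", content: transcript },
    ],
  };
}
```

Calling `buildCorrectionRequest("gpt-4o", "um so hello there")` yields a two-message body with the transcript as the user message.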

AWS Bedrock

AWS region
Credentials (access key, profile, or default chain)
Model ID (e.g., anthropic.claude-3-5-sonnet-20241022-v2:0)
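
The two provider options can be modeled as a discriminated union, which makes it easy to check which fields are still missing. This is only a sketch: the type and field names below are hypothetical and do not reflect Vox's actual settings schema.

```typescript
// Hypothetical shapes; field names are illustrative only.
type OpenAICompatibleConfig = {
  kind: "openai-compatible";
  endpointUrl: string;
  apiKey: string;
  model: string; // e.g. "gpt-4o"
};

type BedrockConfig = {
  kind: "bedrock";
  region: string;
  modelId: string;
  // Omitted credentials stand for "use the default chain".
  credentials?: { accessKeyId: string; secretAccessKey: string } | { profile: string };
};

type LlmConfig = OpenAICompatibleConfig | BedrockConfig;

// Returns the labels of required fields that are still blank.
// An empty array means the config looks complete.
function missingFields(config: LlmConfig): string[] {
  const required: Array<[string, string]> =
    config.kind === "openai-compatible"
      ? [
          ["Endpoint URL", config.endpointUrl],
          ["API key", config.apiKey],
          ["Model name", config.model],
        ]
      : [
          ["AWS region", config.region],
          ["Model ID", config.modelId],
        ];
  const problems: string[] = [];
  for (const [label, value] of required) {
    if (value.trim() === "") problems.push(label);
  }
  return problems;
}
```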

Shortcuts

Customize keyboard shortcuts in the Shortcuts tab:

Hold mode (default: Alt+Space)
Toggle mode (default: Alt+Shift+Space)
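
The difference between the two modes can be expressed as a small state function. This is a minimal sketch under the assumption that key-repeat events are filtered out before they reach it; the names are hypothetical and not taken from the Vox source.

```typescript
type Mode = "hold" | "toggle";
type KeyEvent = "down" | "up";

// Returns whether recording should be active after the shortcut event.
// Assumes auto-repeated "down" events have already been dropped.
function nextRecording(mode: Mode, recording: boolean, event: KeyEvent): boolean {
  if (mode === "hold") {
    // Hold mode: recording follows the key, on while held, off on release.
    return event === "down";
  }
  // Toggle mode: each press flips the state; releases are ignored.
  return event === "down" ? !recording : recording;
}
```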

Usage

Once configured, Vox runs as a menu bar icon.

Press your shortcut to record. The floating indicator shows:

Red — Recording
Yellow — Transcribing
Blue — Correcting (if LLM enabled)

Release (hold mode) or press again (toggle mode) to stop. Text is pasted automatically.

If correction fails, raw transcription is used. If transcription is empty (silence/noise), nothing is pasted.
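
The stages and fallback rules above can be sketched as follows. The function names, the `Correction` type, and the injected `transcribe`/`correct` callbacks are hypothetical stand-ins; this is not Vox's actual pipeline code.

```typescript
type Stage = "recording" | "transcribing" | "correcting";

// Indicator colors as listed above.
const STAGE_COLOR: Record<Stage, string> = {
  recording: "red",
  transcribing: "yellow",
  correcting: "blue",
};

// Outcome of the optional correction step.
type Correction = { ok: true; text: string } | { ok: false };

// Decides what to paste. Returns null when nothing should be pasted.
function textToPaste(raw: string, correction?: Correction): string | null {
  if (raw.trim() === "") return null; // silence or noise
  if (correction !== undefined && correction.ok) return correction.text;
  return raw; // LLM disabled or correction failed: use the raw transcription
}

// Runs the steps after recording stops. `correct` is undefined when no
// LLM provider is enabled.
async function finishRecording(
  transcribe: () => Promise<string>,
  correct: ((text: string) => Promise<string>) | undefined,
  onStage: (stage: Stage, color: string) => void,
): Promise<string | null> {
  onStage("transcribing", STAGE_COLOR.transcribing);
  const raw = await transcribe();
  if (raw.trim() === "" || correct === undefined) return textToPaste(raw);
  onStage("correcting", STAGE_COLOR.correcting);
  try {
    return textToPaste(raw, { ok: true, text: await correct(raw) });
  } catch {
    return textToPaste(raw, { ok: false });
  }
}
```

Keeping the paste decision in a pure function (`textToPaste`) makes the fallback behavior easy to test without audio or network access.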

Development

Setup

git clone https://github.com/app-vox/vox.git
cd vox
npm install

Run

npm run dev     # Development with hot reload
npm test        # Run tests
npm run dist    # Build production app

Built with Electron, React, TypeScript, and whisper.cpp.

Contributing

Contributions welcome! To contribute:

1. Fork the repository and create a feature branch.
2. Make your changes.
3. Run npm run typecheck && npm run lint && npm test.
4. Commit with Conventional Commits (e.g., feat(audio): add noise gate).
5. Open a pull request.
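
To check a commit header against the Conventional Commits shape before pushing, something like the sketch below can help. The pattern is simplified (it validates the header's shape, not the set of allowed types), and `parseCommitHeader` is a hypothetical helper, not a script shipped with the repository.

```typescript
// Simplified header shape: type, optional (scope), optional "!", then ": description".
const COMMIT_HEADER = /^([a-z]+)(?:\(([^()\s]+)\))?(!)?: (\S.*)$/;

type CommitHeader = {
  type: string;
  scope: string | null;
  breaking: boolean;
  description: string;
};

// Returns the parsed header, or null if it does not match the pattern.
function parseCommitHeader(header: string): CommitHeader | null {
  const match = COMMIT_HEADER.exec(header);
  if (match === null) return null;
  const [, type = "", scope, bang, description = ""] = match;
  return {
    type,
    scope: scope ?? null, // unmatched optional groups are undefined at runtime
    breaking: bang === "!",
    description,
  };
}
```

For the example above, `parseCommitHeader("feat(audio): add noise gate")` yields type `feat`, scope `audio`, and description `add noise gate`.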

⚠️ See more details in CONTRIBUTING.md.

License

MIT
