SuperVox

Voice-powered productivity TUI. Live call assistant + post-call analysis + agent chat.

Modes

Live -- real-time subtitles, translation, rolling summary, and audio recording
Analysis -- post-call summary, action items, follow-up draft
Agent -- chat with call history, search across past calls
History -- browse past calls, open any call in Analysis mode

Prerequisites

Rust toolchain with 2024 edition support (Rust 1.85 or newer)
OPENAI_API_KEY environment variable (for realtime STT)
macOS for system audio capture (system-audio-tap binary)

Quick Start

export OPENAI_API_KEY="sk-..."
make test      # run all tests
make check     # test + clippy + fmt
make run       # launch TUI
make install   # install to ~/.cargo/bin

Usage

supervox live                        # live call assistant
supervox analyze <call.json>         # post-call analysis
supervox analyze <call.json> --json  # output analysis as JSON
supervox agent                       # chat with history
supervox calls                       # list past calls
supervox calls --json                # output calls as JSON
supervox delete <call-id>            # delete a call (with confirmation)
supervox delete <call-id> --force    # delete without confirmation
supervox export <call-id>            # export call as markdown to stdout
supervox export <call-id> -o file.md # export to file
supervox search <query>              # search call transcripts
supervox search <query> --json       # output matches as JSON
supervox play <call-id>              # play audio recording in system player

# Use local Ollama instead of cloud LLM
supervox --local live

Global keybindings

Key	Action
`?`	Show help overlay with all keybindings for current mode
`Ctrl+C`	Quit immediately

Live mode

Key	Action
`r`	Start recording
`s`	Stop recording
`h`	Open call history (when idle)
`q`	Quit (when idle)

Speaker labels are color-coded: You (cyan) and Them (yellow).

Audio is automatically saved as WAV (16-bit PCM mono) alongside the call JSON in ~/.supervox/calls/. Use supervox play <call-id> or press p in Analysis mode to listen back.
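
To make the recording format concrete, here is a minimal sketch of how a 16-bit PCM mono WAV file is laid out. It is not SuperVox's actual writer: the function names `wav_header` and `encode_wav` are hypothetical, and the sample rate is left to the caller because this README does not state which rate is used.

```rust
/// Build the canonical 44-byte RIFF/WAVE header for 16-bit PCM mono audio.
/// Hypothetical helper for illustration; not taken from the SuperVox source.
fn wav_header(sample_rate: u32, num_samples: u32) -> Vec<u8> {
    const CHANNELS: u16 = 1; // mono
    const BITS_PER_SAMPLE: u16 = 16; // 16-bit PCM
    let block_align = CHANNELS * BITS_PER_SAMPLE / 8; // bytes per frame
    let byte_rate = sample_rate * u32::from(block_align);
    let data_len = num_samples * u32::from(block_align);

    let mut h = Vec::with_capacity(44);
    h.extend_from_slice(b"RIFF");
    h.extend_from_slice(&(36 + data_len).to_le_bytes()); // file size minus 8
    h.extend_from_slice(b"WAVE");
    h.extend_from_slice(b"fmt ");
    h.extend_from_slice(&16u32.to_le_bytes()); // size of the fmt chunk
    h.extend_from_slice(&1u16.to_le_bytes()); // audio format 1 = PCM
    h.extend_from_slice(&CHANNELS.to_le_bytes());
    h.extend_from_slice(&sample_rate.to_le_bytes());
    h.extend_from_slice(&byte_rate.to_le_bytes());
    h.extend_from_slice(&block_align.to_le_bytes());
    h.extend_from_slice(&BITS_PER_SAMPLE.to_le_bytes());
    h.extend_from_slice(b"data");
    h.extend_from_slice(&data_len.to_le_bytes());
    h
}

/// Header followed by the samples as little-endian i16 values.
/// Assumes the sample count fits in a u32, as the WAV format requires.
fn encode_wav(sample_rate: u32, samples: &[i16]) -> Vec<u8> {
    let mut out = wav_header(sample_rate, samples.len() as u32);
    for s in samples {
        out.extend_from_slice(&s.to_le_bytes());
    }
    out
}
```

Calling `encode_wav(16_000, &samples)` (16 kHz is an assumed rate) yields bytes that could be written with `std::fs::write` next to the call JSON.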

Analysis mode

Opens a call JSON file and automatically runs LLM analysis (summary, action items, mood, themes).

Key	Action
`f`	Generate follow-up email
`c`	Copy analysis to clipboard
`C`	Copy follow-up to clipboard
`e`	Export call + analysis as markdown to clipboard
`p`	Play audio recording (if available)
`h`	Open call history
Arrow keys	Scroll
`q`	Quit

History mode

Browse and manage past calls.

Key	Action
`↑/↓/j/k`	Navigate
`Enter`	Open in Analysis
`d`	Delete call (y/n confirmation)
`Esc`	Back to previous mode
`q`	Quit

Agent mode

Chat with your call history. The agent loads the 10 most recent calls as context and streams LLM responses in real time.
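
The context window described above can be sketched as a simple "keep the newest N" selection. This is an illustration under assumptions, not the agent's real code: the `CallMeta` struct, its `started_at` field (a Unix timestamp), and `last_n_calls` are all hypothetical names.

```rust
/// Hypothetical call record; the real call JSON schema is not shown in this README.
#[derive(Debug, Clone, PartialEq)]
struct CallMeta {
    id: String,
    started_at: u64, // assumed: Unix timestamp in seconds
}

/// Return at most `n` of the most recent calls, ordered oldest to newest.
fn last_n_calls(mut calls: Vec<CallMeta>, n: usize) -> Vec<CallMeta> {
    calls.sort_by_key(|c| c.started_at); // oldest first
    let skip = calls.len().saturating_sub(n); // 0 when fewer than n calls exist
    calls.split_off(skip) // keep only the tail
}
```

With `n = 10`, a history of 12 calls is trimmed to the newest 10, and a shorter history is passed through unchanged.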

Key	Action
Type + `Enter`	Send question
`Esc`	Quit

Tech Stack

Layer	Technology
Voice pipeline	voxkit (STT, VAD, TTS, mic, system audio)
LLM agent	sgr-agent (tool calling, sessions, compaction)
TUI	ratatui + sgr-agent-tui
Real-time STT	OpenAI Realtime WebSocket
LLM	Gemini Flash / OpenRouter / Ollama

Config

Config is loaded from ~/.supervox/config.toml at startup. If the file is missing, a default one is created.

# ~/.supervox/config.toml
my_language = "ru"              # target language for translation/summary
stt_backend = "realtime"        # "realtime" (WebSocket) | "openai" (batch)
llm_model = "gemini-2.5-flash"  # model for translation + summary
summary_lag_secs = 5            # rolling summary interval
capture = "mic+system"          # "mic" | "mic+system"
llm_backend = "auto"            # "auto" | "ollama"
ollama_model = "llama3.2:3b"    # model when llm_backend = "ollama"
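
As a rough illustration of the options above, the following sketch reads the flat `key = value` lines and checks the three settings that accept a fixed set of values. It is deliberately not a TOML parser (a real implementation would more likely use a TOML library), it assumes values never contain `#`, and `parse_flat_config` and `validate` are hypothetical names.

```rust
use std::collections::HashMap;

/// Allowed values, taken from the comments in the sample config above.
const ALLOWED: &[(&str, &[&str])] = &[
    ("stt_backend", &["realtime", "openai"]),
    ("capture", &["mic", "mic+system"]),
    ("llm_backend", &["auto", "ollama"]),
];

/// Parse flat `key = value` lines; simplified stand-in for a TOML parser.
fn parse_flat_config(text: &str) -> HashMap<String, String> {
    let mut out = HashMap::new();
    for line in text.lines() {
        // Drop a trailing comment (assumes '#' never appears inside a value).
        let line = line.split('#').next().unwrap_or("").trim();
        let Some((key, value)) = line.split_once('=') else {
            continue; // blank or comment-only line
        };
        let value = value.trim().trim_matches('"');
        out.insert(key.trim().to_string(), value.to_string());
    }
    out
}

/// Collect one message per key whose value is outside its allowed set.
fn validate(cfg: &HashMap<String, String>) -> Vec<String> {
    let mut errors = Vec::new();
    for (key, choices) in ALLOWED {
        if let Some(value) = cfg.get(*key) {
            if !choices.contains(&value.as_str()) {
                errors.push(format!("{key}: unsupported value \"{value}\""));
            }
        }
    }
    errors
}
```

Keys that are absent are simply skipped here; how SuperVox fills in missing keys is not documented in this README.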

System audio setup (macOS)

System audio capture uses ScreenCaptureKit via the system-audio-tap helper binary. If unavailable, SuperVox falls back to mic-only mode automatically.
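
The fallback rule can be expressed as a small pure function. This is only a sketch of the behavior described above: the `Capture` enum, `parse_capture`, and `effective_capture` are hypothetical, and how SuperVox actually detects that the helper is unavailable is not covered in this README.

```rust
/// Capture modes matching the `capture` config values "mic" and "mic+system".
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Capture {
    Mic,
    MicAndSystem,
}

/// Map a config string to a capture mode; unknown strings yield None.
fn parse_capture(value: &str) -> Option<Capture> {
    match value {
        "mic" => Some(Capture::Mic),
        "mic+system" => Some(Capture::MicAndSystem),
        _ => None,
    }
}

/// Downgrade to mic-only when system audio was requested but the helper is unavailable.
fn effective_capture(requested: Capture, system_tap_available: bool) -> Capture {
    match (requested, system_tap_available) {
        (Capture::MicAndSystem, false) => Capture::Mic,
        (mode, _) => mode,
    }
}
```

Keeping the decision separate from the availability check makes the fallback easy to unit test without audio hardware.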

License

MIT
