dictctl

CLI tool for dictation: microphone recording → Whisper transcription → text on stdout.

🎙️ Record & Transcribe — Record from your microphone, stop with Ctrl+C, get text on stdout
🏠 Local Backend — Offline transcription via whisper.cpp — no data leaves your machine
☁️ OpenAI Backend — Cloud transcription via the OpenAI Whisper API — no local GPU needed
📋 Clipboard Support — Copy transcription result directly to clipboard with -c
🔇 Silence Detection — Auto-stop recording after silence with -s
📁 File Transcription — Transcribe existing audio files without recording
🔧 Interactive Setup — Configure language, backend, model, and audio device via dictctl setup (requires fzf)
⚡ Single Binary — No Python, no Docker; just a Go binary plus sox for recording

Requirements

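sox: audio recording (rec command)
whisper-cpp: local transcription (optional, for the local backend)
OpenAI API key: cloud transcription (optional, for the openai backend)
ffmpeg: recording from a specific input device (optional, used instead of sox when a device is configured)
fzf: interactive configuration (optional, for dictctl setup)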

brew install sox whisper-cpp

Installation

go install github.com/slauger/dictctl@latest

Or build from source:

git clone https://github.com/slauger/dictctl.git
cd dictctl
make install

Quick Start

# Download the default whisper model (~1.5 GB)
dictctl download

# Start dictating (Ctrl+C to stop)
dictctl

Usage

dictctl                         # record → default backend
dictctl -b openai               # record → OpenAI API
dictctl file audio.mp3          # transcribe existing file
dictctl devices                 # list audio input devices
dictctl download                # download whisper model (interactive)
dictctl download -m base        # download a specific model
dictctl setup                   # interactive configuration
dictctl --help                  # show help

Press Ctrl+C to stop recording. The audio is finalized cleanly and passed to the transcription backend.

Flags

| Flag | Description |
| --- | --- |
| `-b <backend>` | Backend: `local` or `openai` (default: from config) |
| `-c` | Copy result to clipboard (macOS, via `pbcopy`) |
| `-d <device>` | Audio input device (see `dictctl devices`) |
| `-l <lang>` | Language code (default: `en`) |
| `-s` | Enable silence detection (auto-stop recording) |
| `-m <model>` | Override model name |
| `-h, --help` | Show help |
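The flags above map naturally onto Go's standard flag package. The following is a minimal sketch under stated assumptions, not dictctl's actual implementation: the Options type, the parseArgs function, and the use of an empty string to mean "take the value from the config" are all hypothetical names and choices made for illustration.

```go
package main

import (
	"flag"
	"fmt"
	"io"
)

// Options mirrors the documented flags.
// Hypothetical type; not taken from dictctl's source.
type Options struct {
	Backend   string // -b, empty means "use the config value" (assumption)
	Clipboard bool   // -c
	Device    string // -d
	Language  string // -l
	Silence   bool   // -s
	Model     string // -m, empty means "use the config value" (assumption)
}

// parseArgs parses args (without the program name) into Options.
func parseArgs(args []string) (Options, error) {
	var o Options
	fs := flag.NewFlagSet("dictctl", flag.ContinueOnError)
	fs.SetOutput(io.Discard) // keep the sketch quiet on parse errors
	fs.StringVar(&o.Backend, "b", "", "backend: local or openai")
	fs.BoolVar(&o.Clipboard, "c", false, "copy result to clipboard")
	fs.StringVar(&o.Device, "d", "", "audio input device")
	fs.StringVar(&o.Language, "l", "en", "language code")
	fs.BoolVar(&o.Silence, "s", false, "enable silence detection")
	fs.StringVar(&o.Model, "m", "", "override model name")
	if err := fs.Parse(args); err != nil {
		return Options{}, err
	}
	// Reject backends other than the two documented ones.
	if o.Backend != "" && o.Backend != "local" && o.Backend != "openai" {
		return Options{}, fmt.Errorf("unknown backend %q", o.Backend)
	}
	return o, nil
}

func main() {
	o, err := parseArgs([]string{"-b", "openai", "-l", "de", "-c"})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Printf("%+v\n", o)
}
```

Note that the standard flag package stops parsing at the first non-flag argument, so an invocation such as dictctl file meeting.wav -c would need additional handling that this sketch leaves out.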

Examples

# Record and transcribe in English (default)
dictctl

# Record in German via OpenAI
dictctl -b openai -l de

# Transcribe a file and copy to clipboard
dictctl file meeting.wav -c

# Use a specific local model
dictctl -m large-v3

# Record from a specific device
dictctl -d "Elgato Wave:3"

Models

For the local backend, download a whisper.cpp GGML model:

dictctl download                # downloads the configured model (default: large-v3-turbo)
dictctl download -m base        # download a smaller/faster model

Downloaded models are stored in ~/.local/share/whisper-cpp/. When looking up a model by name, dictctl checks these paths in order:

~/.local/share/whisper-cpp/ggml-<model>.bin
/opt/homebrew/share/whisper-cpp/ggml-<model>.bin

Alternatively, specify an absolute path to a model file in the config file or with the -m flag.
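The lookup described above can be sketched in Go as follows. This is an illustration of the documented search order, not dictctl's actual code: the function names resolveModel, modelSearchDirs, and fileExists are hypothetical, and the behavior for a missing absolute path is an assumption.

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
)

// modelSearchDirs returns the documented search directories in order,
// with "~" replaced by the given home directory.
func modelSearchDirs(home string) []string {
	return []string{
		filepath.Join(home, ".local", "share", "whisper-cpp"),
		"/opt/homebrew/share/whisper-cpp",
	}
}

// resolveModel turns a model name or an absolute path into a model file path.
// The exists callback is injected so the lookup can be exercised without
// touching the real filesystem.
func resolveModel(model, home string, exists func(string) bool) (string, error) {
	// An absolute path is used as given (assumption: no search is performed).
	if filepath.IsAbs(model) {
		if exists(model) {
			return model, nil
		}
		return "", fmt.Errorf("model file not found: %s", model)
	}
	name := "ggml-" + model + ".bin"
	for _, dir := range modelSearchDirs(home) {
		candidate := filepath.Join(dir, name)
		if exists(candidate) {
			return candidate, nil
		}
	}
	return "", fmt.Errorf("model %q not found; try: dictctl download -m %s", model, model)
}

// fileExists reports whether path exists and is not a directory.
func fileExists(path string) bool {
	info, err := os.Stat(path)
	return err == nil && !info.IsDir()
}

func main() {
	home, err := os.UserHomeDir()
	if err != nil {
		fmt.Fprintln(os.Stderr, "cannot determine home directory:", err)
		os.Exit(1)
	}
	path, err := resolveModel("large-v3-turbo", home, fileExists)
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	fmt.Println(path)
}
```

Returning the first match means a model in the user directory shadows a model of the same name installed under the Homebrew prefix.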

Audio Devices

List available input devices:

dictctl devices

* Elgato Wave:3 (1 ch)
  MacBook Pro-Mikrofon (1 ch)
  ...

* = default input device

Select a device per invocation with -d or set a default in the config file. When a device is configured, recording uses ffmpeg (avfoundation) instead of sox, so ffmpeg must be installed. Without a device, recording uses the system default input via sox.
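The choice between the two recorders can be sketched as a small Go function. This is an assumption-laden illustration, not dictctl's actual code: the function name recorderCommand is hypothetical, and the exact arguments (16 kHz mono output, passing the device name to avfoundation as ":<name>") are guesses at a reasonable invocation rather than what dictctl runs.

```go
package main

import (
	"fmt"
	"strings"
)

// recorderCommand returns the command line used to record audio to outPath.
// With no device it uses sox's rec command on the system default input;
// with a device it uses ffmpeg's avfoundation input (macOS).
// All arguments are illustrative assumptions.
func recorderCommand(device, outPath string) []string {
	if device == "" {
		// rec: quiet mode, 16 kHz sample rate, one channel.
		return []string{"rec", "-q", "-r", "16000", "-c", "1", outPath}
	}
	// avfoundation takes "[video]:[audio]"; a leading colon selects audio only.
	return []string{
		"ffmpeg", "-f", "avfoundation", "-i", ":" + device,
		"-ar", "16000", "-ac", "1", outPath,
	}
}

func main() {
	fmt.Println(strings.Join(recorderCommand("", "out.wav"), " "))
	fmt.Println(strings.Join(recorderCommand("MacBook Pro-Mikrofon", "out.wav"), " "))
}
```

Building the argument list as a slice, rather than a single shell string, avoids quoting problems with device names that contain spaces.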

Configuration

Config file: ~/.config/dictctl/config.yaml

default_backend: local
language: en
# device: "Elgato Wave:3"

backends:
  local:
    model: large-v3-turbo
    # binary: /opt/homebrew/bin/whisper-cli
  openai:
    api_key: sk-...
    model: whisper-1

The OpenAI API key can also be set via the OPENAI_API_KEY environment variable.
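A key lookup with an environment fallback might look like the sketch below. The precedence shown (config file first, then OPENAI_API_KEY) is an assumption, since the text above does not say which source wins, and the function name resolveAPIKey is hypothetical.

```go
package main

import (
	"errors"
	"fmt"
	"os"
	"strings"
)

// resolveAPIKey returns the OpenAI API key to use.
// Assumed precedence: the config value wins, the environment is the fallback.
// getenv is injected so the lookup can be tested without changing the
// process environment.
func resolveAPIKey(configKey string, getenv func(string) string) (string, error) {
	if k := strings.TrimSpace(configKey); k != "" {
		return k, nil
	}
	if k := strings.TrimSpace(getenv("OPENAI_API_KEY")); k != "" {
		return k, nil
	}
	return "", errors.New("no OpenAI API key: set backends.openai.api_key or OPENAI_API_KEY")
}

func main() {
	// An empty config value forces the environment fallback.
	key, err := resolveAPIKey("", os.Getenv)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	// Never print the key itself; report only that one was found.
	fmt.Println("found an API key of length", len(key))
}
```

Keeping the key in the environment rather than in config.yaml avoids storing a secret in a file that may end up in backups or dotfile repositories.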

License

MIT
