Skip to content

Arkham/transkeet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Transkeet 🦜

Push-to-talk voice transcription for your macOS menu bar, powered by Parakeet TDT running locally on Apple Silicon via MLX.

Hold a hotkey, speak, release β€” your words appear at the cursor. No cloud, no API calls, everything stays on your machine.

Requirements

  • macOS on Apple Silicon (M1+)
  • Python 3.10+
  • ffmpeg (used by the transcription engine)
  • A Hugging Face account with an access token (to download the model)
  • Microphone access (macOS will prompt on first use)
  • Accessibility access for keyboard listening and simulated paste (System Settings β†’ Privacy & Security β†’ Accessibility)

Setup

# Install ffmpeg if you don't have it
brew install ffmpeg

# Set your Hugging Face token (add this to your shell profile to persist it)
export HF_TOKEN="hf_..."

# Create a virtualenv and install
python3 -m venv .venv
source .venv/bin/activate
pip install -e .

On first launch the Parakeet model (~600 MB) will be downloaded from Hugging Face and cached locally.

Setup with Devbox (alternative)

If you have devbox, simply continue to the next step.

Usage

source .venv/bin/activate
transkeet

# or with devbox:
devbox run start

A 🦜 appears in your menu bar. Hold Right Cmd (default) to record β€” the icon turns πŸ”΄ while listening. Release the key and it switches to πŸ”„ while transcribing, then pastes the result at your cursor and returns to 🦜.

The menu bar dropdown shows your active microphone, current hotkey, and model.

Configuration

Config lives at ~/.config/transkeet/config.jsonc (created on first run):

{
  // Hotkey to hold for push-to-talk recording.
  // Format: modifier+modifier+key  (single key also works)
  // Available modifiers: cmd, shift, ctrl, alt
  // Side-specific variants: cmd_r, cmd_l, shift_r, shift_l, ctrl_r, ctrl_l, alt_r, alt_l
  "hotkey": "cmd_r",

  // Parakeet model identifier (any parakeet-mlx compatible model).
  "model": "mlx-community/parakeet-tdt-0.6b-v3"
}

How it works

  1. A global keyboard listener (pynput) watches for your hotkey
  2. While held, audio is captured from the default mic at 16 kHz (sounddevice)
  3. On release, the audio is transcribed locally with Parakeet TDT on MLX
  4. The transcribed text is placed on the clipboard and pasted via simulated Cmd+V
  5. Your original clipboard contents are restored afterward

About

Push-to-talk Mac OS widget using parakeet

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages