Fast local speech-to-text. 25 languages. ~18x faster than Whisper on Apple Silicon.
- CoreML on Apple Silicon: ~155x real-time via FluidAudio
- ONNX on CPU: cross-platform fallback, 3x faster than Whisper
- Any audio format: ffmpeg handles OGG, MP3, WAV, FLAC, M4A
- Zero Python: Bun + TypeScript, native Swift binary for CoreML
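To make the "~155x real-time" figure concrete, here is a minimal sketch that converts a real-time factor into an estimated processing time. The helper name `estimateProcessingSeconds` is hypothetical and not part of parakeet-cli, and the estimate assumes the factor covers transcription only (model load and startup time are not accounted for).

```typescript
// Real-time factor: seconds of audio processed per second of wall-clock time.
// Hypothetical helper for illustration; not part of parakeet-cli.
function estimateProcessingSeconds(audioSeconds: number, realTimeFactor: number): number {
  if (realTimeFactor <= 0) {
    throw new RangeError("realTimeFactor must be positive");
  }
  return audioSeconds / realTimeFactor;
}

// A 60-second voice message at the ~155x figure quoted above.
const coremlEstimate = estimateProcessingSeconds(60, 155);
console.log(`${coremlEstimate.toFixed(2)} s`); // about 0.39 s
```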
bun install -g @drakulavich/parakeet-cli
parakeet install # CoreML on macOS arm64, ONNX elsewhere
parakeet audio.ogg # → transcript to stdout

parakeet install # auto-detect backend
parakeet install --coreml # force CoreML (macOS arm64)
parakeet install --onnx # force ONNX (~3GB)
parakeet audio.ogg # transcribe (language auto-detected)
parakeet --version

Stdout: transcript. Stderr: errors. Pipe-friendly.
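Because the transcript goes to stdout and errors go to stderr, the CLI is easy to call from a script. The sketch below wraps it with `spawnSync` from `node:child_process` (also available in Bun). The `transcribe` function is a hypothetical wrapper, and it assumes the CLI exits with a non-zero status on failure, which this README does not state.

```typescript
import { spawnSync } from "node:child_process";

// Hypothetical wrapper: runs the CLI and returns the transcript from stdout.
// `command` is a parameter only so the wrapper can be pointed at another binary.
function transcribe(audioPath: string, command = "parakeet"): string {
  const result = spawnSync(command, [audioPath], { encoding: "utf8" });
  if (result.error) {
    throw result.error; // e.g. the binary was not found on PATH
  }
  if (result.status !== 0) {
    // Assumption: failures produce a non-zero exit status and a message on stderr.
    throw new Error(`${command} exited with ${result.status}: ${result.stderr.trim()}`);
  }
  return result.stdout.trim();
}

// Usage (requires parakeet to be installed):
// const text = transcribe("audio.ogg");
```

Keeping the two streams separate means the returned string contains only the transcript, while any diagnostic text ends up in the thrown error.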
🇧🇬 Bulgarian, 🇭🇷 Croatian, 🇨🇿 Czech, 🇩🇰 Danish, 🇳🇱 Dutch, 🇬🇧 English, 🇪🇪 Estonian, 🇫🇮 Finnish, 🇫🇷 French, 🇩🇪 German, 🇬🇷 Greek, 🇭🇺 Hungarian, 🇮🇹 Italian, 🇱🇻 Latvian, 🇱🇹 Lithuanian, 🇲🇹 Maltese, 🇵🇱 Polish, 🇵🇹 Portuguese, 🇷🇴 Romanian, 🇷🇺 Russian, 🇸🇰 Slovak, 🇸🇮 Slovenian, 🇪🇸 Spanish, 🇸🇪 Swedish, 🇺🇦 Ukrainian
MacBook Pro M3 Pro, 10 Russian voice messages:

| | faster-whisper (CPU) | Parakeet (CoreML) |
|---|---|---|
| Total | 35.3s | 1.9s |
| Speed | baseline | ~18x faster |
Full results with transcripts: BENCHMARK.md
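The "~18x" figure follows directly from the two totals in the table. As a quick check, the sketch below recomputes it; the function and variable names are illustrative only.

```typescript
// Totals from the benchmark table above (seconds for 10 voice messages).
const fasterWhisperCpuSeconds = 35.3;
const parakeetCoremlSeconds = 1.9;

// Speedup is the baseline time divided by the candidate time.
function speedup(baselineSeconds: number, candidateSeconds: number): number {
  return baselineSeconds / candidateSeconds;
}

const ratio = speedup(fasterWhisperCpuSeconds, parakeetCoremlSeconds);
console.log(ratio.toFixed(1)); // 18.6, reported above as ~18x

// Average time per message on the CoreML backend.
const secondsPerMessage = parakeetCoremlSeconds / 10;
console.log(secondsPerMessage.toFixed(2)); // 0.19
```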
parakeet audio.ogg
├── CoreML installed? → parakeet-coreml subprocess → stdout
└── ONNX installed? → ffmpeg → mel → encoder → decoder → stdout
- CoreML: Swift binary wraps FluidAudio + CoreML model
- ONNX: NVIDIA Parakeet TDT 0.6B v3 via onnxruntime-node
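The dispatch order in the diagram can be sketched as a small selection function. This is an illustration under stated assumptions, not the actual parakeet-cli source: the `InstalledBackends` shape and the `selectBackend` name are hypothetical, and what the CLI does when neither backend is installed is not described in this README.

```typescript
type Backend = "coreml" | "onnx";

// Hypothetical description of what is installed on this machine.
interface InstalledBackends {
  coreml: boolean; // the parakeet-coreml binary and its model are present
  onnx: boolean;   // the ONNX model files are present
}

// Mirrors the order in the diagram: CoreML is checked first, ONNX is the fallback.
function selectBackend(installed: InstalledBackends): Backend | null {
  if (installed.coreml) return "coreml";
  if (installed.onnx) return "onnx";
  return null; // neither backend is installed
}

// Pipeline stages per backend, as listed in the diagram.
const stages: Record<Backend, string[]> = {
  coreml: ["parakeet-coreml subprocess"],
  onnx: ["ffmpeg", "mel", "encoder", "decoder"],
};

const chosen = selectBackend({ coreml: false, onnx: true });
if (chosen !== null) {
  console.log(`${chosen}: ${stages[chosen].join(" -> ")}`);
}
```

Checking CoreML first means a machine with both backends installed takes the faster path, and the ONNX pipeline only runs where CoreML is unavailable.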
Drop-in replacement for OpenClaw voice processing: no API keys, runs locally.
{
  "tools": {
    "media": {
      "audio": {
        "enabled": true,
        "models": [
          {
            "type": "cli",
            "command": "parakeet",
            "args": ["{{MediaPath}}"],
            "timeoutSeconds": 120
          }
        ],
        "echoTranscript": false
      }
    }
  }
}

License: MIT