Open-source audio intelligence.
speech-swift — AI speech models for Apple Silicon. ASR, TTS, speech-to-speech, VAD, diarization, and speech enhancement — all running locally via MLX and CoreML. No cloud, no API keys.
speech-core — Cross-platform voice agent pipeline engine in C++. Turn detection, interruption handling, speech queuing, and protocol handling.