Machine learning for Audio, speech, and General Optimization
We build and share open-source tools for speech recognition, speaker diarization, and speech AI research.
|
Ultra-Sortformer Extending NVIDIA's Sortformer streaming diarization model to support N > 4 speakers via SVD-based orthogonal initialization and split learning rates. |
|
audion-python-sdk Python SDK for the Audion API — speech recognition and audio AI integration. |
audion-java-sdk Java SDK for the Audion API — speech recognition and audio AI integration. |