Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Updated Apr 20, 2025
Synchronized Translation for Videos. Video dubbing
Turnkey, self-hosted offline transcription and diarization service with LLM summary
UniSpeech - Large Scale Self-Supervised Learning for Speech
Open source inference code for Rev's model
Gecko - A Tool for Effective Annotation of Human Conversations
Rust bindings to https://github.com/k2-fsa/sherpa-onnx
Identify the emotion of multiple speakers in an Audio Segment
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Python package for combining diarization system outputs.
Callytics is an advanced call analytics solution that uses speech recognition and large language model (LLM) technologies to analyze phone conversations from customer service and call centers.
pyannote audio diarization in Rust
A lightweight library to compute Diarization Error Rate (DER).
Tool for automatic transcription and speaker diarization based on Whisper and pyannote.
On-device speaker diarization powered by deep learning
Neural-network-based similarity scoring for diarization (PyTorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨
Easy to use Multi-Provider ASR/Speech To Text and NLP engine
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.