# speaker-diarization

Here are 65 public repositories matching this topic...

espnet / espnet

End-to-End Speech Processing Toolkit

text-to-speech deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Jun 18, 2024
Python

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Jun 20, 2024
Python

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jun 21, 2024
Python

google / uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

machine-learning clustering supervised-learning speaker-recognition speaker-diarization supervised-clustering uis-rnn

Updated Aug 28, 2023
Python

linto-ai / whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Updated Apr 22, 2024
Python

taylorlu / Speaker-Diarization

Speaker diarization with uis-rnn, using speaker embeddings from vgg-speaker-recognition

speaker-recognition speaker-diarization uis-rnn ghostvlad vgg-speaker-recognition

Updated Jul 1, 2021
Python

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Updated Jun 15, 2024
Python

wq2012 / SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

python machine-learning clustering unsupervised-learning constrained-clustering speaker-diarization spectral-clustering unsupervised-clustering auto-tune

Updated Jan 9, 2024
Python

juanmc2005 / diart

A Python package to build AI-powered real-time audio applications

real-time deep-learning transcription speaker-diarization streaming-audio voice-activity-detection speaker-embedding

Updated Jun 1, 2024
Python

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

speaker-verification speaker-diarization language-identification voxceleb modelscope campplus eres2net 3d-speaker rdino cnceleb

Updated Jun 19, 2024
Python

yinruiqing / pyannote-whisper

whisper asr speaker-diarization meeting-summarization pyannote chatgpt

Updated May 11, 2024
Python

manojpamk / pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

speaker-recognition speaker-verification speaker-diarization speaker-embeddings

Updated Nov 11, 2020
Python

hitachi-speech / EEND

End-to-End Neural Diarization

machine-learning deep-learning chainer end-to-end kaldi speaker-diarization eend

Updated Aug 30, 2021
Python

cvqluu / TDNN

Time delay neural network (TDNN) implementation in PyTorch using the unfold method

pytorch speech-recognition speaker-recognition speaker-verification speech-processing asr speaker-diarization tdnn x-vector

Updated Nov 21, 2019
Python

google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

speaker-recognition speaker-verification source-separation speaker-diarization speaker-identification

Updated Jun 13, 2024
Python

cvqluu / Factorized-TDNN

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

neural-network pytorch speech-recognition neural-networks kaldi speaker-recognition speaker-verification embedding speaker-diarization tdnn acoustic-model acoustic-models x-vector tdnn-f factorized-tdnn

Updated Jan 6, 2020
Python

transcriptionstream / transcriptionstream

Turnkey self-hosted offline transcription and diarization service with LLM summary

automation speech-recognition transcription whisper speaker-diarization diarization llm whisperx ollama mistral-7b

Updated Jun 2, 2024
Python

cvqluu / simple_diarizer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

speech-to-text transcription asr speaker-diarization colab-notebook diarization

Updated May 2, 2024
Python

yuyq96 / D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

speech speaker-recognition speaker-verification speaker-diarization time-delay-neural-network speaker-embedding speaker-adaptation temporal-convolutional-network d-tdnn

Updated May 4, 2023
Python

nuaazs / VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

microservices speech-recognition speaker-recognition antifraud speaker-diarization

Updated Apr 16, 2024
Python
