A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
-
Updated
May 31, 2024 - Python
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Worflow for literature character personality profiling 📚 which is solely relies on book content 📕
A PyTorch-based Speech Toolkit
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Speaker Diarization, Recognition and Language Identification. Scripts to generate GT using our WebApp and Praat software
SA-toolkit: Speaker speech anonymization toolkit in python
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods
Python toolkit for speech processing
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法
使用Tensorflow实现声纹识别
基于Kersa实现的声纹识别模型
On-device speaker recognition engine powered by deep learning
Speaker recognition task using wav2vec2 model.
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features
In defence of metric learning for speaker recognition
Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification
Add a description, image, and links to the speaker-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speaker-recognition topic, visit your repo's landing page and select "manage topics."