Stars
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
A powerful 🚀 Android chart view / graph view library, supporting line, bar, pie, radar, bubble, and candlestick charts as well as scaling, panning, and animations.
Charting library for Android applications. Automatically exported from code.google.com/p/achartengine
This repo provides information about the KeSpeech dataset.
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Python wrapper for the xeno-canto.org API to aid in downloading and managing recordings.
Automatic headphone equalization from frequency responses
AS-pVAD: A Real-time Personalized Voice Activity Detection Network With Attentive Score Loss
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
adefossez / demucs
Forked from facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research; we are continuously improving the project. Welcome to PR the works (p…
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Unsupervised Music Source Separation Using Differentiable Parametric Source Models
PyTorch implementation of "DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement"
Tiny library to interface with ALSA in the Linux kernel
The baseline system for the ICASSP2024 ICMC-ASR Challenge.
Production First and Production Ready End-to-End Speech Recognition Toolkit
Reformats Java source code to comply with Google Java Style.
The road to filter design, including FIR, IIR, sinc, Butterworth, etc.
Tips for best practices with filterbanks