A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
-
Updated
Jun 13, 2024 - Python
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Automagically synchronize subtitles with video.
Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
Python AI assistant 🧠
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
An audio/acoustic activity detection and audio segmentation tool
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
A python package to build AI-powered real-time audio applications
Voice Activity Detection based on Deep Learning & TensorFlow
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
The codebase for Data-driven general-purpose voice activity detection.
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
A collection of basic python modules for spoken natural language processing
Auto transcribe tool based on whisper
🦁 Nala is an agile open-source voice assistant framework (20+ actions).
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
Add a description, image, and links to the voice-activity-detection topic page so that developers can more easily learn about it.
To associate your repository with the voice-activity-detection topic, visit your repo's landing page and select "manage topics."