Lightweight speech-to-speech web-based chat app combining speech recognition, LLM completion and text-to-speech. Implemented with Python (Flask) and vanilla JavaScript only.
Updated Mar 3, 2024 - Python
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
Scoring Toolkit for the Fearless Steps Challenge Phase-02 Tasks
Detecting depressed patients based on speech activity and pauses in speech, using a deep learning approach
The Voxseg implementation in PyTorch. Voxseg is a Python library for voice activity detection (VAD) for speech/non-speech segmentation.
The codebase for Data-driven general-purpose voice activity detection.
Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper
CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.