A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.
-
Updated
May 27, 2024 - Python
A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Config files for my GitHub profile.
Spoofing voice detection : 2nd YAICON
An audio/acoustic activity detection and audio segmentation tool
Gecko - A Tool for Effective Annotation of Human Conversations
Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa.
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
End to end AWS SageMaker application for detecting the AWS Polly voice in an audio recording using Gluon and MXNet.
A statistical model-based Voice Activity Detection
this is a p5js experiment that uses voice detection and cursor movement to multiply creative content in a variety of colours
Efficient voice activity detection algorithm using long-term speech information
Add a description, image, and links to the voice-detection topic page so that developers can more easily learn about it.
To associate your repository with the voice-detection topic, visit your repo's landing page and select "manage topics."