vad

Here are 128 public repositories matching this topic...

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Aug 5, 2025
Python

smacke / ffsubsync

Sponsor

Star

Automagically synchronize subtitles with video.

Updated Jul 20, 2025
Python

snakers4 / silero-vad

Star

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

Updated Jun 11, 2025
Python

CheshireCC / faster-whisper-GUI

Star

faster_whisper GUI with PySide6

openai vad whisper asr transcribe voice-transcription faster-whisper whisperx

Updated Dec 8, 2024
Python

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.

kotlin python c go csharp cpp speech-recognition vad asr voice-activity-detection

Updated May 30, 2025
C++

TEN-framework / ten-vad

Star

Voice Activity Detector(VAD) from TEN: low-latency, high-performance and lightweight

audio real-time voice-commands speech voice-recognition vad automatic-speech-recognition speech-processing conversational-ai voice-activity-detection silero-vad

Updated Aug 11, 2025
C

jtkim-kaist / VAD

Star

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

data speech dnn lstm speech-recognition attention vad voice-detection voice-activity-detection bdnn acam speech-activity-detection

Updated Jun 9, 2021
MATLAB

amsehili / auditok

Star

An audio/acoustic activity detection and audio segmentation tool

vad audio-data audio-activities audio-segmentation voice-detection voice-activity-detection

Updated Dec 11, 2024
Python

DmitryRyumin / ICASSP-2023-24-Papers

Star

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated May 5, 2025
Python

FluidInference / FluidAudio

Star

Fully Native Swift and CoreML. Efficient Speaker Diarization, VAD, and Speech-to-Text for realtime workloads

Updated Aug 11, 2025
Swift

shashikg / WhisperS2T

Star

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

deep-learning speech-recognition vad speech-to-text whisper asr tensorrt voice-activity-detection tensorrt-llm

Updated Aug 27, 2024
Jupyter Notebook

gkonovalov / android-vad

Star

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Updated Jul 15, 2025
C

gtreshchev / RuntimeAudioImporter

Star

Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.

Updated Feb 23, 2025
C++

filippogiruzzi / voice_activity_detection

Star

Voice Activity Detection based on Deep Learning & TensorFlow

python machine-learning deep-neural-networks deep-learning time-series tensorflow speech artificial-intelligence speech-recognition vad resnet deeplearning time-series-classification voice-activity-detection librispeech speech-detection librispeech-dataset mfcc-features

Updated Mar 24, 2023
Python

Baidu-AIP / speech-vad-demo

Star

集成Webrtc的VAD，用于切分音频文件

webrtc speech vad webrtc-vad

Updated Aug 26, 2020
C

EtienneAb3d / WhisperHallu

Star

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

text-to-speech sound-processing vad whisper audio-processing asr noise-removal vocals

Updated Nov 12, 2024
Python

Picovoice / cobra

Star

On-device voice activity detection (VAD) powered by deep learning

speech-recognition vad voice-activity-detection on-device voice-activity voice-activity-detector

Updated Aug 6, 2025
Python

xiongyihui / python-webrtc-audio-processing

Star

Python bindings of WebRTC Audio Processing

python vad ns agc webrtc-audio-processing

Updated May 7, 2025
C++

eesungkim / Voice_Activity_Detector

Star

A statistical model-based Voice Activity Detection

vad voice-detection voice-activity-detection

Updated Nov 30, 2018
Jupyter Notebook

asiff00 / On-Device-Speech-to-Speech-Conversational-AI

Star

This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and natural interruption handling.

tts vad audio-processing asr voice-assistant conversational-ai speech-to-speech ollama kokoro-tts

Updated Apr 17, 2025
Python

Improve this page

Add a description, image, and links to the vad topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vad topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vad

Here are 128 public repositories matching this topic...

modelscope / FunASR

smacke / ffsubsync

snakers4 / silero-vad

CheshireCC / faster-whisper-GUI

k2-fsa / sherpa-ncnn

TEN-framework / ten-vad

jtkim-kaist / VAD

amsehili / auditok

DmitryRyumin / ICASSP-2023-24-Papers

FluidInference / FluidAudio

shashikg / WhisperS2T

gkonovalov / android-vad

gtreshchev / RuntimeAudioImporter

filippogiruzzi / voice_activity_detection

Baidu-AIP / speech-vad-demo

EtienneAb3d / WhisperHallu

Picovoice / cobra

xiongyihui / python-webrtc-audio-processing

eesungkim / Voice_Activity_Detector

asiff00 / On-Device-Speech-to-Speech-Conversational-AI

Improve this page

Add this topic to your repo