Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
-
Updated
Dec 21, 2023 - Python
Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
An audio/acoustic activity detection and audio segmentation tool
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection
pitch detection,CNN
Build a digital music library by downloading and segmenting youtube videos.
tensorflow for speech-music-detection task,acc 96%+
Automatic generation of speech dataset markup using Wav2Vec2 ASR models
Spliting speech WAV PCM files to fragments with use of energy signal minimums (speech pauses).
SEGAUGMENT: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Whole Audio Analysis with Python
Add a description, image, and links to the audio-segmentation topic page so that developers can more easily learn about it.
To associate your repository with the audio-segmentation topic, visit your repo's landing page and select "manage topics."