This repository contains data preprocessing and analysis techniques for audio data.
-
Updated
Oct 25, 2024 - Python
This repository contains data preprocessing and analysis techniques for audio data.
Redis integration for the audio-dataset-converter library.
For converting audio datasets from one format into another.
Faster whisper plugins for the audio-dataset-converter library.
Visualization plugins for the audio-dataset-converter library.
Interpseech 2024 - News Topic classification (dataset, evaluation and models source)
Download and convert the original MTG Jamendo dataset to opus
Fine tuning Whisper-Small LLM for Hinglish Audio dataset
These are different files I created to do different tasks when I was working on creating ASR model for mTEDx dataset.
A utility for wrapping the Free Spoken Digit Dataset into PyTorch-ready data set splits.
[AAAI 2023] AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
Source code for baseline obtenience
CNN Based Audio and Image Captcha Breaker Project
The Abuse Project Audio Dataset (TAPAD). Think MNIST for audio profanity.
Add a description, image, and links to the audio-dataset topic page so that developers can more easily learn about it.
To associate your repository with the audio-dataset topic, visit your repo's landing page and select "manage topics."