GUI for a Vocal Remover that uses Deep Neural Networks.
-
Updated
Dec 9, 2024 - Python
GUI for a Vocal Remover that uses Deep Neural Networks.
A PyTorch-based Speech Toolkit
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Code for the paper "Jukebox: A Generative Model for Music"
Automagically synchronize subtitles with video.
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
MusicBrainz Picard audio file tagger
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Auto-Editor: Efficient media analysis and rendering
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Noise supression using deep filtering
Data manipulation and transformation for audio signal processing, powered by PyTorch
Cast macOS and Linux Audio/Video to your Google Cast and Sonos Devices
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Add a description, image, and links to the audio topic page so that developers can more easily learn about it.
To associate your repository with the audio topic, visit your repo's landing page and select "manage topics."