Real-time multilingual face translator
logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source separation systems.
Ultimate Vocal Remover for Google Colab
The PyTorch-based audio source separation toolkit for researchers
Unofficial PyTorch implementation of Google AI's VoiceFilter system
OpenVINO DevCUP music separation & transcription
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
An exploration of blind source audio separation using spiking neural networks. Latency, power, and intelligibility are primary objectives, while bio-plausibility is left as a secondary objective to be addressed in the future.
Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)
A PyTorch implementation of DNN-based source separation.
Software that performs the separation of vocals from music using neural networks (part of my Bachelor's thesis).
Download multiple tracks from YouTube with a single query (includes a GUI).
A convolutional neural network for blind audio source separation.
Deezer source separation library including pretrained models.
A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.