dmarx

Organizations

@pytti-tools

Stars

ml audio

263 repositories

🎛 🔊 A Python library for audio.

C++ 5,976 317 Updated Feb 2, 2026

Deezer source separation library including pretrained models.

Python 28,048 3,065 Updated Apr 2, 2025

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 9,767 1,421 Updated Apr 24, 2024

This is the repo for my experiments with StyleGAN2. There are many like it, but this one is mine. Contains code for the paper Audio-reactive Latent Interpolations with StyleGAN.

Python 179 29 Updated Jun 26, 2021

Audio generation using diffusion models, in PyTorch.

Python 2,095 177 Updated Jun 12, 2023

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Python 1,687 216 Updated Jun 23, 2025

Steerable discovery of neural audio effects

Jupyter Notebook 208 18 Updated Mar 2, 2022

Fast Infinite Waveform Music Generation

Python 687 50 Updated Oct 28, 2022

Collection of audio-focused loss functions in PyTorch

Python 851 74 Updated Jul 30, 2024
Jupyter Notebook 399 33 Updated Jan 16, 2025

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Python 4 2 Updated Sep 30, 2021

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 21,059 4,910 Updated Jan 29, 2026

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch

Python 2,618 282 Updated Jan 12, 2025

Guitar plugin made with JUCE that uses neural networks to emulate a tube amplifier.

C++ 1,313 62 Updated Apr 11, 2023

A simple library and set of tools for parsing, modifying, and composing SRT files.

Python 530 52 Updated Mar 19, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 94,946 11,788 Updated Dec 15, 2025

A PyTorch-based Speech Toolkit

Python 11,231 1,651 Updated Feb 11, 2026

Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)

Python 47 3 Updated Dec 3, 2024

Official implementation of "Contrastive Audio-Language Learning for Music" (ISMIR 2022)

Python 122 11 Updated Dec 5, 2024

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

Jupyter Notebook 85 7 Updated Dec 3, 2024

Guitar plugin made with JUCE that uses neural network models to emulate real-world hardware.

C++ 291 22 Updated Oct 12, 2022

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,213 1,017 Updated Feb 20, 2026

Tools to train a generative model on arbitrary audio samples

Jupyter Notebook 1,111 175 Updated Apr 29, 2024

Audio Dataset for training CLAP and other models

Python 730 59 Updated Jan 8, 2026

Trainer for audio-diffusion-pytorch

Python 129 22 Updated Jan 13, 2023

A collection of useful audio datasets and transforms for PyTorch.

Python 144 23 Updated Feb 11, 2023

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

Python 404 51 Updated May 30, 2023

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,233 211 Updated Dec 27, 2025

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021

Python 283 20 Updated Jul 22, 2022

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,904 494 Updated Oct 12, 2024