dmarx

Follow

David Marx dmarx

Follow

Engineer / Machine Learning Researcher interested in deep learning, probabilistic ML, generative models, multi-modal SSL, visual understanding, geometric

577 followers · 376 following

CoreWeave, EleutherAI
Seattle, WA
http://dmarx.github.io
@digthatdata.bsky.social
@DigThatData

Achievements

Achievements

Organizations

Stars

ml audio

263 repositories

spotify / pedalboard

🎛 🔊 A Python library for audio.

C++ 5,976 317 Updated Feb 2, 2026

deezer / spleeter

Deezer source separation library including pretrained models.

Python 28,048 3,065 Updated Apr 2, 2025

facebookresearch / demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 9,767 1,421 Updated Apr 24, 2024

JCBrouwer / maua-stylegan2

This is the repo for my experiments with StyleGAN2. There are many like it, but this one is mine. Contains code for the paper Audio-reactive Latent Interpolations with StyleGAN.

Python 179 29 Updated Jun 26, 2021

archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Python 2,095 177 Updated Jun 12, 2023

acids-ircam / RAVE

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Python 1,687 216 Updated Jun 23, 2025

csteinmetz1 / steerable-nafx

Steerable discovery of neural audio effects

Jupyter Notebook 208 18 Updated Mar 2, 2022

marcoppasini / musika

Fast Infinite Waveform Music Generation

Python 687 50 Updated Oct 28, 2022

csteinmetz1 / auraloss

Collection of audio-focused loss functions in PyTorch

Python 851 74 Updated Jul 30, 2024

magenta / music-spectrogram-diffusion

Jupyter Notebook 399 33 Updated Jan 16, 2025

NotNANtoN / AudioCLIP

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Python 4 2 Updated Sep 30, 2021

RasaHQ / rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 21,059 4,910 Updated Jan 29, 2026

lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,618 282 Updated Jan 12, 2025

GuitarML / SmartGuitarAmp

Guitar plugin made with JUCE that uses neural networks to emulate a tube amplifier.

C++ 1,313 62 Updated Apr 11, 2023

cdown / srt

A simple library and set of tools for parsing, modifying, and composing SRT files.

Python 530 52 Updated Mar 19, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 94,946 11,788 Updated Dec 15, 2025

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 11,231 1,651 Updated Feb 11, 2026

ilaria-manco / mulap

Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)

Python 47 3 Updated Dec 3, 2024

ilaria-manco / muscall

Official implementation of "Contrastive Audio-Language Learning for Music" (ISMIR 2022)

Python 122 11 Updated Dec 5, 2024

ilaria-manco / muscaps

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

Jupyter Notebook 85 7 Updated Dec 3, 2024

GuitarML / SmartGuitarPedal

Guitar plugin made with JUCE that uses neural network models to emulate real world hardware.

C++ 291 22 Updated Oct 12, 2022

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,213 1,017 Updated Feb 20, 2026

Harmonai-org / sample-generator

Tools to train a generative model on arbitrary audio samples

Jupyter Notebook 1,111 175 Updated Apr 29, 2024

LAION-AI / audio-dataset

Audio Dataset for training CLAP and other models

Python 730 59 Updated Jan 8, 2026

archinetai / audio-diffusion-pytorch-trainer

Trainer for audio-diffusion-pytorch

Python 129 22 Updated Jan 13, 2023

archinetai / audio-data-pytorch

A collection of useful audio datasets and transforms for PyTorch.

Python 144 23 Updated Feb 11, 2023

adobe-research / DeepAFx-ST

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

Python 404 51 Updated May 30, 2023

iver56 / audiomentations

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,233 211 Updated Dec 27, 2025

maum-ai / nuwave

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021

Python 283 20 Updated Jul 22, 2022

microsoft / muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,904 494 Updated Oct 12, 2024