Centre for Digital Music, QMUL
Stars
A note-aligned performance-to-score-to-annotations dataset of 12 complete Mozart piano sonatas for expressive performance analysis
Official PyTorch implementation for "Large Language Diffusion Models"
CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Video Generation Foundation Models: https://saiyan-world.github.io/goku/
deepbeepmeep / YuEGP
Forked from multimodal-art-projection/YuE
YuE: Open Full-song Generation Foundation for the GPU Poor
Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024
"Neural Loop Combiner: Neural Network Models For Assessing The Compatibility of Loops", ISMIR 2020
YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
The augmented version of Expert-Novice Dataset from 'Expert and Novice Evaluations of Piano Performances'
PyTorch implementation of AudioLCM (ACM-MM'24): efficient and high-quality text-to-audio generation with a latent consistency model.
Code for the ISMIR 2022 paper "Beat Transformer: Demixed Beat and Downbeat Tracking with Dilated Self-Attention"
The latent diffusion model for text-to-music generation.
Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging different representations and enhancing generation with RAG.
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
🎼 The clean and modern way of accessing IMSLP data and scores programmatically. 🎶
A small CLI tool that downloads sheet music from MuseScore without the hassle
Cambridge-MT dataset auto-download & resampling.
PyTorch implementation of Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.