DIRAC ICML '24

Here we will provide the source code and supplementary material for DIRAC: Diffusion-Based Representation Learning for Modality-Agnostic Compositionality.

Please, refer to our website https://anonymsubicml24.github.io/anonymsubicml24/ for listening to the audio results.

Abstract

In this paper, we target the extrapolation and out-of-distribution generation problem in generative models by introducing a generic compositional inductive bias. Leveraging state-of-the-art generative models in an encoder-decoder scheme, our approach focuses on compositional representation learning without any form of supervision. We perform experiments on image and audio data, demonstrating the adaptability of our model to different modalities and representations. Our Diffusion-based Representation Learning for Modality-Agnostic Compositionality (DIRAC), builds upon diffusion models and shows promising results in separating meaningful entities in both images and music, serving as a powerful baseline for future investigations around compositional generation and representation learning.

Images experiments

Audio experiments

Code

We will provide the code for the DIRAC model in this repository. The code will be available soon.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
docs		docs
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DIRAC ICML '24

Abstract

Images experiments

Audio experiments

Code

About

Releases

Packages

anonymsubicml24/anonymsubicml24

Folders and files

Latest commit

History

Repository files navigation

DIRAC ICML '24

Abstract

Images experiments

Audio experiments

Code

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages