CLAudioLM: Text-Conditioned Music Generation in Waveform Domain

idl-project:

CLAudioLM: Text-Conditioned Music Generation in Waveform Domain

We tackle the task of generating music in the raw audio domain conditioned with text inputs. For this, we implemented a variant of the MusicLM[1] architecture, which models the generation of music as an auto-regressive task of hierarchical discrete tokens. The main updates made to the MusicLM architecture were chang- ing the music embedding component, called MuLan[5], with CLAP[21]. We also changed the SoundStream component, a universal neural audio codec, with EnCodec[29] which has similar performance and is publicly available. Our goal is to out-perform previous methods by generating a new dataset to fine-tune the CLAP component using distilled musical knowledge from a large language model like ChatGPT[30]. We observed that ChatGPT can generate subjectively accurate and expressive captions for a whole variety of songs, which can be effective in improving the overall text conditioning for this model.

Usage

The main portion of the code is in the form of jupyter notebook files in the folder notebooks.

There are other auxiliary scripts on deployment that were used to execute code (datase extraction and some training) on the AWS cloud using IaC.

Citations

This code uses directly or reuses code from the following repos:

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
artifacts		artifacts
configs		configs
deployment		deployment
source/notebooks		source/notebooks
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

idl-project:

CLAudioLM: Text-Conditioned Music Generation in Waveform Domain

Usage

Citations

About

Releases

Packages

Languages

krishhrana/idl-project-ClaudioLM

Folders and files

Latest commit

History

Repository files navigation

idl-project:

CLAudioLM: Text-Conditioned Music Generation in Waveform Domain

Usage

Citations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages