This is a PyTorch tutorial on Flow Matching for Text-To-Music. The goal of this repository is to help you learn flow matching at the code level through a fun task and a simple dataset.
git clone git@github.com:jakeoneijk/FlowMatchingTextToMusicTutorial.git
cd FlowMatchingTextToMusicTutorial
If you don't want to use a Conda environment, you may skip this step.
conda create -n flow python=3.11
conda activate flow
Install PyTorch.
👉 You should check your CUDA version and install a compatible PyTorch build.
pip install -r ./requirements.txt
Download the pretrained weights for both the AutoEncoder and CLAP models:
Save them to the following directory:
.
└── CKPT
    ├── autoencoder.pth
    └── music_audioset_epoch_15_esc_90.14.pt
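To confirm the weights landed in the right place, a quick existence check like the following can help (the paths simply mirror the tree above):

from pathlib import Path

# Checkpoint names taken from the directory tree above.
for name in ['autoencoder.pth', 'music_audioset_epoch_15_esc_90.14.pt']:
    path = Path('CKPT') / name
    assert path.exists(), f'missing checkpoint: {path}'
print('All pretrained weights are in place.')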
Download Medley-solos-DB
Download the Medley-solos-DB dataset and place it in the following directory:
.
└── Data
    └── Dataset
        └── MedleySolosDB
            ├── ~.wav
            ├── ...
            └── ~.wav
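Similarly, you can sanity-check the dataset location with a short script (the directory path is taken from the tree above):

from pathlib import Path

# Directory path taken from the tree above.
wav_files = sorted(Path('Data/Dataset/MedleySolosDB').glob('*.wav'))
print(f'Found {len(wav_files)} wav files')
assert wav_files, 'No wav files found - check the dataset location.'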
Set the run options via the Mode and Resource classes:

import torch


class Mode:
    # Choose how the model is optimized: 'diffusion' or 'flow'
    # (see the sketch after this snippet for how the two objectives differ).
    config_name: str = [
        'diffusion',
        'flow',
    ][1]
    # Currently only the "train" stage is supported.
    stage: str = {
        0: "preprocess",
        1: "train",
        2: "inference",
        3: "evaluate",
    }[1]


class Resource:
    # Choose the device: CUDA if available, otherwise CPU.
    device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
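Conceptually, the config_name choice changes what the network is asked to predict. Below is a minimal, self-contained sketch of the two training targets; the function name and the diffusion noise schedule are assumptions for illustration, not the repository's actual code.

import torch


def training_pair(x1: torch.Tensor, mode: str = 'flow'):
    """Return (noisy input, time, regression target) for one training step.

    x1 is a batch of clean latents. All names here are illustrative,
    not this repository's actual API.
    """
    x0 = torch.randn_like(x1)                              # Gaussian noise sample
    t = torch.rand(x1.shape[0], *([1] * (x1.dim() - 1)))   # time in [0, 1], broadcastable
    if mode == 'flow':
        # Rectified-flow-style linear path: x_t = (1 - t) * x0 + t * x1.
        # The network regresses the constant velocity x1 - x0.
        x_t = (1 - t) * x0 + t * x1
        target = x1 - x0
    else:
        # Diffusion-style noise prediction; the cosine schedule below is a
        # placeholder assumption, not necessarily the one this repo uses.
        alpha = torch.cos(0.5 * torch.pi * t)
        sigma = torch.sin(0.5 * torch.pi * t)
        x_t = alpha * x1 + sigma * x0
        target = x0
    return x_t, t, target

In both cases the model is trained with an MSE loss between its prediction and the target; at sampling time, the flow variant integrates the learned velocity field from noise to data with an ODE solver.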
If you don't set -lv (log visualizer), TensorBoard is used by default.
python Main.py -lv wandb -do
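If you keep the TensorBoard default, you can inspect the logs with the standard TensorBoard CLI; the log directory depends on where this repository writes its runs, so the path below is a placeholder:

tensorboard --logdir <path_to_log_dir>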