Skip to content

This is the official repo for our EMNLP'23 paper "ALCAP: Alignment-Augmented Music Captioner".

Notifications You must be signed in to change notification settings

zihaohe123/ALCAP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ALCAP

In this paper we propose to learn the alignment between audio and lyrics using contrastive learning to achieve higher-quality music captions.

Framework

Data Download

For copyright considerations we are only able to provide the song interpretation dataset but not the netease dataset.

  1. Download the metadata to data/music4all.
  2. Download the song waveforms to data/music4all/audios.
  3. (Optional) Download the song embeddings to data/music4all/audios. If not downloaded the code will generate the embeddings from scratch.
  4. (Optional) Download the CNN music encoder to ckp/

Model Training

python run_train.py

Try different corpora and random seeds.

Inference

python run_eval.py

Try different corpora and random seeds.

Citation

@inproceedings{he2023alcap,
  title={ALCAP: Alignment-Augmented Music Captioner},
  author={He, Zihao and Hao, Weituo and Lu, Wei-Tsung and Chen, Changyou and Lerman, Kristina and Song, Xuchen},
  booktitle={Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing},
  pages={16501--16512},
  year={2023}
}

About

This is the official repo for our EMNLP'23 paper "ALCAP: Alignment-Augmented Music Captioner".

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages