Skip to content

juliagsy/immute

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

77 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ImMuTe

Introduction

Due to the limited number of image-music dataset in the field, the work investigates feasible and efficient methods of collecting image-music dataset. A combination of the Google MusicCaps text-music dataset and Stable Diffusion text-to-image Generative AI is applied. The work has succeeded in producing a reliable image-music-text dataset - ImMuTe of size 5521 data pairs for training and testing purposes.

Download

  1. GitHub repository
git clone https://github.com/juliagsy/immute
  1. Hugging Face
from datasets import load_dataset
dataset = load_dataset("juliagsy/immute")
  1. Manual script

Example shown here

Usage

from immute.dataset import ImMuTe

immute = ImMuTe("images", "caption.json", "audios", start=0, end=100, sampling_rate=32000, pixel=256)

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published