PreProcessing-MillionDatasetsPlaylist

This repository releases a source code to pre-process the data files published together with the paper Melon Playlist Dataset: a public dataset for audio-based playlist generation and music tagging. by Andres Ferraro, Yuntae Kim, Soohyeon Lee, Biho Kim, Namjun Jo, Semi Lim, Suyon Lim, Jungtaek Jang, Sehwan Kim, Xavier Serra, Dmitry Bogdanov.

Step 0. Install the Dependencies

After having clone this repository with

git clone repo-name

we suggest creating e virtual environment install the required Python dependencies with the following commands

python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Step 1. Download the Data

Download the dataset provided form the public Melon link placing them in the

.original_dataset/

Step 2. Run the Script

To measure all the results it is necessary to run the following command in the terminal

python preprocess_data_with_pandas.py

Note that the operation is time-consuming. We have released preprocess_dataset to speed up it.

Step 3. Output Files

The result files will be stored in

.melon/*

with the following formats:

dataset.tsv has playlist_id [TAB] song_id [TAB] 1 [TAB] sequence-order
playlist_title.tsv has playlist_id [TAB] title
... same pattern in the other files

Name		Name	Last commit message	Last commit date
Latest commit History 91 Commits
original_dataset		original_dataset
src		src
.gitignore		.gitignore
README.md		README.md
extract_features.py		extract_features.py
preprocess_data_with_pandas.py		preprocess_data_with_pandas.py
preprocess_dataset.py		preprocess_dataset.py
requirements.txt		requirements.txt
train_cnn.py		train_cnn.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PreProcessing-MillionDatasetsPlaylist

Step 0. Install the Dependencies

Step 1. Download the Data

Step 2. Run the Script

Step 3. Output Files

About

Releases

Packages

Contributors 2

Languages

merrafelice/PreProcessing-MillionDatasetsPlaylist

Folders and files

Latest commit

History

Repository files navigation

PreProcessing-MillionDatasetsPlaylist

Step 0. Install the Dependencies

Step 1. Download the Data

Step 2. Run the Script

Step 3. Output Files

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages