Music Recommender

We want to find out how to introduce users to new songs that they enjoy listening to, and evaluate if it is more successful to suggest songs based on what other users with similar tastes listen to (collaborative filtering strategy), or suggest songs that have similar qualities to songs that the user likes (content-based filtering strategy). Both approaches try to answer the question of "closeness," but the data that is used differs. Collaborative filtering works on a large sample of (user, song) pairs, and content-based filtering requires an in-depth analysis of each song's acoustic features and metadata.

For our collaborative filtering approach, we implemented a Logistic Matrix Factorization probabilistic model using implicit user feedback.

For our content-based approach, we use three different metrics to find similarity between the user profile vector and song vectors. We will experiment with Euclidean distance, Cosine distance, and Pearson Correlation.

Setting up the Environment

In the directory, to set up the virtual environment:

python3 -m venv .venv

Then to use the environment, run

source .venv/bin/activate

or

.\.venv\bin\activate.bat

to enter the virtual environment.

Use

pip install -r requirements.txt

to get the requirements and

deactivate

to exit the virtual environment.

Performance

Run

python performance.py [-s]

The number of ss in the flag indicate how small of a dataset we want. No -s is the whole dataset, -s is one tenth, -ss is one hundredth, ...

This will initilize, train, and print out the MPR score of the two recommender schemes (see below).

Open the file to change tuning parameters or update k in k-cross-fold validation.

Collaborative Filtering

Run

python collaborative.py [-s]

Running this file will read in the collaborative matrix and store it as a sparse matrix.

Running

python logistic_mf.py [-s]

will print the log likelihood of the trained matrix, and also create a rank matrix.

Content Filtering

Run

python content_rec.py

to try out the content recommender. It uses both the triplets and track_data datasets. Running this file doesn't print anything, but it's an easy way to verify that the code runs.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.gitignore		.gitignore
Group 83 - Project Report.pdf		Group 83 - Project Report.pdf
README.md		README.md
clean_track_dataset.py		clean_track_dataset.py
collaborative.py		collaborative.py
combine_datasets.py		combine_datasets.py
content_rec.ipynb		content_rec.ipynb
content_rec.py		content_rec.py
create_dataset.py		create_dataset.py
five_users_track_data.csv		five_users_track_data.csv
five_users_triplets.csv		five_users_triplets.csv
five_users_user_profile_mat.csv		five_users_user_profile_mat.csv
logistic_mf.py		logistic_mf.py
mid_track_data.csv		mid_track_data.csv
mid_triplets.csv		mid_triplets.csv
mini_track_data.csv		mini_track_data.csv
mini_triplets.csv		mini_triplets.csv
one_user_track_data.csv		one_user_track_data.csv
one_user_triplets.csv		one_user_triplets.csv
one_user_user_profile_mat.csv		one_user_user_profile_mat.csv
performance.py		performance.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Music Recommender

Setting up the Environment

Performance

Collaborative Filtering

Content Filtering

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Music Recommender

Setting up the Environment

Performance

Collaborative Filtering

Content Filtering

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages