Final model for the 2018 Recys challenge
Switch branches/tags
Nothing to show
Clone or download
Latest commit 0350086 Jul 17, 2018
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
Artist_Album_Track_SPARSE.py Added all model files. Jun 30, 2018
Artist_Album_Track_SPARSE_LARGE.py Added all model files. Jun 30, 2018
BOW_FINAL.py Added all model files. Jun 30, 2018
Challenge1.py Added all model files. Jun 30, 2018
Challenge10_exp.py Added all model files. Jun 30, 2018
Challenge2_exp.py Added all model files. Jun 30, 2018
Challenge3_exp.py Added all model files. Jun 30, 2018
Challenge4_exp.py Added all model files. Jun 30, 2018
Challenge5_exp.py Added all model files. Jun 30, 2018
Challenge6_exp.py Added all model files. Jun 30, 2018
Challenge7_exp.py Added all model files. Jun 30, 2018
Challenge8_exp.py Added all model files. Jun 30, 2018
Challenge9_exp.py Added all model files. Jun 30, 2018
CreateSubmission_alternative.py Added all model files. Jun 30, 2018
Extract_Shared_Artist_Album.py Added all model files. Jun 30, 2018
Extract_Shared_Final_Artist_Album.py Added all model files. Jun 30, 2018
Extract_Special_Chars_From_Past_NAME.py Added all model files. Jun 30, 2018
Extract_Special_Chars_From_Playlist_NAME.py Added all model files. Jun 30, 2018
Extract_Special_Chars_From_Track_NAME.py Added all model files. Jun 30, 2018
Find_Common_Words.py Added all model files. Jun 30, 2018
Generate_features_scenario1.py Added all model files. Jun 30, 2018
Generate_features_scenario10.py Added all model files. Jun 30, 2018
Generate_features_scenario2.py Added all model files. Jun 30, 2018
Generate_features_scenario3.py Added all model files. Jun 30, 2018
Generate_features_scenario4.py Added all model files. Jun 30, 2018
Generate_features_scenario5.py Added all model files. Jun 30, 2018
Generate_features_scenario6.py Added all model files. Jun 30, 2018
Generate_features_scenario7.py Added all model files. Jun 30, 2018
Generate_features_scenario8.py Added all model files. Jun 30, 2018
Generate_features_scenario9.py Added all model files. Jun 30, 2018
Get_Proportion.py Added all model files. Jun 30, 2018
Henry Create Henry Jul 17, 2018
LICENSE Initial commit Jun 30, 2018
Load_Playlists.py Added all model files. Jun 30, 2018
Load_Testdata.py Added all model files. Jun 30, 2018
Most_Frequent_Model.py Added all model files. Jun 30, 2018
NEW_SAMPLER.py Added all model files. Jun 30, 2018
NEW_SAMPLER_album_artist_fixed.py Fixed some minor stuff Jun 30, 2018
README.md Update README.md Jun 30, 2018
Transformer11.py Added all model files. Jun 30, 2018
lgb_model1.py Update lgb_model1.py Jun 30, 2018
lgb_model10_exp.py Added all model files. Jun 30, 2018
lgb_model2_exp.py Added all model files. Jun 30, 2018
lgb_model3_exp.py Added all model files. Jun 30, 2018
lgb_model4_exp.py Added all model files. Jun 30, 2018
lgb_model5_exp.py Added all model files. Jun 30, 2018
lgb_model6_exp.py Added all model files. Jun 30, 2018
lgb_model7_exp.py Added all model files. Jun 30, 2018
lgb_model8_exp.py Added all model files. Jun 30, 2018
lgb_model9_exp.py Added all model files. Jun 30, 2018
make_data.py Fixed some minor stuff Jun 30, 2018
transformer.py Added Transforming files Jun 30, 2018
transformer10.py Added Transforming files Jun 30, 2018
transformer2.py Added Transforming files Jun 30, 2018
transformer3.py Added Transforming files Jun 30, 2018
transformer4.py Added Transforming files Jun 30, 2018
transformer5.py Added Transforming files Jun 30, 2018
transformer6.py Added Transforming files Jun 30, 2018
transformer7.py Added Transforming files Jun 30, 2018
transformer8.py Added Transforming files Jun 30, 2018
transformer9.py Added Transforming files Jun 30, 2018

README.md

Recsys2018

Final model for the 2018 Recys challenge by the team STORMtroopers

Our solutions can be reproduced as follows:

  1. Move all of these files to the directory "~/Spotify"
  2. Store all the csv files in the data directory
  3. Run transformer.py, transformer2.py, ... , transformer10.py (in that order!)
  4. Move all csv files to the data directory
  5. Run Transformer11.py
  6. Run make_data.py

These steps significantly reduce the disk space required creating a variety of dictionaries. We are now ready to run the models

  1. Run Challenge1.py
  2. Run Challenge2_exp.py ...
  3. Run Challenge10_exp.py

(N.B. We ran these models on large nodes. Some of these models request 25 cores and need 400GB of RAM - You can however reduce the amout of data used to reduce the computational ressources required.)

We can now match all these csv files into one final submission:

  1. Run CreateSubmission_alternative.py

This is it! Please do not hesitate to get in touch should you have any questions.