Skip to content

This is the repository for the visum 2021 project competition!

Notifications You must be signed in to change notification settings

visum-summerschool/visum-competition2021

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

64 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

visum-competition2021

VISUM 2021 Summer School Digital Edition - The Competition Repository

Here you will find all information regarding the competition, including:

  1. General information about the challenge, including the evaluated metrics.
  2. Detailed technical instructions about the competition.
  3. Auxiliary source code, including the source code of the baseline solution.

We wish you all good luck and much success in your work :)

Presentation - here

FAQs - here

Submission Platform - here

Final Leaderboard - here

How to run the baseline code (from the /home/visum folder)

  1. Access the baseline source code by either:
    • Accessing the files already present in your machine OR
    • Cloning this repo into your machine by doing and afterwards moving the contents of the created directory into /home/visum
      git clone https://github.com/visum-summerschool/visum-competition2021.git
      cp -r visum-competition2021/ .
      rm -r visum-competition2021 
      
  2. run python3 split_data_nondisjoint.py to create the training and validation splits in /home/visum/processed_data
  3. run python3 generate_community_prods.py to generate the louvain communities
  4. Either:
    • train the baseline using python3 train.py (model weights and checkpoints are saved in /home/visum/results/<timestamp>) OR
    • use the baseline weights which are already in /home/visum/results/.
  5. IF you trained a new model you need to copy the weights (and the tokenizer) into /home/visum by doing:
       cp -r results/<timestamp>/tokenizer results/.
       cp results/<timestamp>/best_model_weights.pth results/.
    
    where <timestamp> corresponds to the name of the folder where the trained model is located. Usually this will be a timestamp in the format YYYY-MM-DD_hh-mm-ss.
  6. generate predictions for your model with python3 test.py
  7. check the predictions are in the correct format by running python3 evaluate.py preds.csv /home/master/dataset/test/solutions.csv
  8. submit your results by accessing here

[Optional] How to test your model on the validation split

The steps described in the previous section allow you to train your model and test it on some dummy test queries (generated from the train data) located in home/master/dataset/test. However, you might want to test the model on the validation split created in step 2. To do so, consider the following steps:

  1. run python3 generate_test_queries.pyto generate queries from your validation split
  2. test your model with python3 test.py -t processed_data/valid
  3. evaluate your model by running python3 evaluate.py preds.csv processed_data/valid/solutions.csv

Check for erros in your submission:

cd /home/visum/
rm preds.csv
python3 test.py
python3 evaluate.py preds.csv /home/master/dataset/test/solutions.csv```


 

About

This is the repository for the visum 2021 project competition!

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages