download_data.py
to get data for all 6 languagescreate_6L_data.py
to create 6L datasetcreate_5L_data.py
to create 5L datasetpython bpe.py --lang java --vocab_max_size 10000 --bpe_pct 0.5
to create bpe dataset for javapython bpe.py --lang 6L --vocab_max_size 10000 --bpe_pct 0.5
to create bpe dataset for 6Lpython bpe.py --lang 5L --vocab_max_size 10000 --bpe_pct 0.5
to create bpe dataset for 5Lpython create_vocab.py
to create vocabularies
python main_code_retrieval.py --lang java --model transformer --seed 1
model on code retrieval task, in java, w/ random initial weightspython main_method_prediction.py --lang java --model transformer --seed 1
model on method prediction task, in java, w/ random initial weights
python main_code_retrieval.py --lang 6L-java --model transformer --seed 1
python main_method_prediction.py --lang 6L-java --model transformer --seed 1
python main_code_retrieval.py --lang 5L-java --model transformer --seed 1
pythom main_method_prediction.py --lang 5L-java --model transformer --seed 1
python main_sequence_lm.py --lang java --model --transformer --data code --seed 1
python main_sequence_lm.py --lang java --model --transformer --data code --seed 1
python main_code_retrieval.py --lang java --model transformer --seed 1 --load
model on code retrieval task, in java, fine tunedpython main_method_prediction.py --lang java --model transformer --seed 1 --load
model on method prediction task, in java, fine tuned