labels.csv: Duplicated Dockerfiles with labels
- Run ./experiments/I-parse/1-phase-1-dockerfile-asts/generate.sh for parsing phase I.
- Run ./experiments/I-parse/2-phase-2-dockerfile-asts/generate.sh for parsing phase II.
- Run ./experiments/I-parse/3-phase-3-dockerfile-asts/generate.sh for parsing phase III.
- Run ./experiments/I-parse/4-phase-4-dockerfile-asts/generate.sh for parsing phase IV.
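
The four parsing steps above can be sketched as a small driver script. This is only an illustration, not part of the repository: it assumes the scripts are launched from the repository root, that they can be started with `bash`, and that each phase should finish before the next begins. The names `PARSE_SCRIPTS` and `run_parse_phases` are hypothetical.

```python
import subprocess

# Script paths copied verbatim from the steps above, in phase order.
PARSE_SCRIPTS = [
    "./experiments/I-parse/1-phase-1-dockerfile-asts/generate.sh",
    "./experiments/I-parse/2-phase-2-dockerfile-asts/generate.sh",
    "./experiments/I-parse/3-phase-3-dockerfile-asts/generate.sh",
    "./experiments/I-parse/4-phase-4-dockerfile-asts/generate.sh",
]


def run_parse_phases(repo_root=".", dry_run=True):
    """Run the parsing scripts one after another (hypothetical helper).

    With dry_run=True the commands are only printed. With dry_run=False
    each script is started with bash (an assumption) and the first
    failing script raises subprocess.CalledProcessError, so later
    phases are not started.
    """
    handled = []
    for script in PARSE_SCRIPTS:
        if dry_run:
            print("would run:", script)
        else:
            subprocess.run(["bash", script], cwd=repo_root, check=True)
        handled.append(script)
    return handled


if __name__ == "__main__":
    # Print the planned commands without executing anything.
    run_parse_phases(dry_run=True)
```

Calling `run_parse_phases(dry_run=False)` from the repository root would execute the four scripts for real; the dry-run default is there so the order can be checked first.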
- Run ./experiments/II-feature/word2vec for corpus training.
- Run ./experiments/II-feature/feature_save for feature saving.
- Run ./experiments/III-prediction/transformer_predict.py for DeepPDBR (Our Method) prediction.
For this part of the discussion, we selected 100 Dockerfiles from dataset.tar.gz. They are identified by their prefix names, which are listed in RQ6.2-IDs.csv.
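
Looking up the selected Dockerfiles by prefix could be sketched as follows. The layout of RQ6.2-IDs.csv and of the archive is not described here, so this sketch makes several assumptions: the prefix is in the first CSV column, the file has no header row, and dataset.tar.gz has already been extracted to a local directory. The function names are hypothetical.

```python
import csv
from pathlib import Path


def read_prefixes(ids_csv):
    """Return the first column of every non-empty row.

    Assumption: the first column holds the Dockerfile prefix name and
    there is no header row (a header would be returned as a prefix).
    """
    with open(ids_csv, newline="") as f:
        return [row[0].strip() for row in csv.reader(f) if row and row[0].strip()]


def select_by_prefix(dataset_dir, prefixes):
    """Map each prefix to the files whose names start with it.

    Assumption: dataset_dir is the directory obtained by extracting
    dataset.tar.gz; it is searched recursively.
    """
    files = [p for p in Path(dataset_dir).rglob("*") if p.is_file()]
    return {
        prefix: sorted(p for p in files if p.name.startswith(prefix))
        for prefix in prefixes
    }
```

A prefix that maps to an empty list indicates that the assumed layout does not match the actual data and the column or directory choice needs adjusting.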