This is the code repository for our ICPC 2021 paper "Improving Code Summarization with Block-wise Abstract Syntax Tree Splitting".
Code splitting. As the limitation of LFS, the splitting code data set and experiment dataset can be downloaded from Google Drive or 阿里云盘.
See the readme of the
data_preprocessfolder for details.
Note: You can skip this step, directly download our processed dataset
split_test/train/valid_ast.jsonand proceed to the next step.
Tree-LSTM_pretrain. See the readme of the
Tree-LSTM_pretrainfolder for details.
Train BASTS model. See the readme of the
BASTSfolder for details.