IRCoCo: Immediate Rewards-Guided Deep Reinforcement Learning for Code Completion

Datasets

In IRCoCo, we use two large-scale datasets for our experiments: one Python dataset (Py150) and one Java dataset (Java Corpus). If you want to train the model, you must first download the datasets.

Py150

To download and preprocess the dataset, navigate to the data/py150 directory and run

bash download_and_extract.sh
python preprocess.py --base_dir=py150_files --output_dir=token_completion

Java Corpus

To download the preprocessed dataset, navigate to the data/javaCorpus directory and run

bash download.sh
python preprocess.py --base_dir=token_completion --output_dir=token_completion
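
After preprocessing, it can help to sanity-check the output before training. The sketch below is only illustrative: it assumes the CodeXGLUE layout of one space-separated token sequence per line in token_completion/train.txt (the same layout for both py150 and javaCorpus); adjust the path if your output differs.

# sanity-check the preprocessed output; one tokenized program per line is assumed
from pathlib import Path

path = Path("token_completion/train.txt")  # same layout for py150 and javaCorpus
with path.open(encoding="utf-8") as f:
    for i, line in zip(range(3), f):
        tokens = line.split()
        print(f"example {i}: {len(tokens)} tokens, first tokens: {tokens[:5]}")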

Sample RL training data

The training data for our reinforcement learning stage is obtained through model sampling and organized in the APPS format. Process the data by running arrow.py. Please refer to the paper for details, and see the data/RL_data folder for examples.
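
For orientation, the sketch below shows what one APPS-style record could look like when built from model sampling. The field names follow the public APPS convention ("question" for the prompt, "solutions" for a JSON-encoded list of completions); the exact schema produced by arrow.py may differ, so treat data/RL_data as the authoritative reference.

# a hypothetical APPS-style record built from model sampling
import json

record = {
    "question": "def binary_search(arr, target):\n    lo, hi = 0, len(arr) - 1",  # code prefix used as the prompt
    "solutions": json.dumps([
        "\n    while lo <= hi:",  # sampled continuation(s) from the fine-tuned LM
    ]),
}
print(json.dumps(record, indent=2))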

Fine-tune LM through SFT

Refer to the fine-tuning steps in CodeXGLUE.
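
As a rough picture of what the SFT stage does (a minimal sketch, not the CodeXGLUE pipeline itself): standard causal-language-model fine-tuning of GPT-2 on the preprocessed token files using the transformers Trainer. Paths and hyperparameters here are placeholders.

# minimal SFT sketch: causal-LM fine-tuning of GPT-2 on the token files
from transformers import (GPT2LMHeadModel, GPT2TokenizerFast, TextDataset,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default
model = GPT2LMHeadModel.from_pretrained("gpt2")

train_data = TextDataset(tokenizer=tokenizer,
                         file_path="token_completion/train.txt",  # placeholder path
                         block_size=256)
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm=False)

args = TrainingArguments(output_dir="sft_gpt2", num_train_epochs=5,
                         per_device_train_batch_size=4, learning_rate=5e-5, fp16=True)
Trainer(model=model, args=args, train_dataset=train_data,
        data_collator=collator).train()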

Code Completion Evaluator (taking BLEU as an example)

Generate training dataset for code completion evaluator

We use the GPT-2 model as an example to show the steps:

python generate_score.py --checkpoint_path PATH_OF_FINETUNED_GPT2 --mode score --model gpt2 --batch_size 1 --text_length 256 --min_query_len 10 --dataset_name data/final/jsonl --language python (or java)
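
Each training example for the evaluator pairs a sampled (partial) completion with a scalar quality label. The sketch below illustrates the kind of label generate_score.py is assumed to compute for the BLEU metric: a smoothed sentence-level BLEU between a sampled completion and the ground-truth continuation. The concrete token strings are made up.

# illustrative BLEU label for one sampled completion (hypothetical tokens)
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

reference = "for i in range ( 10 ) : print ( i )".split()   # ground-truth continuation
hypothesis = "for i in range ( 10 ) : pass".split()         # sampled completion
smooth = SmoothingFunction().method1
score = sentence_bleu([reference], hypothesis, smoothing_function=smooth)
print(f"bleu = {score:.4f}")  # this scalar is the evaluator's regression target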

Training the code completion evaluator

python train_estimator.py --batch_size 8 --run_name A_NAME_AS_YOU_WANT --epoch 30 --data_path PATH_OF_GENERATED_DATASET --metric gpt2_bleu --language python (or java)
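
Conceptually, the evaluator is a learned reward model: it reads a partial completion and predicts the quality score generated in the previous step. The sketch below is a hypothetical architecture (a GPT-2 backbone with a scalar regression head trained with MSE against the gpt2_bleu labels); the actual implementation lives in train_estimator.py.

# hypothetical reward-estimator architecture: GPT-2 backbone + regression head
import torch
import torch.nn as nn
from transformers import GPT2Model

class RewardEstimator(nn.Module):
    def __init__(self):
        super().__init__()
        self.backbone = GPT2Model.from_pretrained("gpt2")
        self.head = nn.Linear(self.backbone.config.n_embd, 1)

    def forward(self, input_ids, attention_mask):
        hidden = self.backbone(input_ids=input_ids,
                               attention_mask=attention_mask).last_hidden_state
        last = attention_mask.sum(dim=1) - 1  # index of last non-padding token
        pooled = hidden[torch.arange(hidden.size(0)), last]
        return self.head(pooled).squeeze(-1)  # predicted score, e.g. BLEU

# one training step would minimize nn.functional.mse_loss(pred, bleu_labels)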

Fine-tune LM through Deep Reinforcement Learning

train.py fine-tunes the code completion model on the sampled synthetic data using reinforcement learning. You can run the following command:

python train.py --batch-size-per-replica=14 --grad-acc-steps=14 --epochs=10 --lr=5e-5 --save-freq=4000 --log-freq=500 --save_total_limit=20 --fp16 --tuning_mode=rl --model=gpt-2 --model_path=PATH_OF_FINETUNED_GPT2
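
The core idea, simplified: instead of a single delayed reward at the end of generation, every sampled token receives an immediate reward from the evaluator, and the LM is updated with a reward-weighted log-likelihood. The sketch below is a conceptual reduction of that objective, not the actual train.py code; how the per-step rewards are derived from the evaluator (e.g., as the change in its predicted score after each token) is an assumption here and is specified precisely in the paper.

# conceptual policy-gradient loss with per-token immediate rewards
import torch
import torch.nn.functional as F

def rl_loss(logits, sampled_ids, rewards):
    # logits:      (batch, seq, vocab) LM logits over its own samples
    # sampled_ids: (batch, seq) tokens the LM actually sampled
    # rewards:     (batch, seq) immediate reward per step from the evaluator
    log_probs = F.log_softmax(logits, dim=-1)
    token_logp = log_probs.gather(-1, sampled_ids.unsqueeze(-1)).squeeze(-1)
    return -(rewards * token_logp).mean()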

Model testing

In the Model folder, code/evaluator.py evaluates the trained model by comparing its predictions against the reference answers:

python evaluator.py -a=evaluator/answers.json -p=evaluator/predictions.json
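
The file layout of answers.json and predictions.json is not documented here, so the sketch below only illustrates the kind of comparison such a script typically performs (token-level accuracy over parallel lists of token strings); the field layout is an assumption.

# illustrative token-level accuracy between predictions and answers
import json

answers = json.load(open("evaluator/answers.json"))        # assumed: list of token strings
predictions = json.load(open("evaluator/predictions.json"))

total = correct = 0
for ref, hyp in zip(answers, predictions):
    for r, h in zip(ref.split(), hyp.split()):
        total += 1
        correct += (r == h)
print(f"token accuracy: {correct / total:.2%}")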

Requirements

Install the transformers library from source (the code in this repository is developed on top of transformers v4.16.1):

cd transformers
pip install -e .

Install the other requirements:

torch == 1.9.1
tqdm == 4.64.0
python 3.7
javalang == 0.13.0
pyext==0.7
deepspeed