This is the repository for our EMNLP 2021 submission, APIrecX.
Use the following command to install the environment required by APIrecX:
conda env create -f pytorch-gpu.yaml
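After the environment has been created, activate it before running any of the commands below. The environment name is defined inside pytorch-gpu.yaml; pytorch-gpu is only an assumed name here:
conda activate pytorch-gpu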
To pre-train APIrecX, first clone this repository:
git clone https://github.com/anonymous98514/APIrecX.git
Then enter the pre-training folder and start pre-training:
cd pretrain
python -u main.py --train_corpus {raw data path} --vocab_file {bpe tokenizer vocabulary} --batch_size 32 --pretrained_sp_model {bpe tokenizer model} --local_rank 1 --n_layers 6 --lr 1.5e-4 --n_attn_heads 8 --epochs 15 --max_seq_len 512 --hidden 256 --ffn_hidden 512 --model_output_path {output path} --pretrain
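As a concrete illustration, a fully specified pre-training command might look like the following. The file paths are placeholders for your own raw API corpus and BPE tokenizer artifacts, not files shipped with this repository:
python -u main.py --train_corpus ./data/api_corpus.txt --vocab_file ./bpe/vocab.txt --batch_size 32 --pretrained_sp_model ./bpe/bpe.model --local_rank 1 --n_layers 6 --lr 1.5e-4 --n_attn_heads 8 --epochs 15 --max_seq_len 512 --hidden 256 --ffn_hidden 512 --model_output_path ./output/pretrained --pretrain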
Use the following commands to quickly start APIrecX fine-tuning:
cd finetune
python -u train.py --epoch 15 --lr 2e-3 --weight_decay 1e-8 --batch_size 8 --sample {sample ratio} --max_seq_len 512 --device_index 0 --k 10 --is_save False --boundary {beam size}
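For example, with an assumed sample ratio of 0.5 and a beam size of 10 (both values are illustrative, not recommended settings):
python -u train.py --epoch 15 --lr 2e-3 --weight_decay 1e-8 --batch_size 8 --sample 0.5 --max_seq_len 512 --device_index 0 --k 10 --is_save False --boundary 10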
To run the LSTM baseline, enter the lstm folder:
cd lstm
LSTM baseline training:
python train.py --epoch 30 --lr 5e-3 --weight_decay 1e-8 --hidden_size 128 --batch_size 300 --num_layers 2 --sample {sample ratio} --max_seq_len 128 --device_index 1 --k 10 --boundary 20 --is_save True --mode pretrain
LSTM baseline test:
python train.py --epoch 1 --sample {sample ratio} --max_seq_len 128 --device_index 1 --k 10 --boundary 20 --is_save False --mode train
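For example, assuming a sample ratio of 0.5 (an illustrative value only), the LSTM test command becomes:
python train.py --epoch 1 --sample 0.5 --max_seq_len 128 --device_index 1 --k 10 --boundary 20 --is_save False --mode train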
To run the n-gram baseline, enter the ngram folder:
cd ngram
N-gram baseline training:
python train_ngram_baseline.py --mode pretrain --device_index 1 --sample {sample ratio} --epoch 30 --batch_size 10000 --lr 5e-3 --weight_decay 1e-8 --max_seq_len 128 --is_save True --domain jdbc
N-gram baseline test:
python train_ngram_baseline.py --mode fine_tune --device_index 1 --sample {sample ratio} --epoch 1 --batch_size 100 --max_seq_len 128 --is_save False --domain jdbc
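For example, assuming a sample ratio of 0.5 (illustrative only), the n-gram test command becomes:
python train_ngram_baseline.py --mode fine_tune --device_index 1 --sample 0.5 --epoch 1 --batch_size 100 --max_seq_len 128 --is_save False --domain jdbc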
PRs accepted.
MIT © Richard McRichface