
Semiparametric Token-Sequence Co-Supervision

This is the official codebase of Semiparametric Token-Sequence Co-Supervision. The repository provides the training and inference code for our main experiments:

  1. Train/inference NTP+NSP (Ours)
  2. Train/inference NTP (baseline)

This repository is based on the llama-recipes repository from Meta. Huge thanks to the contributors!

Requirements

Virtual Environment

To run the examples, make sure to install the requirements using

# python 3.9 or higher recommended
pip install -r requirements.txt

Please note that the above requirements.txt installs PyTorch 2.0.1. If you want to run FSDP + PEFT, please make sure to install the PyTorch nightlies, for example as shown below.
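A minimal sketch of a nightly install, assuming a CUDA 11.8 environment (adjust the index URL to match your CUDA version):

# example: install a PyTorch nightly build for CUDA 11.8
pip3 install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu118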

GPU Resource

We recommend a node with 8 NVIDIA A100 80GB GPUs for training and a single NVIDIA A6000 GPU for inference in order to reproduce our experiments. However, you can follow most of the fine-tuning strategies used in the llama-recipes repository (we do not support the PEFT method yet).

Data

We provide the filtered data used to train Self-RAG.

1. Train/inference NTP+NSP (Ours)

1.1. Train

We provide training code in which both Emb_seq and Gen are initialized from the Llama-2 7B HF checkpoint:

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --nnodes 1 --master_port=29100 --nproc_per_node 8 run_train.py --dist_checkpoint_folder ntp_nsp_cosupervision

We also provide the resulting models for both Emb_seq and Gen.

1.2. Inference

You should either train the model yourself or download it directly from Hugging Face. Modify the --dataset argument to experiment on different datasets. If you trained the model locally and want to use it for inference, set --dist_checkpoint_root_folder and --dist_checkpoint_folder to the path of your trained model (the default is model_checkpoints/ntp_nsp_cosupervision-meta-llama/Llama-2-7b-hf); see the example after the command below.

CUDA_VISIBLE_DEVICES=0 accelerate launch run_inference_ntp_nsp.py --dataset kilt_hotpotqa --dist_checkpoint_folder ntp_nsp_cosupervision --ndocs 100
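For example, if you trained locally with the default output layout above, a command along these lines points inference at your checkpoint (the folder names are illustrative; adjust them to your own run):

# example only: pass an explicit local checkpoint path
CUDA_VISIBLE_DEVICES=0 accelerate launch run_inference_ntp_nsp.py --dataset kilt_hotpotqa --dist_checkpoint_root_folder model_checkpoints --dist_checkpoint_folder ntp_nsp_cosupervision --ndocs 100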

If your local model path is not specified correctly, the code will automatically download the Hugging Face checkpoint we provide and proceed with the inference stage.

2. Train/inference NTP (baseline)

As above, both Emb_seq and Gen are initialized from the Llama-2 7B HF checkpoint.

2.1. Train Emb_seq

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --nnodes 1 --master_port=29100 --nproc_per_node 8 run_train.py --single --dist_checkpoint_folder emb_single_ntp_singlesupervision

2.2. Train Gen

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --nnodes 1 --master_port=29100 --nproc_per_node 8 run_train_genonly.py --dist_checkpoint_folder gen_ntp_singlesupervision

We also provide the resulting models for both Emb_seq and Gen.

2.3. Inference

Specify --dist_checkpoint_folder and --ret_checkpoint_folder as the local model checkpoint paths for Gen and Emb_seq, respectively. (If your local model path is not specified correctly, the code will automatically download the Hugging Face checkpoints we provide and proceed with the inference stage.) Modify the --dataset argument to experiment on different datasets.

CUDA_VISIBLE_DEVICES=0 accelerate launch run_inference_ntp.py --dataset kilt_hotpotqa --dist_checkpoint_folder gen_ntp_singlesupervision --ret_checkpoint_folder emb_single_ntp_singlesupervision --ndocs 100 --retriever llama --single
