DBKD-PLM

  • Codebase for the ACL 2023 long paper *Bridging the Gap between Decision and Logits in Decision-based Knowledge Distillation for Pre-trained Language Models*.
  • arXiv version: https://arxiv.org/abs/2306.08909
  • Our code is currently based on the Hugging Face transformers package and its run_glue scripts.

Requirements

See requirements.txt

Usage

To perform the distillation method proposed in our paper, follow the steps below. Example scripts are included in the scripts/ folder.

  • Step 1: Generate the decision-to-logits look-up table before distillation by running python utils/decision2prob.py; it writes the table to utils/ptable_mc10.pkl. (An illustrative sketch of what such a table captures follows this list.)
  • Step 2: Run teacher inference on a dataset with bash scripts/inference_teacher.sh.
    • If you do not have a teacher model, fine-tune one first with bash scripts/finetune_teacher.sh.
    • If you already have your own teacher model, set the TEACHER_PATH variable to its path.
  • Step 3: Run knowledge distillation with bash scripts/distil.sh. It automatically performs hyper-parameter search and evaluation. If you do not want to search hyper-parameters, we recommend setting ALPHA=0.7, TEMP=1, and SIGMA=1. (A generic loss sketch using ALPHA and TEMP also follows this list.)
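
For intuition about Step 1, the sketch below shows one simplified way a decision-to-probability look-up table could be built: sample candidate class-probability vectors from a flat Dirichlet prior, weight them by the likelihood of an observed decision-count pattern, and store the posterior-mean probability vector for each pattern. This is a hypothetical illustration under strong assumptions (a binary task, i.i.d. categorical decisions, and made-up names such as build_table, NUM_QUERIES, and ptable_example.pkl), not the estimator implemented in utils/decision2prob.py.

```python
# Hypothetical sketch only; the repository's actual table is produced by
# utils/decision2prob.py and saved to utils/ptable_mc10.pkl.
import itertools
import pickle

import numpy as np

NUM_CLASSES = 2        # assumed binary task
NUM_QUERIES = 10       # assumed number of teacher decisions per example
NUM_SAMPLES = 100_000  # Monte Carlo samples from the prior

rng = np.random.default_rng(0)

def build_table():
    """Map each decision-count pattern to an estimated class-probability vector."""
    # Candidate probability vectors drawn from a flat Dirichlet prior.
    probs = rng.dirichlet(np.ones(NUM_CLASSES), size=NUM_SAMPLES)
    table = {}
    for counts in itertools.product(range(NUM_QUERIES + 1), repeat=NUM_CLASSES):
        if sum(counts) != NUM_QUERIES:
            continue
        # Multinomial likelihood of the observed counts (constant factor dropped).
        lik = np.prod(probs ** np.array(counts), axis=1)
        # Posterior-mean probability vector given the observed decisions.
        table[counts] = (lik[:, None] * probs).sum(axis=0) / lik.sum()
    return table

if __name__ == "__main__":
    with open("ptable_example.pkl", "wb") as f:
        pickle.dump(build_table(), f)
```

The recovered probability vectors then play the role of the teacher's soft labels during distillation.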
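
For Step 3, ALPHA and TEMP play their usual knowledge-distillation roles: ALPHA weights the soft-label term against the hard-label cross-entropy, and TEMP is the softmax temperature. The snippet below is a generic KD objective in that spirit, shown only to clarify the hyper-parameters; it is not the repository's exact loss (in particular, SIGMA from scripts/distil.sh is omitted).

```python
# Generic knowledge-distillation objective (illustrative; not the exact loss
# implemented behind scripts/distil.sh).
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_probs, labels, alpha=0.7, temp=1.0):
    """Interpolate hard-label cross-entropy with a KL term against the
    soft labels recovered from the teacher's decisions."""
    ce = F.cross_entropy(student_logits, labels)
    # Temperature is applied to the student only, since the teacher side is
    # already a probability vector taken from the look-up table.
    log_p_student = F.log_softmax(student_logits / temp, dim=-1)
    kd = F.kl_div(log_p_student, teacher_probs, reduction="batchmean") * temp ** 2
    return alpha * kd + (1.0 - alpha) * ce
```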
