Revisiting Intermediate Layer Distillation for Compressing Language Models: An Overfitting Perspective
Jongwoo Ko,
Seungjoon Park,
Minchan Jeong,
Sukjin Hong,
Euijai Ahn,
Du-Seong Chang,
Se-Young Yun
```shell
pip install -r requirements.txt
python download_glue_data.py
```
For the BERT experiments, you need to prepare both a teacher model and a student model. Download them from the following links:
- BERT-base : https://storage.googleapis.com/bert_models/2020_02_20/uncased_L-12_H-768_A-12.zip
- BERT-small : https://storage.googleapis.com/bert_models/2020_02_20/uncased_L-6_H-768_A-12.zip
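The links above point to zip archives of the checkpoints. A dry-run sketch of fetching and unpacking them (the commands are only printed here, not executed; drop the `echo` to actually download):

```shell
# Dry-run sketch: print the fetch/unzip commands for both checkpoints.
for url in \
  "https://storage.googleapis.com/bert_models/2020_02_20/uncased_L-12_H-768_A-12.zip" \
  "https://storage.googleapis.com/bert_models/2020_02_20/uncased_L-6_H-768_A-12.zip"
do
  echo wget "$url"
  echo unzip "$(basename "$url")"
done
```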
Then, first fine-tune the teacher model, and then conduct ILD. Only `pytorch_model.bin` and `config.json` are needed.
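Since only those two files are needed, a minimal pre-flight check can catch a broken setup early. This helper is illustrative, not part of the repo; the function name and paths are assumptions:

```shell
# Illustrative pre-flight check (not part of the repo): verify that a
# model directory contains the two files ILD actually needs.
check_model_dir() {
  dir="$1"
  for f in pytorch_model.bin config.json; do
    if [ ! -f "$dir/$f" ]; then
      echo "missing: $dir/$f"
      return 1
    fi
  done
  echo "ok: $dir"
}
```

Run it as `check_model_dir path/to/teacher` after unzipping and fine-tuning.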
```shell
bash run_ft_standard.sh ${task_name}
```
```shell
bash scripts/standard_glue_truncated_bert.sh 0 ${task_name}
bash scripts/standard_glue_truncated_bert.sh 1 ${task_name}
bash scripts/standard_glue_truncated_bert.sh 2 ${task_name}
bash scripts/standard_glue_truncated_bert.sh 3 ${task_name}
```
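The four commands above differ only in their first argument; assuming that argument simply indexes a setting inside the script (as the commands suggest), they can be wrapped in a loop. A dry-run sketch (it prints the commands rather than executing them; the task name is an example, remove the `echo` to actually run):

```shell
# Dry-run sketch: print the four ILD commands for one task.
task_name=RTE  # example GLUE task; substitute your own
for idx in 0 1 2 3; do
  echo bash scripts/standard_glue_truncated_bert.sh "$idx" "$task_name"
done
```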