This repository contains the implementation of our paper *Length-Adaptive Distillation: Customizing Small Language Model for Dynamic Token Pruning*, published in Findings of EMNLP 2023.
Our implementation is mainly built on transformers. We use the same data augmentation code provided by TinyBERT, and we follow LAT to calculate the speedup ratio.
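
For intuition on what a length-based speedup ratio measures, below is a minimal sketch that estimates it from per-layer theoretical FLOPs, in the spirit of LAT's length-based accounting. This is not the repo's actual code: the function names, the simplified FLOPs formula, and the example pruning schedule are all our own assumptions for illustration.

```python
# Minimal sketch (not the repo's actual code): estimating a speedup ratio
# from per-layer transformer FLOPs under token pruning. The FLOPs formula
# below is a simplification that counts only the dominant matmuls.

def layer_flops(seq_len: int, hidden: int = 768, ffn: int = 3072) -> float:
    """Approximate FLOPs of one transformer layer at a given sequence length.

    Counts QKV + output projections (4 * L * H^2), attention
    score/context products (2 * L^2 * H), and the two FFN projections
    (2 * L * H * F). Layer norms and constant factors are ignored.
    """
    attn_proj = 4 * seq_len * hidden * hidden
    attn_matmul = 2 * seq_len * seq_len * hidden
    ffn_proj = 2 * seq_len * hidden * ffn
    return attn_proj + attn_matmul + ffn_proj


def speedup_ratio(full_len: int, kept_lens: list[int]) -> float:
    """Speedup = FLOPs of the unpruned model / FLOPs with token pruning.

    kept_lens[i] is the number of tokens retained at layer i; the
    unpruned baseline keeps full_len tokens at every layer.
    """
    full = sum(layer_flops(full_len) for _ in kept_lens)
    pruned = sum(layer_flops(n) for n in kept_lens)
    return full / pruned


# Hypothetical example: a 6-layer student progressively pruning a
# 128-token input.
print(speedup_ratio(128, [128, 112, 96, 80, 64, 48]))
```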