iwslt

Here are 2 public repositories matching this topic...

vineeths96 / Compressed-Transformers

In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization aware training of the linear layers and demonstrate the performance for 8 bits, 4 bits, 2 bits and 1 bit (binary) quantization.

deep-learning pytorch transformer model-compression federated-learning iwslt edge-ml

Updated May 14, 2021
Python

sutd-visual-computing-group / LS-KD-compatibility

Star

[ICML 2022] This work investigates the compatibility between label smoothing (LS) and knowledge distillation (KD). We suggest to use an LS-trained teacher with a low-temperature transfer to render high performance students.

machine-translation pytorch imagenet knowledge-distillation label-smoothing iwslt cub200-2011 systematic-diffusion

Updated Dec 8, 2022
Python

Improve this page

Add a description, image, and links to the iwslt topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the iwslt topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly