KD with pretrained students. #429
Asked by liguopeng0923 in Q&A
Hi @yoshitomo-matsubara, I would like to know whether there are any distillation methods for fine-tuning pretrained students, rather than training them from scratch.
Answered by yoshitomo-matsubara (Dec 1, 2023):
For NLP tasks (e.g., GLUE), the following example and paper fine-tune pretrained BERT-Base as a student, using fine-tuned BERT-Large as a teacher; see the Colab examples linked at https://github.com/yoshitomo-matsubara/torchdistill#glue
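To illustrate that setup, here is a minimal sketch in plain PyTorch with Hugging Face Transformers (not torchdistill's actual config-driven API) of a distillation step where the student starts from a pretrained checkpoint instead of random weights. The model names, label count, and hyperparameters are placeholders; in practice the teacher would be a checkpoint already fine-tuned on the target task.

```python
# Minimal KD fine-tuning sketch (assumptions: Hugging Face Transformers,
# Hinton-style soft-target loss; NOT torchdistill's config-driven API).
import torch
import torch.nn.functional as F
from transformers import AutoModelForSequenceClassification

# Teacher: placeholder for a checkpoint already fine-tuned on the task.
teacher = AutoModelForSequenceClassification.from_pretrained(
    "bert-large-uncased", num_labels=2).eval()
# Student: loaded from a pretrained checkpoint, not randomly initialized.
student = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)

optimizer = torch.optim.AdamW(student.parameters(), lr=2e-5)
temperature, alpha = 4.0, 0.5  # illustrative KD hyperparameters

def kd_step(batch):
    """One fine-tuning step: hard-label CE + soft-label KL to the teacher."""
    with torch.no_grad():
        t_logits = teacher(input_ids=batch["input_ids"],
                           attention_mask=batch["attention_mask"]).logits
    out = student(**batch)  # batch includes "labels", so out.loss is the CE term
    kl = F.kl_div(F.log_softmax(out.logits / temperature, dim=-1),
                  F.softmax(t_logits / temperature, dim=-1),
                  reduction="batchmean") * temperature ** 2
    loss = alpha * out.loss + (1.0 - alpha) * kl
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

The only change from training a student from scratch is that `from_pretrained` replaces random initialization; the distillation objective itself is unchanged.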
For instance, the following papers present methods to train modified, pretrained image classification and object detection models as students (only the first layers are modified and randomly initialized), learning from the original pretrained teacher models; a sketch of this setup follows the references.
https://ieeexplore.ieee.org/abstract/document/9265295/
https://arxiv.org/abs/2007.15818
https://openaccess.thecvf.com/content/WACV2022/html/Matsubara_Supervised_Compression_for_Resource-Constrained_Edge_Computing_Systems_WACV_2022_paper.html
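Below is a minimal sketch of that idea under stated assumptions: torchvision's ResNet-50 as both teacher and student, a hypothetical slimmer stem replacing the student's first layers, and a simple MSE feature-matching loss. The linked papers' exact architectures and training recipes differ.

```python
# Minimal head-distillation sketch (assumptions: torchvision ResNet-50,
# MSE feature matching; the papers' exact recipes differ).
import torch
import torch.nn as nn
from torchvision.models import resnet50, ResNet50_Weights

teacher = resnet50(weights=ResNet50_Weights.IMAGENET1K_V1).eval()
student = resnet50(weights=ResNet50_Weights.IMAGENET1K_V1)

# Hypothetical replacement for the student's stem + layer1: slimmer and
# randomly initialized, but shaped to match the teacher's layer1 output
# (N, 256, H/4, W/4) so all later pretrained layers can be kept as-is.
new_stem = nn.Sequential(
    nn.Conv2d(3, 64, kernel_size=3, stride=2, padding=1),
    nn.BatchNorm2d(64),
    nn.ReLU(inplace=True),
    nn.Conv2d(64, 256, kernel_size=3, stride=2, padding=1),
    nn.BatchNorm2d(256),
)

def teacher_features(x):
    """Frozen teacher's features after its conv stem + layer1."""
    with torch.no_grad():
        h = teacher.maxpool(teacher.relu(teacher.bn1(teacher.conv1(x))))
        return teacher.layer1(h)

# Train ONLY the new, randomly initialized first layers to mimic the
# teacher's features; the student's remaining pretrained layers are untouched.
optimizer = torch.optim.Adam(new_stem.parameters(), lr=1e-3)
mse = nn.MSELoss()

def distill_step(images):
    loss = mse(new_stem(images), teacher_features(images))
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

def student_forward(x):
    """Inference path: new stem, then the student's original later layers."""
    h = student.layer2(new_stem(x))
    h = student.layer4(student.layer3(h))
    h = torch.flatten(student.avgpool(h), 1)
    return student.fc(h)
```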