Generic-to-Specific Distillation of Masked Autoencoders, CVPR 2023

🔥 Accepted by CVPR 2023!

Introduction

Large vision Transformers (ViTs) driven by self-supervised pre-training mechanisms achieved unprecedented progress. Lightweight ViT models limited by the model capacity, however, benefit little from those pre-training mechanisms. Knowledge distillation defines a paradigm to transfer representations from large (teacher) models to small (student) ones. However, the conventional single-stage distillation easily gets stuck on task-specific transfer, failing to retain the task-agnostic knowledge crucial for model generalization. In this study, we propose generic-to-specific distillation (G2SD), to tap the potential of small ViT models under the supervision of large models pre-trained by masked autoencoders. In generic distillation, decoder of the small model is encouraged to align feature predictions with hidden representations of the large model, so that task-agnostic knowledge can be transferred. In specific distillation, predictions of the small model are constrained to be consistent with those of the large model, to transfer task-specific features which guarantee task performance. With G2SD, the vanilla ViT-Small model respectively achieves 98.7%, 98.1% and 99.3% the performance of its teacher (ViT-Base) for image classification, object detection, and semantic segmentation, setting a solid baseline for two-stage vision distillation.

Model weights and logs

You could download model weights and logs here: Google drive.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
G2SD		G2SD
G2SD_det_dis		G2SD_det_dis
G2SD_seg_dis		G2SD_seg_dis
fig		fig
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

G2SD

G2SD

G2SD_det_dis

G2SD_det_dis

G2SD_seg_dis

G2SD_seg_dis

fig

fig

.gitignore

.gitignore

README.md

README.md

Repository files navigation

Generic-to-Specific Distillation of Masked Autoencoders, CVPR 2023

Introduction

Model weights and logs

About

Releases

Packages

Contributors 2

Languages

pengzhiliang/G2SD

Folders and files

Latest commit

History

Repository files navigation

Generic-to-Specific Distillation of Masked Autoencoders, CVPR 2023

Introduction

Model weights and logs

About

Resources

Stars

Watchers

Forks

Languages