I am an Assistant Professor at Westlake University. I did my PhD at the SMILE Lab of Northeastern University (Boston, USA). Before that, I spent seven wonderful years at Zhejiang University (Hangzhou, China), where I earned my B.E. and M.S. degrees.
I am interested in a variety of topics in computer vision and machine learning. My research centers on efficient deep learning (a.k.a. model compression), ranging from the most common task, image classification (GReg, Awesome-PaI, TPP), to neural style transfer (Collaborative-Distillation), single image super-resolution (ASSL, SRP), and 3D novel view synthesis (R2L, MobileR2L).
I do my best to make my research easily reproducible.
🔥 NEWS: [NeurIPS'23] We are excited to present SnapFusion, a super-efficient mobile diffusion model that can do text-to-image generation in less than 2s on mobile devices! [Arxiv] [Webpage]
🔥 NEWS: [CVPR'23] Check out our new blazing-fast neural rendering model for mobile devices: MobileR2L (the lightweight version of R2L), which can render 1008x756 images at 56 fps on iPhone 13. [Arxiv] [Code]
🔥 NEWS: [ICLR'23] Check out the very first trainability-preserving filter pruning method: TPP [Arxiv] [Code]
🔥 NEWS: Check out our preprint that deciphers the confusing benchmark situation in neural network (filter) pruning: [Arxiv] [Code]
✨ NEWS: Check out our investigation of what makes a "good" data augmentation in knowledge distillation, in NeurIPS 2022: [Webpage] [Code]
✨ NEWS: Check out our Efficient NeRF project via distillation, in ECCV 2022: [R2L]