-
Data Science and Analytic Thrust, Information Hub, HKUST(GZ)
- GuangZhou
- https://www.zhihu.com/people/peijieDong
- https://pprp.github.io
- https://scholar.google.com/citations?user=TqS6s4gAAAAJ
VIT
Implementation of vision transformer. ⭐⭐⭐
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
PyTorch implementation for Vision Transformer[Dosovitskiy, A.(ICLR'21)] modified to obtain over 90% accuracy FROM SCRATCH on CIFAR-10 with small number of parameters (= 6.3M, originally ViT-B has 8…
CVNets: A library for training computer vision networks
A PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes"
Implementation of the Swin Transformer in PyTorch.
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
This repo contains the code of "ConTNet: Why not use convolution and transformer at the same time?"
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
Two simple and effective designs of vision transformer, which is on par with the Swin transformer
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Reformer, the efficient Transformer, in Pytorch
Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)
A treasure chest for visual classification and recognition powered by PaddlePaddle
A modular PyTorch library for vision transformer models
LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference
Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition





