Implementation of Vision Transformers in Flax
Updated Oct 12, 2020 - Python
TensorFlow implementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
TensorFlow implementation of the Vision Transformer (Bye-Bye Convolutions)
PyTorch version of the Vision Transformer (ViT) with pretrained models. This is part of the CASL (https://casl-project.github.io/) and ASYML projects.
An implementation of multiple notable attention mechanisms using TensorFlow 2
CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)
TensorFlow 2.x implementation of the Vision Transformer model
Implementation of Convolutional enhanced image Transformer
SiT: Self-supervised vision Transformer
Compact Convolution Transformers
PyTorch implementation of ViT on CIFAR-10.
Code for the top-1 submission to the VCS AY 2020-2021 contest (the Vision and Cognitive Service class at the University of Padova, Italy).
PyTorch implementation of "Segmenter: Transformer for Semantic Segmentation" by Strudel et al. (2021)
🥈50th place in Bristol-Myers Squibb – Molecular Translation competition🥈
Short training script for ViT, Swin-T, CvT, MsViT, and DINO
Simple Implementation of Vision Transformer (https://openreview.net/pdf?id=YicbFdNTTy)
A PyTorch Implementation of ViT (Vision Transformer)
Code for ViTAS: Vision Transformer Architecture Search
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".