Stars

transformer

19 repositories

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,738 2,213 Updated Jul 24, 2024

A collection of papers on transformers in vision. Awesome Transformer with Computer Vision (CV)

3,565 400 Updated Jan 7, 2025

Real Transformer TeraFLOPS on various GPUs

Jupyter Notebook 913 116 Updated Jan 9, 2024

🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolutions, helpful for understanding the papers. ⭐⭐⭐

Python 12,167 1,957 Updated Dec 6, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,458 5,127 Updated Mar 9, 2026

⭐⭐⭐FightingCV Paper Reading, which helps you understand the most advanced research work in an easier way 🍀 🍀 🍀

Shell 822 89 Updated Apr 20, 2023

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Python 24,983 3,485 Updated Feb 11, 2026

This is a collection of our NAS and Vision Transformer work.

Python 1,823 239 Updated Jul 25, 2024

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

Python 8,232 1,345 Updated Jul 23, 2024

This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.

Python 229 38 Updated Jul 4, 2022

EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPS 2022]

Python 1,108 94 Updated Aug 13, 2023

Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"

Python 144 10 Updated Jul 26, 2022

[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...

Jupyter Notebook 489 38 Updated Jun 2, 2023

[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification

Jupyter Notebook 649 80 Updated Jul 11, 2023

CMT: PyTorch implementation of our CVPR 2022 paper "CMT: Convolutional Neural Networks Meet Vision Transformers" (https://arxiv.org/pdf/2107.06263.pdf).

Python 103 16 Updated Jul 1, 2022

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 157,586 32,331 Updated Mar 8, 2026

PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].

Python 164 18 Updated Jul 12, 2023

MetaFormer Baselines for Vision (TPAMI 2024)

Python 495 31 Updated Jun 1, 2024