Simple prototypes of SOTAs with dummy data for better vision of model architecture itself.
Other versions might work as well.
- Python 3.7.10
- Pytorch 1.11.0
-
1-1. Vision Transformer (ViT)
-
1-2. Swin Transformer (Swin-T)
-
1-3. Resnet
- Paper - Deep Residual Learning for Image Recognition (2015)
- Ipynb - Resnet.ipynb
-
1-4. VGG
-
2-1. Detection Transformer (DETR)
- Paper - End-to-End Object Detection with Transformers (2020)
- Ipynb - DETR.ipynb
-
2-2. Single Shot Detector (SSD)
- Paper - SSD: Single Shot MultiBox Detector (2015)
- Ipynb - SSD300.ipynb
- 3-1. U-Net
- 4-1. Masked Autoencoder (MAE)
- Paper - Masked Autoencoders Are Scalable Vision Learners (2021)
- Ipynb - MAE.ipynb
- 5-1. Generative Adversarial Networks (GAN)
- Paper - Generative Adversarial Networks (2014)
- Ipynb - GAN.ipynb
- 6-1. Transformer
- Paper - Attention Is All You Need (2017)
- Ipynb - Transformer.ipynb
- 0-1. Distributed Data Parallel (DDP)
- Ipynb - DDP1.0.ipynb (Single Node Single GPU)
- Ipynb - DDP2.0.ipynb (Multi Nodes Multi GPUs)