Skip to content

Commit d5ed58d

Browse files
authored
Merge pull request #533 from rwightman/pit_and_vit_update
Addition of PiT models and update/cleanup of ViT, new NFNet weight, TFDS wrapper fix, few misc fixes/updates
2 parents 1ad1645 + 37c71a5 commit d5ed58d

22 files changed

+1550
-324
lines changed

README.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -23,6 +23,22 @@ I'm fortunate to be able to dedicate significant time and money of my own suppor
2323

2424
## What's New
2525

26+
### April 1, 2021
27+
* Add snazzy `benchmark.py` script for bulk `timm` model benchmarking of train and/or inference
28+
* Add Pooling-based Vision Transformer (PiT) models (from https://github.com/naver-ai/pit)
29+
* Merged distilled variant into main for torchscript compatibility
30+
* Some `timm` cleanup/style tweaks and weights have hub download support
31+
* Cleanup Vision Transformer (ViT) models
32+
* Merge distilled (DeiT) model into main so that torchscript can work
33+
* Support updated weight init (defaults to old still) that closer matches original JAX impl (possibly better training from scratch)
34+
* Separate hybrid model defs into different file and add several new model defs to fiddle with, support patch_size != 1 for hybrids
35+
* Fix fine-tuning num_class changes (PiT and ViT) and pos_embed resizing (Vit) with distilled variants
36+
* nn.Sequential for block stack (does not break downstream compat)
37+
* TnT (Transformer-in-Transformer) models contributed by author (from https://gitee.com/mindspore/mindspore/tree/master/model_zoo/research/cv/TNT)
38+
* Add RegNetY-160 weights from DeiT teacher model
39+
* Add new NFNet-L0 w/ SE attn (rename `nfnet_l0b`->`nfnet_l0`) weights 82.75 top-1 @ 288x288
40+
* Some fixes/improvements for TFDS dataset wrapper
41+
2642
### March 17, 2021
2743
* Add new ECA-NFNet-L0 (rename `nfnet_l0c`->`eca_nfnet_l0`) weights trained by myself.
2844
* 82.6 top-1 @ 288x288, 82.8 @ 320x320, trained at 224x224
@@ -189,6 +205,7 @@ A full version of the list below with source links can be found in the [document
189205
* NFNet-F - https://arxiv.org/abs/2102.06171
190206
* NF-RegNet / NF-ResNet - https://arxiv.org/abs/2101.08692
191207
* PNasNet - https://arxiv.org/abs/1712.00559
208+
* Pooling-based Vision Transformer (PiT) - https://arxiv.org/abs/2103.16302
192209
* RegNet - https://arxiv.org/abs/2003.13678
193210
* RepVGG - https://arxiv.org/abs/2101.03697
194211
* ResNet/ResNeXt
@@ -204,6 +221,7 @@ A full version of the list below with source links can be found in the [document
204221
* ReXNet - https://arxiv.org/abs/2007.00992
205222
* SelecSLS - https://arxiv.org/abs/1907.00837
206223
* Selective Kernel Networks - https://arxiv.org/abs/1903.06586
224+
* Transformer-iN-Transformer (TNT) - https://arxiv.org/abs/2103.00112
207225
* TResNet - https://arxiv.org/abs/2003.13630
208226
* Vision Transformer - https://arxiv.org/abs/2010.11929
209227
* VovNet V2 and V1 - https://arxiv.org/abs/1911.06667

0 commit comments

Comments
 (0)