How to replicate ViT results on smaller dataset YFCC15M #162
-
Before I launch long runs on LAION-400M, I want to confirm that my hardware setup is correct. I set up ViT-B with this repo and trained it on YFCC15M with a 32K batch size and the default parameters (weight decay 0.2, lr 5e-4, 32 epochs), but I consistently get top-1 zero-shot accuracy below 10%. ViTs are expected to perform worse in lower-data regimes, but this seems far off. Has anyone encountered similar issues, or does anyone have an idea of what might have gone wrong? Any help would be highly appreciated.
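For concreteness, the setup described above corresponds to a launch roughly like the sketch below (assuming open_clip's `training.main` entrypoint and webdataset shards; the shard pattern, GPU layout, warmup, sample count, and ImageNet path are placeholders rather than the exact command used, and flag names can differ between repo versions — check `python -m training.main --help` against your checkout).

```bash
# Sketch only: an 8-process single-node launch where the per-GPU --batch-size
# times the process count gives the ~32K global batch mentioned above.
# Adapt the GPU count / per-GPU batch (or go multi-node) to what actually fits.
torchrun --nproc_per_node 8 -m training.main \
    --model ViT-B-32 \
    --train-data "/data/yfcc15m/{00000..01500}.tar" \
    --dataset-type webdataset \
    --train-num-samples 15000000 \
    --batch-size 4096 \
    --lr 5e-4 \
    --wd 0.2 \
    --epochs 32 \
    --warmup 2000 \
    --precision amp \
    --workers 8 \
    --imagenet-val /data/imagenet/val \
    --zeroshot-frequency 1 \
    --report-to tensorboard
```

One thing worth double-checking with a launch like this: `--batch-size` is per process, so the effective global batch (and therefore how well the chosen learning rate fits) depends on how many GPUs are actually in the run.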
Replies: 1 comment 1 reply
-
@kyleliang919 I have some old training logs for ViT-B/32 on CC12M that reached ~31% top-1 ImageNet-1k zero-shot accuracy. The best ResNet-50 in that setup was ~36%. CC12M is higher quality than YFCC15M, so you might want to test with that one instead. 10% does seem low, though... but I would definitely expect it to end up worse than 30%.
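One cheap way to separate an evaluation-side problem from a training-side one is to run the repo's zero-shot ImageNet evaluation on a published pretrained checkpoint. A sketch, assuming open_clip's eval-only mode, with the ImageNet path as a placeholder and the pretrained tag taken from the repo README:

```bash
# Evaluation only: with --imagenet-val set and no --train-data, training.main
# runs zero-shot ImageNet-1k classification for the given checkpoint.
python -m training.main \
    --model ViT-B-32 \
    --pretrained laion400m_e32 \
    --imagenet-val /data/imagenet/val
```

If the reference checkpoint lands near its published number, the val data and preprocessing are fine, and the sub-10% result points at the training run itself (data pipeline, loss aggregation across workers, LR schedule, etc.).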