Skip to content

How to replicate ViT results on smaller dataset YFCC15M #162

Answered by rwightman
kyleliang919 asked this question in Q&A
Discussion options

You must be logged in to vote

@kyleliang919 I have some old training logs for ViT-B/32 on cc12m and managed ~31% top-1 imagenet-1k zero shot. Best ResNet-50 in that setup was ~36%. cc12m is higher quality than yfcc15m so might want to test with that one. 10% seems low though... but I would definitely expect it to end up worse than 30%

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@kyleliang919
Comment options

Answer selected by kyleliang919
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants