Hey!,
Testing vit_base_patch32_sam_224 and vit_base_patch16_sam_224 loaded with pretrained weights, I noticed a near 0 accuracy on the ImageNet validation set.
There is even an undeterminstic behavior with predicted labels between sequential runs. Debugging the code, as far as I can tell, it seems the pretrained weights were loaded properly, however (in terms of calling all the correct and necessary methods).
To Reproduce
Creating the model with:
timm_factory.create_model('vit_base_patch16_sam_224'), pretrained=True)
Extracting the proper transform (using config and create_transform(**config)) and running on ImageNet validation set.
Thanks!