Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Imagenet Pretrained Weights #9

Open
RaivoKoot opened this issue Apr 1, 2021 · 5 comments
Open

Imagenet Pretrained Weights #9

RaivoKoot opened this issue Apr 1, 2021 · 5 comments

Comments

@RaivoKoot
Copy link

Thanks for the work! In their paper they say
For all our experiments, we adopt the “Base” ViT model architecture (Dosovitskiy et al., 2020) pretrained on ImageNet.

I know that you said the official weights trained on kinetics and such are not officially released yet. However, I am not interested in those but am actually in need of the initial weights of the network just based on ViT Imagenet pretraining. I need to train this implementation of yours starting from those. From what it looks like, you don't have weights for this implementation that come from imagenet pretraining, do you?

@jwohlwend
Copy link

+1

1 similar comment
@MohamedOsman1998
Copy link

+1

@tcapelle
Copy link

I am not sure it is possible, but ViT weights are available on timm and others.

@RaivoKoot
Copy link
Author

For now, I found this repository that includes ViT initialization (https://github.com/m-bain/video-transformers).

@tcapelle
Copy link

You can even download a pretrained ViT from huggingface: https://huggingface.co/google/vit-large-patch16-224

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants