Code for Google's ViT and complete example #2

sayakpaul · 2021-03-25T06:17:15Z

Thank you for this amazing piece of work. I was wondering if you plan to open-source the code to try out your experiments on Google's ViT (An Image is Worth ...) as well. If it's already there inside the repo, could you point me to it?

Update: I was able to use timm and make use of the ViT model it comes with:

timm_vit_model = timm.create_model('vit_large_patch16_384', pretrained=True)
timm_vit_model.eval()
roller = VITAttentionGradRollout(timm_vit_model, discard_ratio=0.9)
mask = roller(x.unsqueeze(0), label_idx)

However, I am still a bit unsure as to how to actually visualize the mask. Could you help?

The text was updated successfully, but these errors were encountered:

jacobgil · 2021-04-02T20:10:05Z

Hi,
In vit_explain.py there is an example.
Once you have the mask you can do
mask = show_mask_on_image(img, mask).

Did it work out?

sayakpaul · 2021-04-03T01:40:02Z

It does thanks.

sayakpaul changed the title ~~Code for Google's ViT~~ Code for Google's ViT and complete example Mar 26, 2021

sayakpaul closed this as completed Apr 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Code for Google's ViT and complete example #2

Code for Google's ViT and complete example #2

sayakpaul commented Mar 25, 2021 •

edited

jacobgil commented Apr 2, 2021

sayakpaul commented Apr 3, 2021

Code for Google's ViT and complete example #2

Code for Google's ViT and complete example #2

Comments

sayakpaul commented Mar 25, 2021 • edited

jacobgil commented Apr 2, 2021

sayakpaul commented Apr 3, 2021

sayakpaul commented Mar 25, 2021 •

edited