have you tried different CLIP models? #53
Comments
Have you tried it? I have the same question.
I tried ViT-L/14. You just have to change the model name in the inference code and the feature-extractor code. Adding it as an argument choice makes this easy.
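The "add an argument choice" suggestion could be sketched roughly as below. This is a hypothetical snippet, not ClipCap's actual CLI: the model names and embedding widths are taken from OpenAI's CLIP repository, and `CLIP_EMBED_DIMS` / `build_parser` are names invented here for illustration. The key point is that the prefix-mapping network's input size must match the chosen encoder's embedding width.

```python
import argparse

# Image-embedding widths of some CLIP encoders (per the OpenAI CLIP repo).
# The mapping network that projects CLIP features into GPT-2 prefix tokens
# must be sized to the selected encoder's output dimension.
CLIP_EMBED_DIMS = {
    "ViT-B/32": 512,   # default encoder in this repo
    "ViT-B/16": 512,
    "ViT-L/14": 768,
    "RN50": 1024,
}

def build_parser():
    parser = argparse.ArgumentParser()
    # Restricting --clip_model to known encoders catches typos early.
    parser.add_argument("--clip_model", default="ViT-B/32",
                        choices=sorted(CLIP_EMBED_DIMS))
    return parser

if __name__ == "__main__":
    args = build_parser().parse_args(["--clip_model", "ViT-L/14"])
    prefix_dim = CLIP_EMBED_DIMS[args.clip_model]
    # In the real feature extractor you would then do something like:
    #   model, preprocess = clip.load(args.clip_model, device=device)
    # and build the mapping network with input size `prefix_dim`.
    print(prefix_dim)  # → 768
```

Both the feature-extraction script and the inference script would need to read the same flag, so the extracted features and the trained mapping network stay consistent.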
Hi there, would you mind sharing your ViT-L/14 model checkpoints?
I would be very interested in having access to the checkpoints too! 😊
Hi @rmokady,
Thank you for your nice work, I learned a lot from it. Since the default CLIP model you are using seems to be the ViT-B32 version, I am wondering if you have tried other visual features e.g. from ViT-L or the resnet models? I can't find it mentioned in the paper. I'm trying to train a similar model at the moment and assume the features extracted from bigger vision encoders would contain more information.
Best, David