You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
Thank you for your excellent work, may I use this model "METER-CLIP16-RoBERTa fine-tuned on Flickr30k IR/TR (resolution: 384^2)" as meter_clip16_288_roberta_flickr.ckpt, why does the code report this error showing inconsistent dimensions, thank you answer my question.
The text was updated successfully, but these errors were encountered:
it seems that you are using BERT and CLIP32 as the text and image encoder while the checkpoint is based on RoBERTa and CLIP16. As in README, an example command is:
it seems that you are using BERT and CLIP32 as the text and image encoder while the checkpoint is based on RoBERTa and CLIP16. As in README, an example command is:
Hi,
Thank you for your excellent work, may I use this model "METER-CLIP16-RoBERTa fine-tuned on Flickr30k IR/TR (resolution: 384^2)" as meter_clip16_288_roberta_flickr.ckpt, why does the code report this error showing inconsistent dimensions, thank you answer my question.
The text was updated successfully, but these errors were encountered: