The model meter_clip16_288_roberta_flickr.ckpt is inconsistent with the network weight parameter dimension #25

attutude · 2022-07-01T08:55:39Z

Hi,
Thank you for your excellent work. May I use the model "METER-CLIP16-RoBERTa fine-tuned on Flickr30k IR/TR (resolution: 384^2)" as meter_clip16_288_roberta_flickr.ckpt? Why does the code report an error about inconsistent dimensions? Thank you for answering my question.

zdou0830 · 2022-07-01T16:40:52Z

It seems that you are using BERT and CLIP32 as the text and image encoders, while the checkpoint is based on RoBERTa and CLIP16. As in the README, an example command is:

python run.py with data_root=/data2/dsets/dataset num_gpus=8 num_nodes=1 task_finetune_irtr_f30k_clip_bert get_recall_metric=True per_gpu_batchsize=32 load_path=meter_f30k.ckpt clip16 text_roberta image_size=384 test_only=True
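
To see which parameters disagree, it can help to compare the tensor shapes stored in the checkpoint with the shapes the constructed model expects. The following is a minimal sketch, not part of the METER code: `find_shape_mismatches` is a hypothetical helper, and the parameter names in the two dictionaries are illustrative placeholders rather than the real keys. The shapes follow the usual sizes for CLIP ViT-B/16 vs. ViT-B/32 patch embeddings and for RoBERTa-base vs. BERT-base-uncased vocabularies.

```python
from typing import Dict, List, Tuple

Shape = Tuple[int, ...]


def find_shape_mismatches(
    ckpt_shapes: Dict[str, Shape],
    model_shapes: Dict[str, Shape],
) -> List[Tuple[str, Shape, Shape]]:
    """Return (name, checkpoint_shape, model_shape) for shared keys whose shapes differ."""
    mismatches = []
    # Only compare parameters present on both sides, in a stable order.
    for name in sorted(ckpt_shapes.keys() & model_shapes.keys()):
        if ckpt_shapes[name] != model_shapes[name]:
            mismatches.append((name, ckpt_shapes[name], model_shapes[name]))
    return mismatches


# Illustrative shapes only; the parameter names are hypothetical.
ckpt_shapes = {  # checkpoint built with CLIP16 + RoBERTa
    "vit.patch_embed.weight": (768, 3, 16, 16),
    "text.word_embeddings.weight": (50265, 768),
    "head.weight": (1, 1536),
}
model_shapes = {  # model built with CLIP32 + BERT
    "vit.patch_embed.weight": (768, 3, 32, 32),
    "text.word_embeddings.weight": (30522, 768),
    "head.weight": (1, 1536),
}

for name, ckpt_shape, model_shape in find_shape_mismatches(ckpt_shapes, model_shapes):
    print(f"{name}: checkpoint {ckpt_shape} vs model {model_shape}")
```

With PyTorch, the checkpoint side could be gathered with something like `{k: tuple(v.shape) for k, v in torch.load(path, map_location="cpu")["state_dict"].items()}`, assuming the file is a Lightning-style checkpoint that stores its weights under a `"state_dict"` key. The model side would come from `model.state_dict()` in the same way.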

attutude · 2022-07-20T07:10:46Z

Thank you for your answer. Could you please publish the weights of the merged attention model? Thank you very much.

zdou0830 · 2022-07-20T07:13:14Z

Hi, see #21

attutude · 2022-07-20T07:14:44Z

OK, thanks

zdou0830 closed this as completed Jul 20, 2022
