This repository has been archived by the owner on Feb 16, 2022. It is now read-only.
Is there anyone who has tried to run the evaluation with the provided multi_task_model.bin model?
I obtained
COCO (5K test set), R@{1 | 5 | 10}, IR: image retrieval, TR: text retrieval
IR: 32.979 | 61.911 | 74.082
TR: 14.62 | 32.18 | 39.76
Flickr30K
IR: 52.84 | 79.54 | 87.18
TR: 69.3 | 89 | 94
For what it's worth, these are not on par with the results reported in the 12-in-1 paper.
I understand that there's room for improvement on TR, as there's no hard negative mining for texts, but the IR results also seem unsatisfactory. I'm wondering if something is missing here.
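For reference, here is a minimal sketch of how R@K is commonly computed for both directions, in case a difference in the metric explains part of the gap. It is not the repository's evaluation code: the `scores` layout (images as rows, captions as columns), the `caption_to_image` mapping, and the function name `recall_at_k` are all assumptions made for illustration. IR issues one query per caption and ranks the images; TR issues one query per image and counts a hit if any of its ground-truth captions lands in the top K.

```python
def recall_at_k(scores, caption_to_image, ks=(1, 5, 10)):
    """Compute R@K (in percent) for image retrieval (IR) and text retrieval (TR).

    Assumed, hypothetical layout:
      scores[i][j]        similarity between image i and caption j
      caption_to_image[j] index of the ground-truth image for caption j
    """
    num_images = len(scores)
    num_captions = len(caption_to_image)

    # IR: each caption is a query; rank all images by descending score.
    ir_hits = {k: 0 for k in ks}
    for j in range(num_captions):
        ranked = sorted(range(num_images), key=lambda i: -scores[i][j])
        rank = ranked.index(caption_to_image[j])
        for k in ks:
            if rank < k:
                ir_hits[k] += 1

    # TR: each image is a query; rank all captions by descending score.
    # A hit means at least one ground-truth caption is in the top k.
    tr_hits = {k: 0 for k in ks}
    for i in range(num_images):
        ranked = sorted(range(num_captions), key=lambda j: -scores[i][j])
        best = min(pos for pos, j in enumerate(ranked)
                   if caption_to_image[j] == i)
        for k in ks:
            if best < k:
                tr_hits[k] += 1

    ir = {k: 100.0 * ir_hits[k] / num_captions for k in ks}
    tr = {k: 100.0 * tr_hits[k] / num_images for k in ks}
    return ir, tr


# Toy example: 2 images, 2 captions each (made-up scores, not real results).
toy_scores = [
    [0.9, 0.2, 0.3, 0.1],
    [0.1, 0.8, 0.7, 0.6],
]
toy_mapping = [0, 0, 1, 1]
toy_ir, toy_tr = recall_at_k(toy_scores, toy_mapping, ks=(1, 2))
```

With this convention, averaging TR over captions instead of images, or requiring all ground-truth captions to be in the top K, would give different numbers, so it may be worth checking which convention the evaluation script follows.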
Thanks!