You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I follow the demo and use the features generated by albef_feature_extraction to perform zero-shot cross-modal retrieval on MSCOCO. The t2i recall scores are extremely low while that of i2t score looks normal, and I don't know the answer. What's more, I found the cosine similarity even between the paired image and text is low (about 0.09).
The text was updated successfully, but these errors were encountered:
I follow the demo and use the features generated by
albef_feature_extraction
to perform zero-shot cross-modal retrieval on MSCOCO. The t2i recall scores are extremely low while that of i2t score looks normal, and I don't know the answer. What's more, I found the cosine similarity even between the paired image and text is low (about 0.09).The text was updated successfully, but these errors were encountered: