image and text retrieval #2

wendy0527 · 2023-02-07T11:50:31Z

Does VLDet support image and text retrieval? For example, my purpose is to give a text to retrieve the most matching image. If the model supports it, should I use the image embedding? Or each instance embedding? As far as I understand, should I use
proj_x = self.linear(input_x) [VLDet/vldet/modeling/roi_heads/zero_shot_classifier.py line98] as the image/instances embedding?

clin1223 · 2023-02-16T02:49:17Z

Thanks for your interests! VLDet currently does not support image and text retrieval. You can try to solve retrieval problems as the CLIP way.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

image and text retrieval #2

image and text retrieval #2

wendy0527 commented Feb 7, 2023

clin1223 commented Feb 16, 2023

image and text retrieval #2

image and text retrieval #2

Comments

wendy0527 commented Feb 7, 2023

clin1223 commented Feb 16, 2023