The official implementation for the ICCV 2023 paper "Grounded Image Text Matching with Mismatched Relation Reasoning".
vision-and-language
vision-and-language-pre-training
vision-language-dataset
vision-language-model
vision-language-learning
-
Updated
Dec 8, 2023 - Python