A list of research papers on knowledge-enhanced multimodal learning
-
Updated
Dec 8, 2022
A list of research papers on knowledge-enhanced multimodal learning
A simple open-sourced SigLIP model finetuned on Genshin Impact's image-text pairs.
Python Implementation of lexical vector embedding similarity scoring, zero-shot classification of images and n-gram based scoring to compare textual summaries
[TIP2024] The code of “Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching”
Code implementation of paper "SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval" (ACM TOMM 2024).
BSs Graduation Project implementation [Image-Text Matching]
The Unified Code of Image-Text Retrieval for Further Exploration.
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
Implementation of the "Learn No to Say Yes Better" paper.
Easy wrapper for inserting LoRA layers in CLIP.
The 3rd place solution code for the Wikipedia - Image/Caption Matching Competition on Kaggle
[ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding
Cross-modal Retrieval using Transformer Encoder Reasoning Networks (TERN). With use of Metric Learning and FAISS for fast similarity search on GPU
Image-Text Matching Model Zoo
[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”
Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)
CLIP (Contrastive Language–Image Pre-training) for Bangla.
Unofficial code of paper "Improving description-based person re-identification by multi-granularity image-text alignment." by Niu et al. (partially implemented)
Add a description, image, and links to the image-text-matching topic page so that developers can more easily learn about it.
To associate your repository with the image-text-matching topic, visit your repo's landing page and select "manage topics."