📄 A curated list of visual reasoning papers.
-
Updated
May 1, 2024 - TeX
📄 A curated list of visual reasoning papers.
NeuSyRE: A Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment
[AAAI 2023] Hierarchical ConViT with Attention-based Relational Reasoner for Visual Analogical Reasoning
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
[ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers.
Image captioning using python and BLIP
A list of research papers on knowledge-enhanced multimodal learning
[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning
[ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning
Convert RGB images of Visual-Genome dataset to Depth Maps.
Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning
An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)
Recent Papers including Neural Symbolic Reasoning, Logical Reasoning, Visual Reasoning, planning and any other topics connecting deep learning and reasoning
Learning Perceptual Inference by Contrasting
FiLM: Visual Reasoning with a General Conditioning Layer
ACRE: Abstract Causal REasoning Beyond Covariation
LaTeX files for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"
Add a description, image, and links to the visual-reasoning topic page so that developers can more easily learn about it.
To associate your repository with the visual-reasoning topic, visit your repo's landing page and select "manage topics."