Official code for paper "GRIT: Teaching MLLMs to Think with Images"
Updated Jul 26, 2025 - Python
ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO
Official implementation of "Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology"