MORE-R1: Guiding LVLM for Multimodal Object-Entity Relation Extraction via Stepwise Reasoning with Reinforcement Learning
Accepted by the 31st International Conference on Database Systems for Advanced Applications. Please refer to instruction.pdf for the detailed prompt to the expert model (GPT-4o) in Stage 1 SFT mentioned in the paper.