vlm-r1

Here are 3 public repositories matching this topic...

Solve Visual Understanding with Reinforced VLMs

reinforcement-learning vlm multimodal llm qwen deepseek-r1 grpo r1-zero vlm-r1 multimodal-r1

Skywork-R1V2 : Multimodal Hybrid Reinforcement Learning for Reasoning(最好的多模态推理)

reinforcement-learning reasoning vlm llm multimodal-understanding deepseek-r1 grpo vlm-r1 multimodal-r1 r1v skywork-r1v

Proposed fuzzy reward model with GRPO to improve VLM's abilities in crowd counting task.

reinforcement-learning vlm crowdcounting llm reward-model r1-zero vlm-r1 multimodal-r1

Add a description, image, and links to the vlm-r1 topic page so that developers can more easily learn about it.

To associate your repository with the vlm-r1 topic, visit your repo's landing page and select "manage topics."