Stars
Model merging is a highly efficient approach for long-to-short reasoning.
This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning capability.
Understanding R1-Zero-Like Training: A Critical Perspective
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.
Awesome-LLM: a curated list of Large Language Models
Solve Visual Understanding with Reinforced VLMs
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Align Anything: Training All-modality Model with Feedback
FineReason: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving
Latest Advances on System-2 Reasoning
Fully open reproduction of DeepSeek-R1
Efficient Triton implementation of Native Sparse Attention.
A curated list of awesome UI agent resources, encompassing Web, App, OS, and beyond (continually updated)
Official Repo for Open-Reasoner-Zero
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
🧑🚀 Summary of the world's best LLM resources (data processing, model training, model deployment, o1 models, MCP, small language models, vision-language models)
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
A suite of image and video neural tokenizers
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
A fork to add multimodal model training to open-r1
Witness the aha moment of VLM with less than $3.
Frontier Multimodal Foundation Models for Image and Video Understanding