Stars
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning
A project to improve skills of large language models
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Democratizing Reinforcement Learning for LLMs
An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
Official Repo for Open-Reasoner-Zero
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
MoBA: Mixture of Block Attention for Long-Context LLMs
A series of technical report on Slow Thinking with LLM
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Repo of "Quantification of Large Language Model Distillation"
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
A visuailzation tool to make deep understaning and easier debugging for RLHF training.
Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS
Retrieval and Retrieval-augmented LLMs
My learning notes/codes for ML SYS.
Recipes to train reward model for RLHF.
An Open Large Reasoning Model for Real-World Solutions
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering