Skip to content
View 26hzhang's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Highlights

  • Pro

Block or report 26hzhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Model merging is a highly efficient approach for long-to-short reasoning.

Python 22 1 Updated Mar 27, 2025

This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning ca…

Python 406 9 Updated Mar 24, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 728 31 Updated Mar 29, 2025

MMR1: Advancing the Frontiers of Multimodal Reasoning

148 4 Updated Mar 17, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,095 1,376 Updated Mar 3, 2025

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,799 171 Updated Jan 22, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.

Python 94 5 Updated Mar 10, 2025

Awesome-LLM: a curated list of Large Language Model

22,438 1,849 Updated Mar 26, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,408 271 Updated Mar 24, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,774 116 Updated Mar 27, 2025

Align Anything: Training All-modality Model with Feedback

Python 3,134 395 Updated Mar 30, 2025

FineReason: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving

Python 3 Updated Mar 3, 2025

Latest Advances on System-2 Reasoning

Python 863 33 Updated Mar 30, 2025

Fully open reproduction of DeepSeek-R1

Python 23,491 2,139 Updated Mar 30, 2025

Efficient triton implementation of Native Sparse Attention.

Python 127 6 Updated Mar 28, 2025

A curated list of of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)

178 20 Updated Mar 12, 2025

Official Repo for Open-Reasoner-Zero

Python 1,689 80 Updated Mar 5, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 661 38 Updated Mar 30, 2025

🧑‍🚀 全世界最好的LLM资料总结(数据处理、模型训练、模型部署、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.

4,532 472 Updated Mar 30, 2025

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Python 161 6 Updated Mar 20, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 15,101 1,553 Updated Mar 24, 2025
Python 171 5 Updated Feb 20, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,589 74 Updated Feb 11, 2025

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,919 73 Updated Jan 22, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,237 147 Updated Mar 20, 2025

A fork to add multimodal model training to open-r1

Python 1,143 58 Updated Feb 8, 2025

Witness the aha moment of VLM with less than $3.

Python 3,432 271 Updated Mar 1, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 682 43 Updated Mar 21, 2025
Python 493 47 Updated Mar 25, 2025
Next
Showing results