-
OpenGVLab@Shanghai AI Laboratory
- Shanghai
- http://whai362.github.io/
Highlights
- Pro
Stars
[ICLR 2025] Agent S: an open agentic framework that uses computers like a human
verl: Volcano Engine Reinforcement Learning for LLMs
Bringing BERT into modernity via both architecture changes and scaling
TransMLA: Multi-Head Latent Attention Is All You Need
Solve Visual Understanding with Reinforced VLMs
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Witness the aha moment of VLM with less than $3.
Fully open reproduction of DeepSeek-R1
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cโฆ
Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
A series of technical report on Slow Thinking with LLM
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Out-of-the-box (OOTB) GUI Agent for Windows and macOS
Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Interactive Image Generation via Generative Adversarial Networks
[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
[SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity". The MMInstruct dataset includes 973K instructions โฆ
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838