Skip to content
View whai362's full-sized avatar
๐ŸŠ
๐ŸŒฐ
๐ŸŠ
๐ŸŒฐ

Highlights

  • Pro

Block or report whai362

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2025] Agent S: an open agentic framework that uses computers like a human

Python 1,405 160 Updated Mar 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,906 589 Updated Mar 30, 2025

Bringing BERT into modernity via both architecture changes and scaling

Python 1,301 102 Updated Mar 25, 2025

TransMLA: Multi-Head Latent Attention Is All You Need

Python 223 19 Updated Mar 1, 2025

minimal-cost for training 0.5B R1-Zero

Python 674 86 Updated Mar 28, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,412 271 Updated Mar 30, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,427 1,445 Updated Mar 10, 2025

Witness the aha moment of VLM with less than $3.

Python 3,432 271 Updated Mar 1, 2025

Fully open reproduction of DeepSeek-R1

Python 23,499 2,142 Updated Mar 30, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,979 591 Updated Mar 27, 2025

s1: Simple test-time scaling

Python 6,086 711 Updated Mar 6, 2025

[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding

Python 31 1 Updated Dec 13, 2024

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cโ€ฆ

Jupyter Notebook 7,843 503 Updated Mar 28, 2025

Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Jupyter Notebook 118 8 Updated Mar 29, 2025

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 13,251 1,877 Updated Mar 29, 2025

A series of technical report on Slow Thinking with LLM

Python 604 33 Updated Mar 28, 2025

[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 3,229 324 Updated Feb 27, 2025

O1 Replication Journey

1,980 65 Updated Jan 14, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 40,508 5,759 Updated Mar 29, 2025

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,461 146 Updated Mar 27, 2025

Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation

Python 189 25 Updated Mar 11, 2024

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 3,958 389 Updated Mar 17, 2025

Interactive Image Generation via Generative Adversarial Networks

Python 3,995 588 Updated Aug 5, 2020

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 442 25 Updated Feb 10, 2025

[SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity". The MMInstruct dataset includes 973K instructions โ€ฆ

Python 48 4 Updated Nov 7, 2024

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

JavaScript 122,197 16,393 Updated Mar 18, 2025

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,402 78 Updated Sep 27, 2024
Next
Showing results