Skip to content
View li-plus's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report li-plus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Train transformer language models with reinforcement learning.

Python 12,887 1,734 Updated Mar 27, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,101 535 Updated Mar 28, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,322 683 Updated Mar 27, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 45,516 5,562 Updated Mar 27, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,790 580 Updated Mar 28, 2025

Fully open reproduction of DeepSeek-R1

Python 23,428 2,130 Updated Mar 27, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Python 40,629 5,192 Updated Oct 10, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 9,091 1,112 Updated Oct 9, 2024

A collection of resources and papers on Diffusion Models

HTML 11,584 969 Updated Aug 1, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 24,574 2,148 Updated Mar 27, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,937 188 Updated Mar 27, 2025

A guidance language for controlling large language models.

Jupyter Notebook 19,960 1,094 Updated Mar 19, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 10,224 1,809 Updated Mar 24, 2025

The Arcade Learning Environment (ALE) -- a platform for AI research.

C++ 2,247 439 Updated Feb 15, 2025

A Survey on Large Language Model-Based Game Agents

549 20 Updated Mar 26, 2025

Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

Python 3,360 255 Updated Jan 21, 2025

Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 428 54 Updated Mar 11, 2025

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++ 4,455 975 Updated Mar 28, 2025

A PyTorch native library for large model training

Python 3,503 324 Updated Mar 28, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,485 243 Updated Feb 20, 2025

Efficient Triton Kernels for LLM Training

Python 4,736 286 Updated Mar 28, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 19,464 2,073 Updated Mar 11, 2025

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,164 4,896 Updated Aug 1, 2024

An educational resource to help anyone learn deep reinforcement learning.

Python 10,715 2,301 Updated Aug 5, 2024

StarCraft II Learning Environment

Python 8,103 1,159 Updated Jul 23, 2024

A StarCraft II bot api client library for Python 3

Python 540 167 Updated Jan 11, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 82,306 61,042 Updated Mar 24, 2025
Next
Showing results