Wind0121
  • Huazhong University of Science and Technology
  • Wuhan

Stars

LLM Training

11 repositories

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 4,463 630 Updated Jul 15, 2025

A PyTorch native platform for training generative AI models

Python 5,128 736 Updated Mar 12, 2026

🚀🚀 [Large Language Model] Train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏

Python 40,994 4,953 Updated Feb 6, 2026

Minimal reproduction of DeepSeek R1-Zero

Python 12,924 1,580 Updated Feb 27, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 87,744 13,357 Updated Mar 7, 2026

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,896 369 Updated Dec 17, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,370 341 Updated Jul 12, 2025

Utilities intended for use with Llama models.

Python 7,499 1,338 Updated Feb 11, 2026

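A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, including base models, domain-specific fine-tuning and applications, datasets, and tutorials.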

22,364 2,112 Updated May 19, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19,838 3,407 Updated Mar 12, 2026

slime is an LLM post-training framework for RL Scaling.

Python 4,700 623 Updated Mar 12, 2026