mugglewei97

Follow

mugglew mugglewei97

Follow

CS bachelor at Bei Jing University of Post and Telecommunication

Alibaba & Ant Group
Shanghai

Popular repositories Loading

slime slime Public

Forked from THUDM/slime

slime is an LLM post-training framework for RL Scaling.

Python
ms-swift ms-swift Public

Forked from modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python
Megatron-LM Megatron-LM Public

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

Python
sglang sglang Public

Forked from sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
InfraTech InfraTech Public

Forked from CalvinXKY/InfraTech

分享AI Infra知识&代码练习：PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等

Jupyter Notebook