Popular repositories Loading
-
slime
slime PublicForked from THUDM/slime
slime is an LLM post-training framework for RL Scaling.
Python
-
ms-swift
ms-swift PublicForked from modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
Python
-
Megatron-LM
Megatron-LM PublicForked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Python
-
sglang
sglang PublicForked from sgl-project/sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
Python
-
vllm
vllm PublicForked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
-
InfraTech
InfraTech PublicForked from CalvinXKY/InfraTech
分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等
Jupyter Notebook
If the problem persists, check the GitHub status page or contact support.