-
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
MLIR MIT License UpdatedMar 4, 2025 -
ColossalAI Public
Forked from hpcaitech/ColossalAIMaking large AI models cheaper, faster and more accessible
Python Apache License 2.0 UpdatedMar 1, 2025 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedMar 1, 2025 -
-
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python Apache License 2.0 UpdatedMar 1, 2025 -
OLMo Public
Forked from allenai/OLMoModeling, training, eval, and inference code for OLMo
Python Apache License 2.0 UpdatedMar 1, 2025 -
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 UpdatedFeb 28, 2025 -
LLMs-from-scratch Public
Forked from rasbt/LLMs-from-scratchImplement a ChatGPT-like LLM in PyTorch from scratch, step by step
Jupyter Notebook Other UpdatedFeb 28, 2025 -
oneflow Public
Forked from Oneflow-Inc/oneflowOneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
C++ Apache License 2.0 UpdatedFeb 28, 2025 -
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License UpdatedFeb 24, 2025 -
-
minimind Public
Forked from jingyaogong/minimind🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Python Apache License 2.0 UpdatedFeb 15, 2025 -
nccl-tests Public
Forked from NVIDIA/nccl-testsNCCL Tests
Cuda BSD 3-Clause "New" or "Revised" License UpdatedFeb 6, 2025 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedFeb 6, 2025 -
ragflow Public
Forked from infiniflow/ragflowRAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Python Apache License 2.0 UpdatedFeb 6, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedFeb 6, 2025 -
horovod Public
Forked from horovod/horovodDistributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Python Other UpdatedFeb 1, 2025 -
TensorRT Public
Forked from NVIDIA/TensorRTNVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
C++ Apache License 2.0 UpdatedFeb 1, 2025 -
nccl Public
Forked from NVIDIA/ncclOptimized primitives for collective multi-GPU communication
C++ Other UpdatedJan 27, 2025 -
onediff Public
Forked from siliconflow/onediffOneDiff: An out-of-the-box acceleration library for diffusion models.
Jupyter Notebook Apache License 2.0 UpdatedJan 13, 2025 -
fairseq Public
Forked from facebookresearch/fairseqFacebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License UpdatedJan 9, 2025 -
-
-
-
-
-
-
-
-