Stars
An audio spectrum visualizer built with HTML5 Audio API
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
DeepEP: an efficient expert-parallel communication library
SGLang is a fast serving framework for large language models and vision language models.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
A latent text-to-image diffusion model
AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games
A throughput-oriented high-performance serving framework for LLMs
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed.
Tensors and dynamic neural networks in Python with strong GPU acceleration
Lab files, solutions, and notes for Computer Systems: A Programmer's Perspective, 3rd Edition (《深入理解计算机系统》)
📚 A curated list of awesome articles, videos, and other resources to learn and practice software architecture, patterns, and principles.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step