Lists (15)
Sort Name ascending (A-Z)
Stars
- All languages
- Agda
- Assembly
- Batchfile
- Bluespec
- C
- C#
- C++
- CMake
- CSS
- Clojure
- Common Lisp
- Coq
- Cuda
- Dart
- Dockerfile
- Elixir
- Emacs Lisp
- Erlang
- F*
- Go
- HTML
- Haskell
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Kotlin
- LLVM
- Lean
- Lex
- Lua
- MATLAB
- MDX
- MLIR
- Makefile
- Markdown
- Mathematica
- OCaml
- Objective-C
- P4
- PHP
- Perl
- PureBasic
- Python
- Racket
- ReScript
- Roff
- Rust
- SCSS
- Scala
- Scheme
- Shell
- Swift
- SystemVerilog
- TLA
- Tcl
- TeX
- TypeScript
- Typst
- V
- VHDL
- Verilog
- Vim Script
- Vue
- WebAssembly
- Yacc
A minimum demo for PyTorch distributed extension functionality for collectives.
A Datacenter Scale Distributed Inference Serving Framework
⭐ 【开源书籍】深入讲解内核网络、Kubernetes、ServiceMesh、容器等云原生相关技术。经历实践检验的 DevOps、SRE指南。如发现错误,谢谢提issue
A lightweight, powerful framework for multi-agent workflows
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
[TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Arash Nasr-Esfahany, Kevin Zhao, Prateesh Goyal, Mohammad Alizadeh, Thomas Anderson.
InferX is a Inference Function as a Service Platform
PerFlow-AI is a programmable performance analysis, modeling, prediction tool for AI system.
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
REPETITA: Repeatable Experiments for Performance Evaluation of Traffic-Engineering Algorithms
A lightweight data processing framework built on DuckDB and 3FS.
Main source code repository of the Tamarin prover for security protocol verification.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Analyze computation-communication overlap in V3/R1.
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling