Starred repositories
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
(Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints" (https://arxiv.org/pdf/2305.13245.pdf)
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
The Modern Vulkan Cookbook published by Packt
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Rendering glTF scenes with a ray tracer and a rasterizer (Vulkan)
DeepEP: an efficient expert-parallel communication library
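This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications in practice)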
Highly available elephant herd: HA PostgreSQL cluster using Docker
Path tracing renderer and utilities for three.js built on top of three-mesh-bvh.
Real-time PathTracing with global illumination and progressive rendering, all on top of the Three.js WebGL framework. Click here for Live Demo: https://erichlof.github.io/THREE.js-PathTracing-Rende…
Babylon.js is a powerful, beautiful, simple, and open game and rendering engine packed into a friendly JavaScript framework.
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
A template for PostgreSQL High Availability with Etcd, Consul, ZooKeeper, or Kubernetes
Ray tracing examples and tutorials using VK_KHR_ray_tracing
Fully open reproduction of DeepSeek-R1
Machine Learning Engineering Open Book
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
PlayStation 4 emulator for Windows, Linux and macOS written in C++