Starred repositories
Touying is a powerful package for creating presentation slides in Typst.
A Datacenter Scale Distributed Inference Serving Framework
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
A toolkit for making real world machine learning and data analysis applications in C++
Neptune OS: A Windows NT personality for the seL4 microkernel
C-based/Cached/Core Computer Vision Library, A Modern Computer Vision Library
Clang/LLVM/Binutils prebuilts with Full LTO, PGO and BOLT optimization.
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
A set of scripts to build LLVM and binutils
Open source API development ecosystem - https://hoppscotch.io (open-source alternative to Postman, Insomnia)
📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
Sockets, timers, resolvers, events, reactors, proactors, and thread pools for asynchronous network programming
Implementation of Peter Shirley's Ray Tracing In One Weekend book using Vulkan and NVIDIA's RTX extension.
A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture
A minimal GPU design in Verilog to learn how GPUs work from the ground up
深度学习系统笔记,包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解。
New file format for storage of large columnar datasets.
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…
how to learn PyTorch and OneFlow
A fast high compression read-only file system for Linux, Windows and macOS
ClickBench: a Benchmark For Analytical Databases