Stars
GSemSplat: Generalizable Semantic 3D Gaussian Splatting from Uncalibrated Image Pairs
Solve Visual Understanding with Reinforced VLMs
A very simple GRPO implement for reproducing r1-like LLM thinking.
Occam’s LGS: An efficient approach for Language Gaussian Splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]
A curated list for Efficient Large Language Models
🎓Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours)
Puzzles for learning Triton, play it with minimal environment configuration!
Efficient Triton Kernels for LLM Training
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.
how to optimize some algorithm in cuda.
JackonYang / hands-on-tvm
Forked from mlc-ai/notebookshands on model tuning with TVM and profile it on a Mac M1, x86 CPU, and GTX-1080 GPU.
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
FlashInfer: Kernel Library for LLM Serving
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
An archive of every iOS wallpaper officially released by Apple
Machine learning compiler based on MLIR for Sophgo TPU.
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)