Skip to content
View Cydia2018's full-sized avatar

Block or report Cydia2018

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 60 4 Updated Dec 27, 2024

Fast low-bit matmul kernels in Triton

Python 273 21 Updated Mar 26, 2025

GSemSplat: Generalizable Semantic 3D Gaussian Splatting from Uncalibrated Image Pairs

Python 5 Updated Mar 25, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,395 271 Updated Mar 24, 2025

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 795 68 Updated Mar 24, 2025

Occam’s LGS: An efficient approach for Language Gaussian Splatting

Python 22 Updated Mar 28, 2025

Explainability for Vision Transformers

Python 927 103 Updated Mar 12, 2022

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

HTML 6,923 427 Updated Mar 16, 2025

Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]

Python 760 86 Updated May 17, 2024

A curated list for Efficient Large Language Models

Python 1,570 123 Updated Mar 23, 2025

🎓Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours)

Python 6 1 Updated Mar 29, 2025

Puzzles for learning Triton, play it with minimal environment configuration!

Python 267 25 Updated Dec 3, 2024

Efficient Triton Kernels for LLM Training

Python 4,741 286 Updated Mar 28, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 11,062 1,054 Updated Mar 25, 2025
Python 90 8 Updated Sep 9, 2024

Material for gpu-mode lectures

Jupyter Notebook 4,138 418 Updated Feb 9, 2025

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 781 63 Updated Sep 4, 2024

how to optimize some algorithm in cuda.

Cuda 2,052 183 Updated Mar 26, 2025

GPTQ inference Triton kernel

Jupyter Notebook 299 22 Updated May 18, 2023

hands on model tuning with TVM and profile it on a Mac M1, x86 CPU, and GTX-1080 GPU.

Jupyter Notebook 45 9 Updated Jun 15, 2023

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

C++ 678 56 Updated Jan 21, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 2,511 261 Updated Mar 27, 2025

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

C++ 979 164 Updated Sep 19, 2024

An archive of every iOS wallpaper officially released by Apple

1,047 86 Updated Sep 22, 2023

Machine learning compiler based on MLIR for Sophgo TPU.

C++ 697 173 Updated Mar 24, 2025

PLCT实验室的公开演讲,或者决定公开的组内报告

1,069 158 Updated Dec 12, 2024

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 13,030 1,874 Updated Mar 26, 2025

Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)

Shell 100 16 Updated May 3, 2024
Next
Showing results