resorcap

Follow

cyan resorcap

Follow

10 followers · 13 following

Beijing

Achievements

Achievements

Popular repositories Loading

serving serving Public

Forked from tensorflow/serving

A flexible, high-performance serving system for machine learning models

C++ 1
flash-attention flash-attention Public

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python
cutlass cutlass Public

Forked from NVIDIA/cutlass

CUDA Templates for Linear Algebra Subroutines

C++
cutlass_fpA_intB_gemm cutlass_fpA_intB_gemm Public

Forked from tlc-pack/cutlass_fpA_intB_gemm

A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer

C++
FasterTransformer FasterTransformer Public

Forked from NVIDIA/FasterTransformer

Transformer related optimization, including BERT, GPT

C++
googletest googletest Public

Forked from google/googletest

GoogleTest - Google Testing and Mocking Framework

C++