jychen21

Follow

🐼

Jinyan Chen jychen21

🐼

Follow

2 followers · 15 following

Achievements

Achievements

Pinned Loading

TensorRT-LLM TensorRT-LLM Public

Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++
cutlass cutlass Public

Forked from NVIDIA/cutlass

CUDA Templates for Linear Algebra Subroutines

C++
Awesome-LLM-Inference Awesome-LLM-Inference Public

Forked from DefTruth/Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
CUDA-Learn-Notes CUDA-Learn-Notes Public

Forked from DefTruth/CUDA-Learn-Notes

🎉 Modern CUDA Learn Notes with PyTorch: CUDA Cores, Tensor Cores, fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, hgemm, sgemv, warp/block reduce, elementwise, softmax, layernorm, rmsnorm.

Cuda
deepseekv2-profile deepseekv2-profile Public

Forked from madsys-dev/deepseekv2-profile

Jupyter Notebook 1
Habana-LLM-Viewer Habana-LLM-Viewer Public

Python 10 3