936187425

Follow

Hengyu Pan 936187425

Follow

Interested Area: LLM/Blockchain。

9 followers · 9 following

USTC
Hefei,China
08:00 (UTC +08:00)
hypan@mail.ustc.edu.cn

Achievements

Achievements

Block or Report

Block or report 936187425

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

vectorch-ai/ScaleLLM vectorch-ai/ScaleLLM Public

A high-performance inference system for large language models, designed for production environments.

C++ 329 24
vllm-project/vllm vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 22.9k 3.2k
cuda_hgemm_study cuda_hgemm_study Public

Forked from Bruce-Lee-LY/cuda_hgemm

The repository is to study the CUDA tensor core forked from Bruce-Lee-LY. Thanks to Bruce-Lee-LY!

Cuda 1
flashinfer flashinfer Public

Forked from flashinfer-ai/flashinfer

The repository is for learning the FlashInfer and add some notes

Cuda
flash-attention flash-attention Public

Forked from Dao-AILab/flash-attention

The reposity is to learn the cutlass by the flash-attention demo

Python
LoRA LoRA Public

Forked from microsoft/LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python