weishengying

Follow

weishengying weishengying

Follow

7 followers · 1 following

Achievements

Achievements

Block or Report

Block or report weishengying

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

MoE MoE Public

MoE layer for pytorch

C++ 1
AutoAWQ AutoAWQ Public

Forked from casper-hansen/AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1
Megatron-LM Megatron-LM Public

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

Python
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
DeepSpeed DeepSpeed Public

Forked from microsoft/DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python
Hands-on-GEMM Hands-on-GEMM Public

Forked from AyakaGEMM/Hands-on-GEMM

Cuda