shixianc

Follow

😁

i like mcdonalds

shixianc shixianc

😁

i like mcdonalds

Follow

tcui1101@gmail.com

0 followers · 1 following

shixianc/README.md

Hi there, I'm Shixian Cui 👋

Interested in machine learning, especially model inference optimization.

Connect with me:

emai: tcui1101@gmail.com

linkedin: shixian cui

school projects

Pinned Loading

ray-project/ray ray-project/ray Public

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 33.8k 5.7k
vllm-project/vllm vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 29.7k 4.5k
triton-inference-server/server triton-inference-server/server Public

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8.3k 1.5k
NVIDIA/TensorRT-LLM NVIDIA/TensorRT-LLM Public

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8.6k 974
triton-inference-server/model_navigator triton-inference-server/model_navigator Public

Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.

Python 183 25
QwenLM/Qwen2-Audio QwenLM/Qwen2-Audio Public

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1.2k 80