AI/ML Platform @Roblox; Working on @vllm-project when I catch a breath
-
Roblox
- San Mateo
-
00:15
- 7h behind - in/rogerywang
- @rogerw0108
Stars
A Datacenter Scale Distributed Inference Serving Framework
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
how to optimize some algorithm in cuda.
Entropy Based Sampling and Parallel CoT Decoding
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
A high-throughput and memory-efficient inference and serving engine for LLMs