Ph.D. Student @ UofT. Large Model Optimization @ NVIDIA.
- University of Toronto
- Toronto, CA
- in/zhanda-zhu-828024210
Highlights
- Pro
Pinned
- CentML/Mist (Public): [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization
- UofT-EcoSystem/Tempo (Public): Memory footprint reduction for transformer models