
Hi, I'm Kuntai Du 👋

I'm a graduating PhD student @ UChicago, working on Large Language Model inference. Check my home page for more about me!

🔧 Experiences

  • 🚀 Working on the vLLM project as a vLLM core team member and committer. My contributions:
    • Performance comparison with other LLM inference engines: see the end of the blog post.
    • Features: disaggregated prefilling and CPU offloading.
  • 💾 Contributing to the LMCache project, exploring fun ideas in KV caches.

🎮 Hobbies and Interests

  • 🎮 Gaming: League of Legends, Stardew Valley, Go
  • 💃 Street Dance: Locking main, but I also dance waacking.
  • 🎤 Singing: Loch Lomond and 传奇 (Legend)

📧 Contact
