Skip to content

Latest commit

 

History

History
29 lines (23 loc) · 1.96 KB

File metadata and controls

29 lines (23 loc) · 1.96 KB
title Enable Large Language Model Deployment Across Cloud and Edge with ML Compilation
date 2025-02-06 09:00:00
tags
MLSys Seminar
draft false
authors
mlsys
hosts/host_zhang_hao
hosts/host_junda_chen
speakers Speaker: Prof. Tianqi Chen, CMU
summary In this talk, we will discuss the lessons learned in building an efficient large language model deployment system for both server and edge settings. We will cover general techniques in machine learning compilation and system support for efficient structure generation. We will also discuss the future opportunities in system co-design for cloud-edge model deployments.
images
/static/images/events/seminar_2025_0206/tqchen.jpg

This week, our MLSys seminar is pleased to present a talk by Prof. Tianqi Chen scheduled on **Thursday, February 6 @ 6:30 PM (PST)**. We welcome all interested students and faculty to attend the talk on Zoom: https://ucsd.zoom.us/j/94713375885

Talk title: Enable Large Language Model Deployment Across Cloud and Edge with ML Compilation

Talk Abstract: In this talk, we will discuss the lessons learned in building an efficient large language model deployment system for both server and edge settings. We will cover general techniques in machine learning compilation and system support for efficient structure generation. We will also discuss the future opportunities in system co-design for cloud-edge model deployments.

![tqchen](/static/images/events/seminar_2025_0206/tqchen.jpg)

**Bio:** Tianqi Chen is currently an Assistant Professor at the Machine Learning Department and Computer Science Department of Carnegie Mellon University. He is also a distinguished engineer at NVIDIA. He received his PhD from the Paul G. Allen School of Computer Science & Engineering at the University of Washington. He has created many major learning systems that are widely adopted: XGBoost, Apache TVM, and MLC-LLM.