A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.
-
Updated
May 14, 2021
A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.
Accelerating Deep Learning Training (DLT) from Storage Perspective
TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
Enhancing TVM Autotuner to Optimize for Energy Efficiency
My collection of Distributed system papers, especially for ML.
Code for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]
Federated Learning Systems Paper List
NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data
A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
Optimal Sparse Decision Trees
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
A scalable & efficient active learning/data selection system for everyone.
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
Deep Learning Energy Measurement and Optimization
Machine Learning Framework for Operating Systems - Brings ML to Linux kernel
Add a description, image, and links to the mlsys topic page so that developers can more easily learn about it.
To associate your repository with the mlsys topic, visit your repo's landing page and select "manage topics."