Code for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]
Updated Sep 26, 2024 - Python