A curated map of AFD, PD disaggregation, KV-cache systems, MoE serving, and re-aggregation baselines for LLM serving.
moe awesome-list disaggregation afd mixture-of-experts kv-cache llm-serving inference-systems prefill-decode
-
Updated
May 15, 2026 - Python