Analyze LLM inference: FLOPs, memory, Roofline model. Supports GQA, MoE, MLA, RoPE, SwiGLU. 19 models × 20+ hardware platforms.
Updated Apr 16, 2026 - Python
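As a rough illustration of the Roofline model named in the description, the sketch below bounds attainable throughput by the smaller of the compute roof and memory bandwidth times arithmetic intensity. The function names and the hardware numbers are hypothetical and chosen only for illustration; they are not taken from the repository or from any real device.

```python
def attainable_flops(peak_flops: float, mem_bandwidth: float, intensity: float) -> float:
    """Roofline bound in FLOP/s.

    peak_flops    : compute roof, FLOP/s
    mem_bandwidth : memory bandwidth, bytes/s
    intensity     : arithmetic intensity, FLOPs per byte moved
    """
    return min(peak_flops, mem_bandwidth * intensity)


def ridge_point(peak_flops: float, mem_bandwidth: float) -> float:
    """Arithmetic intensity (FLOPs/byte) at which a kernel stops being memory-bound."""
    return peak_flops / mem_bandwidth


# Hypothetical accelerator (assumed numbers, not a real spec sheet).
PEAK = 100e12  # 100 TFLOP/s
BW = 1e12      # 1 TB/s

# Assumed intensities: batch-1 decode with 16-bit weights does about
# 2 FLOPs per 2-byte weight read (~1 FLOP/byte); the prefill value is
# an arbitrary high-intensity example.
decode_bound = attainable_flops(PEAK, BW, intensity=1.0)
prefill_bound = attainable_flops(PEAK, BW, intensity=500.0)

print(f"ridge point: {ridge_point(PEAK, BW):.0f} FLOPs/byte")
print(f"decode bound: {decode_bound:.2e} FLOP/s")
print(f"prefill bound: {prefill_bound:.2e} FLOP/s")
```

Under these assumed numbers the decode case sits below the ridge point and is limited by bandwidth, while the prefill case sits above it and is limited by the compute roof.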