This repository collects scripts and results for various inference experiments. Directories are organized by task to facilitate future reference and reuse.
For context, see the official vLLM blog post: vLLM Sleep Mode, this link: https://blog.vllm.ai/2025/10/26/sleep-mode.html
The script used in the blogpost can be found in /sleep_mode