- Washington D.C.
-
13:49
(UTC -04:00) - in/drew-c-611b22120
Popular repositories Loading
-
-
vllm-rdna4-container-patches
vllm-rdna4-container-patches PublicRuntime patches + CPU-offload flag to run vLLM on Radeon RX 9070 XT (gfx1201/RDNA 4) in a container, including 14B Q4_K_M GGUF models on 16 GB of VRAM. Layered on top of bluefalcon13/vllm-rocm.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.