# llama-cpp-rx580

A sample Dockerfile that supports running LLMs with llama.cpp on the AMD RX 580 GPU.

The RX 580 is a wonderful but slightly old GPU, so getting it to run modern LLMs is a little tricky. The most robust method I've found is to compile llama.cpp with the Vulkan backend. To keep the mess of mismatched driver versions off my host machine, I created this Docker container. It bakes in everything needed to run a modern LLM, specifically Qwen3-VL:8b.
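
For orientation, here is a minimal sketch of that approach, not the exact Dockerfile in this repo: the base image, package names, and build flags below are assumptions. The key idea is building llama.cpp with `-DGGML_VULKAN=ON` against Mesa's RADV Vulkan driver, which covers Polaris cards like the RX 580.

```dockerfile
FROM ubuntu:24.04

# Build tools plus the Vulkan loader, headers, and shader compiler.
# mesa-vulkan-drivers provides RADV, the Mesa Vulkan driver for
# Polaris-era AMD cards such as the RX 580.
RUN apt-get update && apt-get install -y \
        build-essential cmake git \
        libvulkan-dev vulkan-tools glslc \
        mesa-vulkan-drivers \
    && rm -rf /var/lib/apt/lists/*

# Compile llama.cpp with the Vulkan backend enabled.
# LLAMA_CURL is switched off here just to avoid an extra dependency.
RUN git clone https://github.com/ggml-org/llama.cpp /opt/llama.cpp \
    && cmake -S /opt/llama.cpp -B /opt/llama.cpp/build \
        -DGGML_VULKAN=ON -DLLAMA_CURL=OFF \
    && cmake --build /opt/llama.cpp/build --config Release -j

ENTRYPOINT ["/opt/llama.cpp/build/bin/llama-server"]
```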

I'm sharing it here in case it helps anyone else. As configured, the llama.cpp parameters will consume 8104M of the GPU's 8147M of VRAM. If you need to trim that slightly, I recommend reducing the batch size or context length, as in the example below.
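
As a rough illustration (the image name, model filename, and flag values here are placeholders, not this repo's exact configuration), those knobs correspond to llama-server's `--ctx-size` and `--batch-size` flags:

```bash
# Placeholder values throughout; adjust to your setup.
# --device /dev/dri exposes the AMD GPU to the container for Vulkan.
docker run --rm --device /dev/dri \
  -v /path/to/models:/models \
  llama-cpp-rx580 \
    -m /models/Qwen3-VL-8B-Q4_K_M.gguf \
    --n-gpu-layers 99 \
    --ctx-size 8192 \
    --batch-size 256
```

Lowering `--ctx-size` frees the most memory, since the KV cache scales with context length; lowering `--batch-size` shrinks the temporary compute buffers instead.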

Many thanks to *Running Large Language Models on Cheap Old RX 580 GPUs with llama.cpp and Vulkan* for guidance.
