
AMD ROCm docker images support (+ optimization) #94

Open
3 tasks
michaelfeil opened this issue Feb 9, 2024 · 10 comments
Labels
help wanted Extra attention is needed

Comments

@michaelfeil
Owner

michaelfeil commented Feb 9, 2024

I am planning to evaluate hardware agnostic options

  • adapt poetry setup for optional deps
  • build a Docker Image for AMD Mi250/300
  • optimize settings e.g. torch.compile()/float16 etc for AMD
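
For the Poetry task above, a hypothetical sketch of what an optional torch dependency could look like (the actual group and extra names in infinity's pyproject.toml may differ):

```toml
# Hypothetical sketch: mark torch optional so ROCm users can bring
# their own torch build instead of the pinned CUDA one
[tool.poetry.dependencies]
torch = { version = ">=2.0", optional = true }

[tool.poetry.extras]
torch = ["torch"]
```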
@michaelfeil michaelfeil added the help wanted Extra attention is needed label Feb 9, 2024
@michaelfeil michaelfeil changed the title AMD ROCm support optimization AMD ROCm support (+ optimization) Feb 9, 2024
@hiepxanh

Yes, I love this. I have an AMD device and would be happy to try it out.

@michaelfeil
Owner Author

Awesome, love the proactivity! Let me create a draft PR. Do you have ROCm installed, and can you use PyTorch with ROCm? @hiepxanh

@michaelfeil
Owner Author

michaelfeil commented Feb 20, 2024

Here are some instructions on how to get it installed:
https://rocm.docs.amd.com/projects/install-on-linux/en/develop/how-to/3rd-party/pytorch-install.html

In my opinion, it should run out of the box with ROCm; the open questions are building and running a Docker image, and the performance.
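
For reference, those instructions boil down to installing torch from AMD's ROCm wheel index (a setup sketch; the rocmX.Y segment in the URL changes between releases, so take the exact URL from the linked page):

```shell
# Install a ROCm build of PyTorch (the rocm6.0 segment is an example version)
pip3 install torch --index-url https://download.pytorch.org/whl/rocm6.0

# Sanity check: a ROCm build reports a HIP version and exposes the GPU
# through torch's CUDA API
python3 -c "import torch; print(torch.version.hip, torch.cuda.is_available())"
```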

@hiepxanh

hiepxanh commented Mar 5, 2024

I have ROCm and an AMD RX 6600 card, but I ran into a lot of issues while testing: PyTorch doesn't support ROCm on Windows, and the Ubuntu image eats 20 GB for ROCm alone. I don't think we're ready yet. I see Vulkan works great with llama.cpp; maybe that's a good option for running the model. Let's keep this issue open while I keep watching the AMD team.

@michaelfeil
Owner Author

Sadly, native Ubuntu/Linux (no WSL) and an error-free ROCm installation are strict requirements for ROCm.

@hiepxanh

hiepxanh commented Mar 6, 2024

Yes, that's correct. WSL and Docker would be great places to install it, but I failed last time. I will do further testing once I have free time.

@michaelfeil michaelfeil changed the title AMD ROCm support (+ optimization) AMD ROCm docker images support (+ optimization) Apr 6, 2024
@peebles

peebles commented Jun 10, 2024

I am running Ubuntu 22.10 with a Navi 23 [Radeon RX 6650 XT] and the ROCm drivers installed (rocminfo and clinfo both work). I'll give infinity a shot on this system. When I run --help, I do not see rocm listed as a device, and when I run infinity with no special options, it picks "cpu" as the device. How should I run it?
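
One assumption worth noting here (not verified on this card): ROCm builds of PyTorch do not expose a separate rocm device string; they surface the GPU through torch's CUDA API. So a cuda/auto device setting, not a rocm one, is what a working ROCm install should match:

```shell
# On a working ROCm PyTorch install this prints True; if it prints False,
# torch is a CPU-only build and infinity will fall back to "cpu"
python3 -c "import torch; print(torch.cuda.is_available())"
```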

@hvico

hvico commented Jun 15, 2024

I can report Infinity works perfectly well using ROCm-accelerated PyTorch on a 7900XTX.

Just one tip: if, like me, you're not using the MI250X/MI300X series, set this variable before starting Infinity to avoid PyTorch errors complaining about no hipBLASLt support:

TORCH_BLAS_PREFER_HIPBLASLT=0

Ref: comfyanonymous/ComfyUI#3698
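
In code, the workaround above has to happen before torch is imported; a minimal sketch:

```python
import os

# Must be set before `import torch`, or the BLAS backend is already chosen.
# Workaround for consumer GPUs without hipBLASLt support
# (see the ComfyUI issue linked above).
os.environ["TORCH_BLAS_PREFER_HIPBLASLT"] = "0"

print(os.environ["TORCH_BLAS_PREFER_HIPBLASLT"])  # → 0
# only now is it safe to import torch
```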

@peebles

peebles commented Jun 15, 2024

Did you build infinity-emb from scratch using a different PyTorch than the one in pyproject.toml? Personally I am using the pre-built docker container image michaelf34/infinity:latest. @hvico, if you have a custom build, I'd like to know your recipe!

@hvico

hvico commented Jun 17, 2024

> Did you build infinity-emb from scratch using a different PyTorch than the one in pyproject.toml? Personally I am using the pre-built docker container image michaelf34/infinity:latest. @hvico, if you have a custom build, I'd like to know your recipe!

Hi. I didn't; I just installed the latest pip wheel, and then installed the official PyTorch ROCm nightly packages (replacing the bundled torch distribution with that one).

To dockerize this, I froze that virtualenv, started from the official ROCm PyTorch docker image, and added the appropriate pip install -r requirements.txt from the exported file.

I am not sharing that file because it has many other dependencies unrelated to infinity, so it makes no sense to use it as a template. But this is the main procedure I followed.

Hope it helps.
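
A minimal Dockerfile sketch of that procedure (the base image tag, model id, and entrypoint are assumptions, and requirements.txt stands in for your own exported freeze):

```dockerfile
# Official ROCm PyTorch image, so torch itself is not reinstalled by pip
FROM rocm/pytorch:latest

# requirements.txt = the frozen virtualenv, with the torch lines removed
COPY requirements.txt /tmp/requirements.txt
RUN pip install --no-cache-dir -r /tmp/requirements.txt

# Hypothetical entrypoint; adjust to however you launch infinity
CMD ["infinity_emb", "v2", "--model-id", "BAAI/bge-small-en-v1.5"]
```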
