add support for AMD / ROCm / HIP #707

Open
ehartford opened this issue Dec 6, 2023 · 8 comments

Comments

@ehartford

I want to request AMD support again, since it is now much more popular and usable than it has been.

@wsippel

wsippel commented Dec 10, 2023

AMD is working on it: https://github.com/ROCmSoftwarePlatform/flash-attention

I've not tested it yet, but it seems a new branch with WMMA optimizations for Radeon 7000-series GPUs was added just yesterday.

@nktice

nktice commented Dec 19, 2023

I have composed this guide for my AMD AI configuration:
https://github.com/nktice/AMD-AI
The ROCm fork of flash-attention appears to work with ROCm 5.7.
[ https://github.com/nktice/AMD-AI/blob/main/ROCm-5.7.md - I haven't tested it much, but the exllamav2 warnings that appear when flash-attention is missing disappear once it is installed in this case (see the sketch at the end of this comment). ]

Alas, it does not work with ROCm 6 at the time of writing.
[ https://github.com/nktice/AMD-AI/blob/main/ROCm6.0.md - in this case exllamav2 crashes if flash-attention ( same as above ) is installed. ]

An issue with this is that the AMD fork is always behind the main repository and is hard to keep in sync with the upstream code and developers.

What would be helpful is for AMD's changes to be merged back into the upstream source, so that they do not have to start from scratch again every time the main flash-attention code is updated.
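
For context on the exllamav2 behavior above: loaders typically probe for flash-attn at import time and fall back to PyTorch's built-in attention when it is missing, which is roughly why the warnings go away once the ROCm build is installed. A minimal sketch of that general pattern (an illustration, not exllamav2's actual code):

```python
import torch
import torch.nn.functional as F

try:
    # Provided by flash-attention (or the ROCm fork of it)
    from flash_attn import flash_attn_func
    HAS_FLASH_ATTN = True
except ImportError:
    HAS_FLASH_ATTN = False
    print("flash-attn not found, falling back to torch SDPA")  # the kind of warning a loader emits

def attention(q, k, v, causal=True):
    """q, k, v: (batch, seqlen, nheads, headdim), fp16/bf16 tensors on the GPU."""
    if HAS_FLASH_ATTN:
        return flash_attn_func(q, k, v, causal=causal)
    # Fallback path: scaled_dot_product_attention expects (batch, nheads, seqlen, headdim)
    q, k, v = (t.transpose(1, 2) for t in (q, k, v))
    out = F.scaled_dot_product_attention(q, k, v, is_causal=causal)
    return out.transpose(1, 2)
```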

@ehartford
Author

@tridao is it possible to merge this to support ROCm?

https://github.com/ROCmSoftwarePlatform/flash-attention

@tridao
Contributor

tridao commented Jan 18, 2024

> https://github.com/ROCmSoftwarePlatform/flash-attention

I think that's a fork maintained by AMD folks and it's not meant to be merged.

@ehartford
Author

I doubt they would disapprove of merging; it seems like just a gap in communication. I will reach out.

@nktice

nktice commented Apr 3, 2024

> https://github.com/ROCmSoftwarePlatform/flash-attention
>
> I think that's a fork maintained by AMD folks and it's not meant to be merged.

As it's been a while and they haven't updated or integrated it, I'd like to mention that AMD rarely updates or maintains such things; it's common for them to abandon such projects with little notice. For example, their bitsandbytes port is well out of date:
https://github.com/ROCm/bitsandbytes
This leaves others improvising for themselves to get things working.
[ Here's the most recent bitsandbytes I've found that works with ROCm... it's also out of date, but not quite as abandoned as AMD's own. ]
https://github.com/arlo-phoenix/bitsandbytes-rocm-5.6
There's been no quarrel about people's forked versions, and there are a few, but without AMD's help it is something of a mess of mixed offerings.
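
For anyone trying to tell whether a given ROCm bitsandbytes build actually works, a quick smoke test along these lines can help (just a sketch, not from any of the guides above; the layer sizes are arbitrary):

```python
import torch
import bitsandbytes as bnb

# A ROCm-enabled PyTorch exposes the GPU through the CUDA API surface.
assert torch.cuda.is_available(), "no ROCm/CUDA device visible to PyTorch"

# Build a small 8-bit linear layer and run one forward pass on the GPU.
layer = bnb.nn.Linear8bitLt(64, 64, has_fp16_weights=False).cuda()
x = torch.randn(1, 64, dtype=torch.float16, device="cuda")
print(layer(x).shape)  # torch.Size([1, 64]) if the build is functional
```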

It is more likely they offered an example of what could be done and how to do it, so that the 'community' could take it from there. [ If that were not the case, they would clearly say so, or keep the fork private. ]

I contacted the exllamav2 developers about the version issue; here is what they said - AMD's offered version isn't of much use:
turboderp/exllamav2#397 (comment)

@RichardFevrier

Maybe @howiejayz could be part of this conversation =)

@howiejayz

> Maybe @howiejayz could be part of this conversation =)

Unfortunately I am no longer working on this project :( But as far as I know the other team is still working on it, and it will have long-term support.
