-
-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GFX Target Overriding #83
Comments
Hi, sorry for not answering the issue, I was taking a break, I will work on this for today's update |
Ohh nice, Thats pretty fast response, thanks ^^ |
By the way, now that I'm going to add overrides for the Ollama instance into the gui, is there any other override you think I should add? |
I just added this feature, if you want any other override let me know |
You should prolly add: CUDA_VISIBLE_DEVICES for GPU Selection on Cuda |
Cant wait to test it in the next release |
Ollama should output everything as far as I know, do you know if Ollama has different binaries for cpu and gpu usage? |
I dont know, but it says differnt things when running the installer |
and they have different Docker images, for me: |
I see, it has a different tag so it's probably different |
it is basicly the command for amd in docker from the doc + my overide |
https://github.com/ollama/ollama/blob/main/docs/linux.md It says here you need to install rocm, that could be it |
I would exspect that to, or it is jsut named different, becuase it bundles the rocm library with itselve |
you will have to get all required libs inside of the flatpak, system libraries wont help |
I thought you allready had all needed libs in the flatpak |
I didn't know it needed ROCM, honestly I didn't even know it was a different thing, I thought the amdgpu driver would work out of the box |
no, rocm is what nvvidia toolkit is, but for amd |
I see, I'll work on this for next release, thanks for pointing it out, I don't have a compatible AMD GPU so I couldn't test it haha |
also I wouldt even be able to get the libraries instaled on my system, becuase im running an immutable one, but its working inside the docker, so it should also be able to get it work with the flatpak |
mine isnt 100% supported, but near enough and works flawlessley when using the override with the docker container |
then good luck, just tell me when you need a tester xD |
btw what distro do you use? I've been wanting to use an immutable distro for my laptop |
I installed rocm-flatpak and set Alpaca to use that flatpak driver, but I'm also unable to get it to use my 7900xtx. Any ideas? The flatpak driver recognizes my gpu fine so I'm not sure why it's not being used.
Ollama does utilize my GPU properly when installed on the host. I'm on Fedora40. |
Alpaca doesnt currently use the right ollama binary, and is also not linked to the flatpak rocm binaries. (please read the full issue comments) @Jeffser (The Creator of this project) Said, that he will work on this in the future, because it seemes like theres quite some stuff to do for AMD support it completeley. for now you can just run ollama inside of a docker, or nativley and link to it in alpaca, for questions how to run ollama localy, or in a docker, see the ollama wiki. |
Im currentley utelizing vanilla Kinoite, but I gonna give Ublue a changse soon, especiially, becuase on native Kinoite WebRTC is jittery with my GPU for some reason, and KDE connect isnt working with with the firefox flatpak, so I gonna see, if with their kinoite image: https://github.com/ublue-os/main/pkgs/container/kinoite-main im gonna have a better experince (i will only have to reabase, so switching and trying out will be easy) |
Is your feature request related to a problem? Please describe.
My GPU isnt utilisied
Describe the solution you'd like
The ability to Override GFX targets, like described in the ollama Docs:
https://github.com/ollama/ollama/blob/main/docs/gpu.md#overrides
Describe alternatives you've considered
Running Ollama Manually in a Toolbx
Additional context
I got an RX 6650XT which has a target of: gfx1032, but the nearest supported one is: 1030, so the Override for me would be:
HSA_OVERRIDE_GFX_VERSION="10.3.0"
Yes, I tried setting it as an Flatpak override, but that didnt work
The text was updated successfully, but these errors were encountered: