
llama.cpp GPU support #46

Open
alexl83 opened this issue May 14, 2023 · 1 comment

Comments


alexl83 commented May 14, 2023

Hi, since commit 905d87b70aa189623d500a28602d7a3a755a4769, llama.cpp supports GPU inference with NVIDIA CUDA via command-line switches such as --n-gpu-layers (-ngl).
Could you please consider adding support to gpt-llama.cpp as well?

Thank you!
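
For reference, a minimal sketch of that switch in standalone llama.cpp, assuming a cuBLAS build of that era; the model path and layer count are illustrative:

    # Build llama.cpp with CUDA (cuBLAS) support
    make clean && LLAMA_CUBLAS=1 make

    # Offload 32 layers to the GPU (model path is illustrative)
    ./main -m ./models/7B/ggml-model-q4_0.bin -p "Hello" -ngl 32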


msj121 commented Jun 11, 2023

@alexl83
Just looking at the code: since gpt-llama.cpp compiles llama.cpp itself, it appears you could build it with CUDA support and then pass the argument like any of the others listed, such as threads.

e.g.:
npm start ngl 4 to offload four layers to the GPU. I don't have a compatible setup to test with, though.
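
A sketch of that full flow, under a few assumptions: the llama.cpp checkout is the copy gpt-llama.cpp compiles, it can be rebuilt with cuBLAS, and npm start forwards extra arguments to the binary verbatim (the ngl forwarding is untested, per the comment above):

    # 1. Rebuild the bundled llama.cpp with CUDA (cuBLAS) support
    cd llama.cpp && make clean && LLAMA_CUBLAS=1 make && cd ..

    # 2. Start gpt-llama.cpp, passing the GPU layer count like any other flag
    #    (argument forwarding is an assumption from this thread, untested)
    npm start ngl 4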
