
gpu support #51

Merged (5 commits into master, May 16, 2023)
Conversation

@mudler (Member) commented May 14, 2023

This PR adds GPU support (inference untested; I only tried building and running on Colab).

OpenBLAS acceleration

To build and run with OpenBLAS, for example:

BUILD_TYPE=openblas make libbinding.a
CGO_LDFLAGS="-lopenblas" LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD go run -tags openblas ./examples -m "/model/path/here" -t 14

GPU

To build with CuBLAS:

BUILD_TYPE=cublas make libbinding.a
CGO_LDFLAGS="-lcublas -lcudart -L/usr/local/cuda/lib64/" LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD go run ./examples -m "/model/path/here" -t 14

@mudler (Member, Author) commented May 16, 2023

OK, tested on Colab and it seems to work. I'll merge as-is since I don't have the hardware to test locally and don't want this to sit pending here. As it is opt-in, there is no harm in merging this and iterating if it doesn't work correctly.

@mudler marked this pull request as ready for review May 16, 2023 14:22
@mudler merged commit 7a952ea into master May 16, 2023
2 checks passed
@mudler deleted the gpu branch May 16, 2023 14:22
@cubuzz commented May 16, 2023

I apologize if it's something I'm missing, but trying to build with CuBLAS yields:

$ BUILD_TYPE=cublas make libbinding.a

cd build && cp -rf CMakeFiles/ggml.dir/ggml-cuda.cu.o ../llama.cpp/ggml-cuda.o
cp: cannot stat 'CMakeFiles/ggml.dir/ggml-cuda.cu.o': No such file or directory
make: *** [Makefile:162: llama.cpp/ggml-cuda.o] Error 1

Do I need to grab some other dependencies?

=======================================================================

RESOLVED: Somehow llama.cpp wasn't being built. Building it manually, then changing back fixes the issue.

@SilverViper commented Jun 5, 2023

I'm trying to build the Flowise example with docker-compose and I have the same error:
cp: cannot stat 'CMakeFiles/ggml.dir/ggml-cuda.cu.o': No such file or directory

Any hint on how I go about building llama.cpp manually? Or to prevent the error?
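For anyone hitting the same `cannot stat 'CMakeFiles/ggml.dir/ggml-cuda.cu.o'` error: the missing object file suggests the CMake step for the vendored llama.cpp sources never ran before the Makefile tried to copy its output. A manual build might look roughly like the sketch below; the submodule layout and the `LLAMA_CUBLAS` CMake option are assumptions based on llama.cpp's build options at the time, so adjust paths and flags to your checkout.

```shell
# From the repo root (paths assumed): make sure the vendored
# llama.cpp sources are actually present before building.
git submodule update --init --recursive

# Configure and build llama.cpp itself with the cuBLAS backend enabled
# (flag name assumed from llama.cpp's CMake options circa mid-2023).
cd llama.cpp
mkdir -p build && cd build
cmake .. -DLLAMA_CUBLAS=ON
cmake --build . --config Release

# Back at the repo root, retry the binding build so the Makefile can
# find the freshly built CUDA object files.
cd ../..
BUILD_TYPE=cublas make libbinding.a
```

If the object files now exist under `build/CMakeFiles/`, the `cp` step in the Makefile should succeed.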

3 participants