The llama.cpp server, packaged and distributed as a Python wheel. Build and install it from source as follows:
git clone --recurse-submodules https://github.com/oobabooga/llama-cpp-binaries
cd llama-cpp-binaries
CMAKE_ARGS="-DGGML_CUDA=ON -DGGML_NATIVE=off -DCMAKE_CUDA_ARCHITECTURES=all" pip install -v .
import subprocess
from llama_cpp_binaries import get_binary_path
# Resolve the llama.cpp server executable bundled inside the installed wheel.
# NOTE(review): presumably returns a filesystem path (str or Path) to the
# compiled server binary — confirm against the llama-cpp-binaries package.
server_binary_path = get_binary_path()
# Launch the server as a child process, e.g.:
#   proc = subprocess.Popen([server_binary_path, "--model", model_path, "--port", "8080"])
# Pass argv as a list (shell=False, the default) — no shell string, no injection risk.