llama-cpp

Here are 4 public repositories matching this topic...

llama.go is like llama.cpp in pure Golang!

llama gpt alpaca vicuna gpt3 gpt4 llm chatgpt dalai llama-cpp gpt4all

Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

go llama-cpp gguf stable-diffusion-cpp llama-box

Unified management and routing for llama.cpp, MLX and vLLM models with web dashboard.

self-hosted mlx openai-api llm llamacpp llama-cpp vllm llm-inference localllm localllama llama-server llm-router mlx-lm

go binding for llama.cpp, offer low level and high level api

go llama gpt chatgpt llamacpp llama-cpp

Add a description, image, and links to the llama-cpp topic page so that developers can more easily learn about it.

To associate your repository with the llama-cpp topic, visit your repo's landing page and select "manage topics."