gguf

Star

Here are 6 public repositories matching this topic...

LostRuins / koboldcpp

Star

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

llama language-model gemma mistral koboldai llm llamacpp ggml koboldcpp gguf

Updated Nov 18, 2024
C++

janhq / cortex.cpp

Star

Local AI API Platform

onnx onnxruntime llamacpp gguf tensorrt-llm

Updated Nov 18, 2024
C++

mgonzs13 / llama_ros

Star

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

cpp embeddings llama gpt ros2 vlm reranking llm langchain llava llamacpp ggml gguf rerank llavacpp

Updated Nov 18, 2024
C++

gpustack / llama-box

Star

LLM inference server implementation based on llama.cpp.

cpp llama gguf openai-compatible-api

Updated Nov 18, 2024
C++

PrithivirajDamodaran / blitz-embed

Sponsor

Star

C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welcome.

embedding bert-embeddings baai ggml gguf

Updated Mar 4, 2024
C++

developer239 / llama.cpp-ts

Star

llama.cpp 🦙 LLM inference in TypeScript

nodejs typescript llama node-addon-api llm llms ggml meta-ai gguf llama3

Updated Sep 26, 2024
C++

Improve this page

Add a description, image, and links to the gguf topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gguf topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gguf

Here are 6 public repositories matching this topic...

LostRuins / koboldcpp

janhq / cortex.cpp

mgonzs13 / llama_ros

gpustack / llama-box

PrithivirajDamodaran / blitz-embed

developer239 / llama.cpp-ts

Improve this page

Add this topic to your repo