gguf

Star

Here are 6 public repositories matching this topic...

developer239 / llama.cpp-ts

Star

llama.cpp 🦙 LLM inference in TypeScript

nodejs typescript llama node-addon-api llm llms ggml meta-ai gguf llama3

Updated Sep 26, 2024
C++

PrithivirajDamodaran / blitz-embed

Sponsor

Star

C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welcome.

embedding bert-embeddings baai ggml gguf

Updated Mar 4, 2024
C++

gpustack / llama-box

Star

LLM inference server implementation based on llama.cpp.

cpp llama gguf openai-compatible-api

Updated Oct 6, 2024
C++

mgonzs13 / llama_ros

Star

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

cpp embeddings llama gpt ros2 vlm reranking llm langchain llava llamacpp ggml gguf rerank llavacpp

Updated Oct 6, 2024
C++

janhq / cortex.cpp

Star

Run and customize Local LLMs.

onnx onnxruntime llamacpp gguf tensorrt-llm

Updated Oct 5, 2024
C++

LostRuins / koboldcpp

Star

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

llama language-model gemma mistral koboldai llm llamacpp ggml koboldcpp gguf

Updated Oct 6, 2024
C++

Improve this page

Add a description, image, and links to the gguf topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gguf topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gguf

Here are 6 public repositories matching this topic...

developer239 / llama.cpp-ts

PrithivirajDamodaran / blitz-embed

gpustack / llama-box

mgonzs13 / llama_ros

janhq / cortex.cpp

LostRuins / koboldcpp

Improve this page

Add this topic to your repo