llamacpp
Here are 24 public repositories matching this topic...
LLM InferenceNet is a C++ project designed to facilitate fast and efficient inference from Large Language Models (LLMs) using a client-server architecture. It enables optimized interactions with pre-trained language models, making deployment on edge devices easier.
Updated Jul 28, 2023 - C++
Getting an LLM to work with Godot.
Updated Oct 11, 2023 - C++
A Llama causal LM fully recreated in LibTorch, designed for use in Unreal Engine 5.
Updated Jan 5, 2024 - C++
This project accelerates local deployment of ChatGLM and vector inference using PyTorch compiled to C++, and includes an OpenAI API mock script for quickly setting up a local speed-testing service. This setup improves performance and efficiency, making it well suited to high-performance applications and development testing.
Updated Jan 20, 2024 - C++
Lightweight terminal chat interface for the llama.cpp server, compilable for Windows and Linux.
Updated Mar 1, 2024 - C++
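The llama.cpp server that such chat interfaces talk to exposes an OpenAI-compatible /v1/chat/completions endpoint. A minimal client sketch, assuming a server running locally on port 8080 (the default) — the helper names and the temperature value here are illustrative, not part of any of the listed projects:

```python
import json
import urllib.request

def build_payload(prompt, temperature=0.7):
    """Build an OpenAI-style chat-completion request body.

    llama.cpp serves whichever model it was started with, so the
    "model" field is largely ignored by the server.
    """
    return {
        "model": "default",
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
    }

def chat(prompt, base_url="http://localhost:8080"):
    """Send one chat turn to a llama.cpp server and return the reply text."""
    req = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(build_payload(prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

A terminal interface like the one above is essentially this request in a loop, appending each user and assistant turn to the "messages" list to preserve conversation history.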
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
Updated Mar 15, 2024 - C++
Multi-model, multi-tasking Llama Discord bot - Mirror of: https://gitlab.com/niansa/discord_llama
Updated Mar 27, 2024 - C++
Inference Vision Transformer (ViT) in plain C/C++ with ggml
Updated Apr 11, 2024 - C++
PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation
Updated Apr 15, 2024 - C++
Local LLM Inference
Updated Jun 4, 2024 - C++
LLM in Godot
Updated Jun 6, 2024 - C++