llama
Here are 33 public repositories matching this topic...
🤖 The free, open-source OpenAI alternative. Self-hosted, community-driven, and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers, and many other model architectures. Generates text, audio, video, and images, with voice cloning capabilities.
Updated Jul 11, 2024 - C++
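Because the project above advertises itself as an OpenAI-compatible, drop-in replacement, existing OpenAI client code can typically target a local instance just by changing the base URL. A minimal sketch of building such a request, assuming a local server on port 8080 (the port and model name are assumptions for illustration, not taken from the listing):

```python
# Sketch: point OpenAI-style chat-completion requests at a local server
# instead of api.openai.com. Only the payload shape is standard; the base
# URL and model name below are assumptions.
import json
import urllib.request

BASE_URL = "http://localhost:8080/v1"  # assumed local endpoint


def chat_request(prompt: str, model: str = "gpt-4") -> urllib.request.Request:
    """Build an OpenAI-style chat completion request aimed at a local server."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = chat_request("Hello!")
print(req.full_url)  # http://localhost:8080/v1/chat/completions
```

Sending the request with `urllib.request.urlopen(req)` (or any HTTP client) would then return an OpenAI-format completion from the local model, with no code changes beyond the base URL.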
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Updated Jul 1, 2024 - C++
Lightweight inference library for ONNX files, written in C++. It can run SDXL on a Raspberry Pi Zero 2, as well as Mistral 7B on desktops and servers.
Updated Jun 19, 2024 - C++
Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM, ONNX). Powers 👋 Jan
Updated Jul 11, 2024 - C++
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Updated Jul 11, 2024 - C++
LLaMA (Large Language Model Meta AI) is a research project initiated by Facebook AI Research (FAIR) that aims to improve machine translation quality using a more natural approach focused on the source language.
Updated Apr 13, 2024 - C++
A high-performance inference system for large language models, designed for production environments.
Updated Jul 11, 2024 - C++
🤘 TT-NN operator library and TT-Metalium low-level kernel programming model.
Updated Jul 11, 2024 - C++
WebAssembly binding for llama.cpp, enabling in-browser LLM inference.
Updated Jul 10, 2024 - C++
Fast Multimodal LLM on Mobile Devices
Updated Jul 4, 2024 - C++
LLaVA server (llama.cpp).
Updated Oct 20, 2023 - C++