llmops
Here are 130 public repositories matching this topic...
A high-throughput and memory-efficient inference and serving engine for LLMs
-
Updated
May 26, 2024 - Python
Open-source observability for your LLM application, based on OpenTelemetry
-
Updated
May 26, 2024 - Python
Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.
-
Updated
May 26, 2024 - Python
🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 11 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴
-
Updated
May 26, 2024 - Python
AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
-
Updated
May 26, 2024 - Python
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
-
Updated
May 26, 2024 - Python
The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.
-
Updated
May 26, 2024 - Python
🎯 Your free LLM evaluation toolkit helps you assess the accuracy of facts, how well it understands context, its tone, and more. This helps you see how good your LLM applications are.
-
Updated
May 26, 2024 - Python
The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!
-
Updated
May 26, 2024 - Python
OpenLIT is an open-source GenAI and LLM observability platform native to OpenTelemetry with traces and metrics in a single application 🔥 🖥 . Open source GenAI and LLM Application Performance Monitoring (APM) & Observability tool
-
Updated
May 26, 2024 - Python
This is an AI-Gemini Chatbot LLM And Large Image Model Application. You can use this project run into local and ask you images like your talking with in realtime
-
Updated
May 26, 2024 - Python
⚡ From Zero to Monitoring LLMs in 5 minutes ⚡
-
Updated
May 26, 2024 - Python
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
-
Updated
May 25, 2024 - Python
A python library to enable GenAI and LLMOps within Google Cloud Platform
-
Updated
May 25, 2024 - Python
🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.
-
Updated
May 25, 2024 - Python
Improve this page
Add a description, image, and links to the llmops topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the llmops topic, visit your repo's landing page and select "manage topics."