"LightRAG: Simple and Fast Retrieval-Augmented Generation"
-
Updated
Apr 22, 2025 - Python
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Standardized Serverless ML Inference Platform on Kubernetes
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
Optimizing inference proxy for LLMs
Control GenAI interactions with power, precision, and consistency using Conversation Modeling paradigms
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers, VectorDBs, Agent Frameworks and GPUs.
A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data
Neural Network Compression Framework for enhanced OpenVINO™ inference
Gurubase lets you add an "Ask AI" button to your technical docs, turning your content into an AI assistant. It uses web pages, PDFs, YouTube videos, and GitHub repos as sources to generate instant, accurate answers with references. Deploy it via Slack, Discord, GitHub or a web widget.
OpenAI-Compatible RESTful APIs for Amazon Bedrock
A hub for various industry-specific schemas to be used with VLMs.
Open-source generalized AI agent for everyday task automations.
Python client library for Modal
MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seamless integration with existing OpenAI SDK clients while leveraging the power of local ML inference.
podcastfy.ai gradio demo app
Add a description, image, and links to the genai topic page so that developers can more easily learn about it.
To associate your repository with the genai topic, visit your repo's landing page and select "manage topics."