LLM budget control and cost governance for AI agents. Python library for token budgets, usage limits and guardrails for OpenAI, Anthropic, LangChain, LangGraph and agentic systems.
Updated Mar 23, 2026 - Python
A production-ready CLI Chatbot featuring an advanced RAG pipeline with Hybrid Search, Cross-Encoder Reranking, and a Multi-Model Fallback Router (Anthropic/Ollama). It includes Smart Memory context compression and a native LLM-as-a-Judge evaluation framework.
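As a rough illustration of what a multi-model fallback router does, the sketch below tries a list of providers in order and returns the first successful answer. It is not taken from the repository: `complete_with_fallback`, `AllProvidersFailed`, and the two stub providers are hypothetical names, and the stubs stand in for real Anthropic or Ollama clients so that the example runs without network access.

```python
from typing import Callable, Sequence


class AllProvidersFailed(RuntimeError):
    """Raised when every provider in the chain has raised an error."""


def complete_with_fallback(
    prompt: str,
    providers: Sequence[tuple[str, Callable[[str], str]]],
) -> tuple[str, str]:
    """Try each (name, callable) pair in order; return (name, answer) of the first success."""
    errors = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # broad on purpose: any failure triggers the next provider
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))


# Hypothetical stand-ins for a hosted model and a local model.
def flaky_primary(prompt: str) -> str:
    raise TimeoutError("simulated timeout")


def local_backup(prompt: str) -> str:
    return f"echo: {prompt}"


if __name__ == "__main__":
    chain = [("primary", flaky_primary), ("backup", local_backup)]
    print(complete_with_fallback("hello", chain))
```

A real router would likely distinguish retryable errors (timeouts, rate limits) from permanent ones (invalid credentials) instead of catching every exception.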
Hybrid SAP ticket routing: Rule Engine → TF-IDF → LLM fallback. Three-layer decision system with zero unnecessary API calls.
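The three-layer cascade described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the repository's code: the rules, the labelled examples, the queue names, the 0.3 similarity threshold, and the `llm` callable are all hypothetical, and TF-IDF is implemented by hand so the example has no dependencies. The point is that the LLM is called only when both cheaper layers decline to answer.

```python
import math
from collections import Counter
from typing import Callable, Optional

# Hypothetical keyword rules and labelled tickets, for illustration only.
RULES = {"password reset": "IAM", "invoice": "FI"}
EXAMPLES = [
    ("cannot post goods receipt in warehouse", "MM"),
    ("payroll run failed for employee", "HR"),
]


def tokenize(text: str) -> list[str]:
    return text.lower().split()


def rule_layer(ticket: str) -> Optional[str]:
    """Layer 1: substring keyword match; None when no rule fires."""
    lowered = ticket.lower()
    for keyword, queue in RULES.items():
        if keyword in lowered:
            return queue
    return None


def tfidf_vector(tokens: list[str], idf: dict[str, float]) -> dict[str, float]:
    # Tokens unseen in the examples get weight 0.
    return {t: c * idf.get(t, 0.0) for t, c in Counter(tokens).items()}


def cosine(a: dict[str, float], b: dict[str, float]) -> float:
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def tfidf_layer(ticket: str, threshold: float = 0.3) -> Optional[str]:
    """Layer 2: nearest labelled example by TF-IDF cosine; None below the threshold."""
    docs = [tokenize(text) for text, _ in EXAMPLES]
    df = Counter(t for d in docs for t in set(d))
    idf = {t: math.log((1 + len(docs)) / (1 + c)) + 1.0 for t, c in df.items()}
    query = tfidf_vector(tokenize(ticket), idf)
    best_score, best_queue = 0.0, None
    for doc, (_, queue) in zip(docs, EXAMPLES):
        score = cosine(query, tfidf_vector(doc, idf))
        if score > best_score:
            best_score, best_queue = score, queue
    return best_queue if best_score >= threshold else None


def route(ticket: str, llm: Callable[[str], str]) -> tuple[str, str]:
    """Return (queue, layer); the LLM is invoked only if rules and TF-IDF both abstain."""
    queue = rule_layer(ticket)
    if queue is not None:
        return queue, "rules"
    queue = tfidf_layer(ticket)
    if queue is not None:
        return queue, "tfidf"
    return llm(ticket), "llm"
```

In practice the threshold would be tuned on held-out tickets: set too low, the TF-IDF layer misroutes; set too high, more tickets fall through to the paid LLM call.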