A cost-aware, agentic Retrieval-Augmented Generation (RAG) platform featuring a 5-layer caching architecture, LlamaIndex orchestration, and token economics monitoring.
-
Updated
Apr 3, 2026 - Python
A cost-aware, agentic Retrieval-Augmented Generation (RAG) platform featuring a 5-layer caching architecture, LlamaIndex orchestration, and token economics monitoring.
Add a description, image, and links to the pretext topic page so that developers can more easily learn about it.
To associate your repository with the pretext topic, visit your repo's landing page and select "manage topics."