Ask questions over your own documents using hybrid AI search.
RAG Chat Interface
Monitoring Dashboard
- Hybrid retrieval — BM25 keyword search + vector semantic search
- Cross-encoder reranking for higher precision
- Citation enforcement — declines if context doesn't support the answer
- LangSmith tracing — every query tracked end-to-end
- Monitoring dashboard — latency, quality scores, declined queries
- Chat UI built with Streamlit
- LLM: OpenAI GPT-4o-mini
- Embeddings: text-embedding-3-small
- Vector DB: ChromaDB
- Keyword Search: BM25
- Reranker: ms-marco-MiniLM-L-6-v2
- Tracing: LangSmith
- UI: Streamlit
- Add your documents to
./docs/ - Create
.envfile with your API keys - Run
python ingest.py - Run
streamlit run app.py - Run
streamlit run dashboard.py --server.port 8502for monitoring

