📚 Production RAG System

Ask questions over your own documents using hybrid AI search.

📸 Screenshots

RAG Chat Interface

Monitoring Dashboard

✨ Features

Hybrid retrieval — BM25 keyword search + vector semantic search
Cross-encoder reranking for higher precision
Citation enforcement — declines if context doesn't support the answer
LangSmith tracing — every query tracked end-to-end
Monitoring dashboard — latency, quality scores, declined queries
Chat UI built with Streamlit

🛠️ Tech Stack

LLM: OpenAI GPT-4o-mini
Embeddings: text-embedding-3-small
Vector DB: ChromaDB
Keyword Search: BM25
Reranker: ms-marco-MiniLM-L-6-v2
Tracing: LangSmith
UI: Streamlit

🚀 How to Run

Add your documents to ./docs/
Create .env file with your API keys
Run python ingest.py
Run streamlit run app.py
Run streamlit run dashboard.py --server.port 8502 for monitoring

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
docs		docs
eval		eval
screenshots		screenshots
.gitignore		.gitignore
README.md		README.md
app.py		app.py
chain.py		chain.py
dashboard.py		dashboard.py
ingest.py		ingest.py
monitor.py		monitor.py
monitoring.db		monitoring.db
requirements.txt		requirements.txt
retrieve.py		retrieve.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📚 Production RAG System

📸 Screenshots

✨ Features

🛠️ Tech Stack

🚀 How to Run

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

📚 Production RAG System

📸 Screenshots

✨ Features

🛠️ Tech Stack

🚀 How to Run

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages