DocsChat RAG 🚀

Enterprise-grade RAG system for technical documentation with advanced retrieval strategies and precise source citation

Features • Demo • Quick Start • Architecture • Documentation

📋 Overview

DocsChat RAG is a production-ready Retrieval-Augmented Generation system designed for querying technical documentation (Python, React, FastAPI) with enterprise-grade architecture patterns, multiple retrieval strategies, and accurate source citations.

What Makes This Different?

🎯 Hybrid Search: Combines semantic (vector) and keyword (BM25) search with RRF fusion
🔍 HyDE Query Transformation: Hypothetical Document Embeddings for improved retrieval
📊 Intelligent Reranking: Cohere API integration for relevance optimization
📖 Source Citation: Precise page numbers and document references
💬 Conversational Memory: Multi-turn conversation context
🏗️ Clean Architecture: SOLID principles, dependency injection, comprehensive testing
🐳 Production Ready: Docker containerization, CI/CD pipelines, monitoring

✨ Features

Core Capabilities

Multi-Source Ingestion: Scrapes and processes Python, React, and FastAPI official documentation
Advanced Chunking: Semantic and recursive text splitting strategies
Hybrid Retrieval:
- Semantic search (cosine similarity)
- Keyword search (BM25)
- RRF-based fusion
Query Enhancement: HyDE transformation for better retrieval
Smart Reranking: Cohere cross-encoder for relevance scoring
Citation Tracking: Page numbers and source URLs preserved
Conversational Context: Last N turns memory management

Technical Highlights

SOLID Principles: Clean, maintainable, extensible codebase
Type Safety: Comprehensive type hints with mypy validation
Testing: Unit and integration tests with pytest
Documentation: Detailed docstrings (Google style) and architecture docs
CI/CD: GitHub Actions for linting, testing, and deployment
Observability: Structured logging with loguru
Containerization: Docker and docker-compose setup

🎥 Demo

Live Demo: docschat-rag.streamlit.app (coming soon)

Example Query

User: "How do I handle CORS in FastAPI?"

DocsChat:
To handle CORS in FastAPI, use the CORSMiddleware:

```python
from fastapi import FastAPI
from fastapi.middleware.cors import CORSMiddleware

app = FastAPI()

app.add_middleware(
    CORSMiddleware,
    allow_origins=["*"],
    allow_credentials=True,
    allow_methods=["*"],
    allow_headers=["*"],
)

Sources:

FastAPI Documentation: Advanced User Guide > CORS (Page 47)
Official Docs Link


---

## 🚀 Quick Start

### Prerequisites

- Python 3.11+
- Poetry (recommended) or pip
- OpenAI API key
- Cohere API key (optional, for reranking)

### Installation

```bash
# Clone repository
git clone https://github.com/RomanRosa/docschat-rag.git
cd docschat-rag

# Install dependencies with Poetry
poetry install

# OR with pip
pip install -r requirements.txt

# Setup environment variables
cp .env.example .env
# Edit .env with your API keys

Environment Variables

# .env
OPENAI_API_KEY=your_openai_api_key_here
COHERE_API_KEY=your_cohere_api_key_here

# LLM Configuration
LLM_MODEL=gpt-4o-mini
EMBEDDING_MODEL=text-embedding-3-small
TEMPERATURE=0.0

# Retrieval Configuration
TOP_K=10
RERANK_TOP_K=3
CHUNK_SIZE=1000
CHUNK_OVERLAP=200

# Vector Store
CHROMADB_PERSIST_DIR=./data/vectorstore

Run Locally

# 1. Ingest documentation (one-time setup)
poetry run python scripts/ingest_docs.py --sources python react fastapi

# 2. Build vector index
poetry run python scripts/rebuild_index.py

# 3. Launch Streamlit UI
poetry run streamlit run src/ui/app.py

Access at: http://localhost:8501

Docker Quick Start

# Build and run with docker-compose
docker-compose up -d

# Access at http://localhost:8501

🏗️ Architecture

High-Level Design

┌─────────────┐
│   User UI   │  Streamlit Chat Interface
└──────┬──────┘
       │
       ▼
┌─────────────────────────────────────────────┐
│           RAG Pipeline Orchestrator         │
│  ┌─────────────────────────────────────┐    │
│  │  Query Processor                    │    │
│  │  - Validation                       │    │
│  │  - HyDE Transformation              │    │
│  └────────────┬────────────────────────┘    │
│               │                             │
│  ┌────────────▼────────────────────────┐    │
│  │  Hybrid Retriever                   │    │
│  │  ┌──────────────┐  ┌──────────────┐ │    │
│  │  │  Semantic    │  │  Keyword     │ │    │
│  │  │  (Vector)    │  │  (BM25)      │ │    │
│  │  └──────┬───────┘  └──────┬───────┘ │    │
│  │         │                  │        │    │
│  │         └────────┬─────────┘        │    │
│  │                  │                  │    │
│  │         ┌────────▼─────────┐        │    │
│  │         │  RRF Fusion      │        │    │
│  │         └────────┬─────────┘        │    │
│  └──────────────────┼─────────────────┘     │
│                     │                       │
│  ┌──────────────────▼─────────────────┐     │
│  │  Cohere Reranker                   │     │
│  └──────────────────┬─────────────────┘     │
│                     │                       │
│  ┌──────────────────▼─────────────────┐     │
│  │  LLM Generator (GPT-4o-mini)       │     │
│  │  - Prompt with context             │     │
│  │  - Citation injection              │     │
│  │  - Memory management               │     │
│  └────────────────────────────────────┘     │
└─────────────────────────────────────────────┘

Key Components

Module	Responsibility	Key Classes
`ingestion/`	Document scraping, parsing, chunking	`PythonDocsIngester`, `SemanticChunker`
`vectorization/`	Embeddings generation, vector storage	`OpenAIEmbedder`, `ChromaVectorStore`
`retrieval/`	Search strategies, reranking	`HybridRetriever`, `CohereReranker`
`generation/`	LLM calls, prompt templating	`OpenAIGenerator`, `ConversationalMemory`
`pipeline/`	End-to-end orchestration	`RAGPipeline`, `QueryProcessor`
`ui/`	Streamlit interface	`ChatInterface`, `SourcePanel`

See ARCHITECTURE.md for detailed design documentation.

📚 Documentation

Architecture Guide: System design, component diagrams, design decisions
API Reference: Module APIs, class references
Development Guide: Local setup, testing, contributing
Deployment Guide: Docker, Streamlit Cloud, AWS deployment

🧪 Testing

# Run all tests
poetry run pytest

# Run with coverage
poetry run pytest --cov=src --cov-report=html

# Run specific test module
poetry run pytest tests/unit/test_retrieval.py

# Run integration tests
poetry run pytest tests/integration/

Test Coverage

Current coverage: 92% (target: 90%+)

🛠️ Development

Setup Development Environment

# Install dev dependencies
poetry install --with dev

# Setup pre-commit hooks
pre-commit install

# Run linting
poetry run ruff check src/
poetry run black src/
poetry run mypy src/

# Format code
poetry run black src/

Code Style

Formatter: Black (line length: 100)
Linter: Ruff (replaces flake8, isort, pylint)
Type Checker: Mypy (strict mode)
Docstrings: Google style
Commits: Conventional Commits

📦 Tech Stack

Category	Technology
Framework	LangChain 0.1.0
LLM	OpenAI GPT-4o-mini
Embeddings	OpenAI text-embedding-3-small
Vector DB	ChromaDB
Reranking	Cohere API
UI	Streamlit
Testing	Pytest, pytest-cov
CI/CD	GitHub Actions
Containerization	Docker, docker-compose
Logging	Loguru

🗺️ Roadmap

Phase 1: MVP ✅ (Current)

Phase 2: Enhancement 🚧 (In Progress)

HyDE query transformation
Advanced reranking
Conversational memory
Performance benchmarking

Phase 3: Production 📋 (Planned)

Phase 4: Advanced 🔮 (Future)

Multi-modal support (code screenshots)
Custom fine-tuned embeddings
Graph RAG integration
Real-time doc updates

🤝 Contributing

Contributions are welcome! Please see CONTRIBUTING.md for guidelines.

Development Workflow

Fork the repository
Create feature branch (git checkout -b feature/amazing-feature)
Commit changes (git commit -m 'feat(retrieval): add amazing feature')
Push to branch (git push origin feature/amazing-feature)
Open Pull Request

📄 License

This project is licensed under the MIT License - see LICENSE file for details.

🙏 Acknowledgments

LangChain for RAG framework
ChromaDB for vector storage
OpenAI for LLM and embeddings
Cohere for reranking API
Streamlit for rapid UI development

📞 Contact

Francisco Román Peña de la Rosa

⭐ Star History

If this project helped you, please consider giving it a ⭐!

Made with ❤️ by Roman de la Rosa

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

DocsChat RAG 🚀

📋 Overview

What Makes This Different?

✨ Features

Core Capabilities

Technical Highlights

🎥 Demo

Example Query

Environment Variables

Run Locally

Docker Quick Start

🏗️ Architecture

High-Level Design

Key Components

📚 Documentation

🧪 Testing

Test Coverage

🛠️ Development

Setup Development Environment

Code Style

📦 Tech Stack

🗺️ Roadmap

Phase 1: MVP ✅ (Current)

Phase 2: Enhancement 🚧 (In Progress)

Phase 3: Production 📋 (Planned)

Phase 4: Advanced 🔮 (Future)

🤝 Contributing

Development Workflow

📄 License

🙏 Acknowledgments

📞 Contact

⭐ Star History

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages