LangChain RAG Tutorial

A comprehensive, production-ready tutorial for building Retrieval-Augmented Generation (RAG) systems using LangChain.

🚀 Quick Start

# Clone repository
git clone https://github.com/gianlucamazza/langchain-rag-tutorial.git
cd langchain-rag-tutorial

# Setup environment
python3 -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate
pip install -r requirements.txt

# Configure API key
echo "OPENAI_API_KEY=sk-proj-your-key-here" > .env

# Start learning
jupyter notebook notebooks/00_index.ipynb

📖 Full guide: docs/GETTING_STARTED.md
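
Before opening the notebooks, it can help to confirm that the key written to `.env` is no longer the placeholder. The following is a minimal sketch in plain Python; the helper names `parse_env` and `key_looks_configured` are hypothetical and are not part of the repository's `shared` module, which may load the file differently.

```python
from pathlib import Path
from typing import Dict

PLACEHOLDER = "sk-proj-your-key-here"  # the example value from the Quick Start


def parse_env(text: str) -> Dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values: Dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def key_looks_configured(env: Dict[str, str]) -> bool:
    """True when OPENAI_API_KEY is present and is not the placeholder."""
    key = env.get("OPENAI_API_KEY", "")
    return bool(key) and key != PLACEHOLDER


if __name__ == "__main__":
    env_file = Path(".env")
    env = parse_env(env_file.read_text()) if env_file.exists() else {}
    print("API key configured:", key_looks_configured(env))
```

This only checks that a value is present; it does not validate the key against the OpenAI API.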

📚 What You'll Learn

Fundamentals (30-40 min)

Master the core concepts of RAG:

Document Loading & Splitting - Process and chunk text efficiently
Embeddings Comparison - OpenAI vs HuggingFace benchmarks
Simple RAG - Build your first end-to-end RAG system

📘 Start with Fundamentals →
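
To make the splitting step concrete, here is a minimal sketch of fixed-size chunking with overlap in plain Python. It is not the tutorial's loader or a LangChain splitter; the function name `split_with_overlap` and the sizes used are assumptions chosen for illustration.

```python
from typing import List


def split_with_overlap(text: str, chunk_size: int = 200, overlap: int = 40) -> List[str]:
    """Split text into character windows that share `overlap` characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")
    step = chunk_size - overlap  # how far the window advances each time
    chunks: List[str] = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # this window already reached the end of the text
    return chunks


if __name__ == "__main__":
    sample = "abcdefghij" * 5  # 50 characters of hypothetical input
    for chunk in split_with_overlap(sample, chunk_size=20, overlap=5):
        print(len(chunk), chunk)
```

The overlap keeps a sentence that straddles a boundary visible in both neighboring chunks, at the cost of embedding some text twice.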

Advanced Architectures (4-5 hours)

Work through 15 notebooks (04-18) of production-ready patterns:

Architecture	Complexity	Use Case	Key Feature
Memory RAG	⭐⭐	Chatbots	Conversation history
Branched RAG	⭐⭐⭐	Research	Multi-query parallel retrieval
HyDE	⭐⭐⭐	Ambiguous queries	Hypothetical documents
Adaptive RAG	⭐⭐⭐⭐	Mixed workloads	Intelligent query routing
Corrective RAG	⭐⭐⭐⭐	High accuracy	Quality check + web fallback
Self-RAG	⭐⭐⭐⭐⭐	Self-correcting	Autonomous refinement
Agentic RAG	⭐⭐⭐⭐⭐	Complex reasoning	Multi-tool agent loops
Contextual RAG ✨	⭐⭐⭐	Technical docs	Context-augmented chunks
Fusion RAG ✨	⭐⭐⭐	Best ranking	Reciprocal Rank Fusion
SQL RAG ✨	⭐⭐⭐⭐	Analytics/BI	Natural Language to SQL
GraphRAG ✨	⭐⭐⭐⭐⭐	Knowledge graphs	Entity relationships + multi-hop
Multimodal RAG 🆕	⭐⭐⭐⭐	Images + text	GPT-4 Vision + OCR
Fine-tuning Guide 🆕	⭐⭐⭐⭐	Domain embeddings	Custom embedding models
RAGAS Evaluation	-	Quality metrics	Comprehensive RAG assessment

🔬 Explore Advanced Patterns →
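
The table lists Reciprocal Rank Fusion as the key feature of Fusion RAG. As a rough illustration of that scoring rule, each document receives the sum of `1 / (k + rank)` over every ranked list it appears in. The sketch below is standalone Python and is not taken from notebook 13; the function name and the sample document ids are hypothetical, and `k = 60` is a commonly used default rather than a value confirmed by the tutorial.

```python
from collections import defaultdict
from typing import Dict, List


def reciprocal_rank_fusion(rankings: List[List[str]], k: int = 60) -> List[str]:
    """Fuse several ranked lists of document ids into one ranking."""
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by document id for determinism
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


if __name__ == "__main__":
    # Hypothetical results from three query variants
    results = [
        ["a", "b", "c"],
        ["b", "a", "d"],
        ["b", "c", "a"],
    ]
    print(reciprocal_rank_fusion(results))
```

Because only ranks are used, the lists can come from retrievers whose raw scores are not comparable.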

📖 Documentation

Comprehensive docs organized by topic:

🚀 Getting Started - 5-minute quick start
🛠️ Installation - Detailed setup guide
📚 API Reference - Shared module documentation
🏗️ Architecture - Design decisions
🐛 Troubleshooting - Common issues & solutions
⚡ Performance - Benchmarks & optimization
❓ FAQ - Frequently asked questions
🚀 Deployment - Production deployment
📝 Examples - Usage patterns
🤝 Contributing - Contribution guidelines
📜 Changelog - Version history

🏗️ Project Structure

llm_rag/
├── docs/                          # 📖 Modular documentation
│   ├── GETTING_STARTED.md        # Quick start guide
│   ├── INSTALLATION.md           # Setup instructions
│   ├── API_REFERENCE.md          # Shared module API
│   └── ... (8 more specialized docs)
├── notebooks/
│   ├── 00_index.ipynb            # 🎯 START HERE - Navigation hub
│   ├── fundamentals/             # Core RAG concepts (01-03)
│   │   ├── 01_setup_and_basics.ipynb
│   │   ├── 02_embeddings_comparison.ipynb
│   │   └── 03_simple_rag.ipynb
│   └── advanced_architectures/   # Advanced patterns (04-18)
│       ├── 04_rag_with_memory.ipynb
│       ├── 05_branched_rag.ipynb
│       ├── 06_hyde.ipynb
│       ├── 07_adaptive_rag.ipynb
│       ├── 08_corrective_rag.ipynb
│       ├── 09_self_rag.ipynb
│       ├── 10_agentic_rag.ipynb
│       ├── 11_comparison.ipynb           # All 12 architectures
│       ├── 12_contextual_rag.ipynb ✨     # v1.1.0
│       ├── 13_fusion_rag.ipynb ✨         # v1.1.0
│       ├── 14_sql_rag.ipynb ✨            # v1.1.0
│       ├── 15_graphrag.ipynb ✨           # v1.1.0
│       ├── 16_evaluation_ragas.ipynb ✨   # v1.1.0 - Quality metrics
│       ├── 17_multimodal_rag.ipynb 🆕    # v1.2.0 - Images + Text
│       └── 18_finetuning_embeddings.ipynb 🆕  # v1.2.0 - Custom embeddings
├── templates/                     # 🚀 Production deployment templates (NEW v1.2.0)
│   ├── fastapi/                  # REST API template
│   ├── streamlit/                # Web UI template
│   └── lambda/                   # AWS Lambda serverless
├── tests/                        # 🧪 Test suite (NEW v1.2.0)
│   ├── conftest.py              # pytest fixtures
│   ├── test_utils.py            # Utility tests
│   └── test_config.py           # Config tests
├── shared/                        # 🔧 Reusable utilities (1500+ lines)
│   ├── config.py                 # Configuration management
│   ├── utils.py                  # Utility functions
│   ├── loaders.py                # Document loading
│   └── prompts.py                # Prompt templates (30+ prompts)
├── data/                         # 💾 Vector stores, Chinook DB (gitignored)
├── Dockerfile                    # 🐳 Docker support (NEW v1.2.0)
├── docker-compose.yml            # 🐳 Full stack orchestration (NEW v1.2.0)
├── Makefile                      # 🛠️ Development commands (NEW v1.2.0)
├── pytest.ini                    # 🧪 Test configuration (NEW v1.2.0)
├── .pre-commit-config.yaml       # 🔍 Pre-commit hooks (NEW v1.2.0)
├── requirements.txt              # Production dependencies
├── requirements-dev.txt          # Development dependencies (NEW v1.2.0)
├── .env.example                  # 🔑 API key template
└── README.md                     # This file

✨ Key Features

Core Capabilities:

✅ 12 RAG Architectures - From simple to graph-based
✅ Multimodal RAG 🆕 - GPT-4 Vision + OCR for images + text
✅ Fine-tuning Guide 🆕 - Train custom domain-specific embeddings
✅ RAGAS Evaluation - Comprehensive quality metrics
✅ SQL & Graph Support - Structured data + relationships
✅ Modular Design - Reusable shared utilities (DRY)
✅ Vector Store Persistence - No re-embedding needed
✅ Comprehensive Benchmarks - Performance & cost analysis

Production-Ready Infrastructure: 🆕

✅ Docker Support - Multi-stage builds with Redis, Prometheus, Grafana
✅ 3 Deployment Templates - FastAPI, Streamlit, AWS Lambda
✅ Testing Suite - pytest with 70%+ coverage target
✅ CI/CD Pipelines - GitHub Actions (testing, linting, coverage)
✅ Development Tools - Makefile, pre-commit hooks, linting
✅ Security Best Practices - Non-root Docker, API key management

Technical Stack:

LangChain v1.0+ - Framework & LCEL
OpenAI GPT-4o-mini + GPT-4 Vision - Fast LLM + multimodal
FAISS - Vector similarity search
Sentence Transformers + Accelerate - Fine-tuning embeddings
NetworkX - Graph algorithms
SQLAlchemy - Database abstraction
RAGAS - RAG evaluation framework
Tesseract OCR + Pillow - Image processing
FastAPI + Streamlit - Production deployment
Docker + Docker Compose - Containerization
pytest + GitHub Actions - Testing & CI/CD
Python 3.9+ - Modern type hints

🔍 See Architecture Details →
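
For readers new to vector similarity search, the idea behind the retrieval step can be sketched as a brute-force cosine-similarity lookup. This is plain Python for illustration only and does not use FAISS, which relies on optimized index structures; the function names and the toy vectors are assumptions.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query: Sequence[float], vectors: List[Sequence[float]], k: int = 2) -> List[Tuple[int, float]]:
    """Return (index, similarity) pairs for the k vectors closest to the query."""
    scored = [(i, cosine_similarity(query, v)) for i, v in enumerate(vectors)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:k]


if __name__ == "__main__":
    # Toy 2-dimensional "embeddings"; real embeddings have hundreds of dimensions
    store = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    print(top_k([1.0, 0.0], store, k=2))
```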

💡 Architecture Selection Guide

Choose based on your needs:

Your Need	Architecture	Docs
🚀 Fast & simple	Simple RAG	03_simple_rag.ipynb
💬 Chatbot with memory	Memory RAG	04_rag_with_memory.ipynb
📚 Research tool	Fusion RAG	13_fusion_rag.ipynb ✨
🔍 Ambiguous queries	Contextual RAG	12_contextual_rag.ipynb ✨
⚖️ Cost optimization	Adaptive RAG	07_adaptive_rag.ipynb
🎯 High accuracy	Fusion / CRAG	13_fusion_rag.ipynb ✨
🔄 Self-correcting	Self-RAG	09_self_rag.ipynb
🤖 Complex reasoning	Agentic RAG	10_agentic_rag.ipynb
📊 Analytics/BI	SQL RAG	14_sql_rag.ipynb ✨
🕸️ Knowledge graphs	GraphRAG	15_graphrag.ipynb ✨
🖼️ Images + text 🆕	Multimodal RAG	17_multimodal_rag.ipynb 🆕
🎯 Custom embeddings 🆕	Fine-tuning Guide	18_finetuning_embeddings.ipynb 🆕
📈 Quality evaluation	RAGAS	16_evaluation_ragas.ipynb ✨

Rule of thumb: Start with Simple RAG → add Contextual RAG for quality → move to a specialized architecture for specific needs.

Performance tip: If accuracy on domain-specific content is below 75%, consider fine-tuning embeddings (notebook 18) for a potential improvement of 15-25%.

❓ Need help choosing? See FAQ →

🚀 Production Deployment 🆕

Ready to deploy? Choose from 3 production-ready templates:

🐳 Docker (Recommended)

# Quick start with Docker Compose
docker-compose up -d

# Or build custom image
docker build -t langchain-rag:latest .
docker run -p 8000:8000 --env-file .env langchain-rag:latest

Features: Multi-stage builds, Redis caching, Prometheus + Grafana monitoring, non-root security

🚂 FastAPI REST API

Complete REST API with automatic documentation:

cd templates/fastapi
pip install -r requirements.txt
uvicorn app:app --reload
# Visit http://localhost:8000/docs for Swagger UI

Features: Pydantic validation, CORS, health checks, error handling, async support

🎨 Streamlit Web UI

Interactive web application:

cd templates/streamlit
pip install -r requirements.txt
streamlit run streamlit_app.py

Features: Real-time queries, source document display, architecture selection, metrics visualization

⚡ AWS Lambda (Serverless)

Deploy to AWS Lambda:

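cd templates/lambda
zip -r function.zip .
# --runtime, --handler, and --role are required for zip deployments; the values below are placeholders
aws lambda create-function --function-name rag-api \
  --runtime python3.11 --handler YOUR_MODULE.YOUR_HANDLER \
  --role YOUR_EXECUTION_ROLE_ARN --zip-file fileb://function.zip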

Features: S3 vector store integration, cold start optimization, API Gateway ready

📖 Full guide: docs/DEPLOYMENT.md

📊 Performance at a Glance

Architecture	Latency	Cost/Query	Accuracy	Best For
Simple RAG	~2s	$0.002	Good	General Q&A
Contextual RAG ✨	~2-3s	$0.002	Very Good	Technical docs
Fusion RAG ✨	~5-8s	$0.006	Excellent	Research
SQL RAG ✨	~2-5s	$0.004	Perfect*	Analytics
GraphRAG ✨	~3-8s	$0.010+	Excellent**	Relationships
Adaptive RAG	Variable	$0.003	Very Good	Mixed workloads
Agentic RAG	~30s	$0.012	Excellent	Complex tasks

*For structured data | **For multi-hop queries
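
The Cost/Query column can be turned into a back-of-the-envelope monthly estimate. The sketch below copies the per-query figures from the table; the function name `monthly_cost` and the traffic volume in the example are hypothetical, and real costs depend on model pricing and prompt length.

```python
# Cost per query in USD, copied from the table above
COST_PER_QUERY = {
    "Simple RAG": 0.002,
    "Contextual RAG": 0.002,
    "Fusion RAG": 0.006,
    "SQL RAG": 0.004,
    "GraphRAG": 0.010,  # listed as "$0.010+", so treat this as a lower bound
    "Adaptive RAG": 0.003,
    "Agentic RAG": 0.012,
}


def monthly_cost(architecture: str, queries_per_day: int, days: int = 30) -> float:
    """Estimated spend in USD for a given architecture and daily query volume."""
    return round(COST_PER_QUERY[architecture] * queries_per_day * days, 2)


if __name__ == "__main__":
    # Hypothetical workload of 1,000 queries per day
    for name in COST_PER_QUERY:
        print(f"{name}: ${monthly_cost(name, 1000):.2f} per month")
```

At that hypothetical volume, the gap between Simple RAG and Agentic RAG is a factor of six, which is the kind of difference the latency and accuracy columns need to justify.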

Full benchmarks: 11_comparison.ipynb | RAGAS Evaluation

🚦 Prerequisites

Python 3.9+ (3.10+ recommended)
OpenAI API Key (Get one here)
~2GB RAM (4GB+ recommended for fine-tuning)
~2GB disk space (dependencies + models)
System packages (for multimodal): Tesseract OCR, Poppler (see INSTALLATION.md)

📖 Detailed requirements →

🎓 Learning Path

Recommended sequence:

Setup (10 min): GETTING_STARTED.md
Navigation Hub (5 min): 00_index.ipynb
Fundamentals (30-40 min): Notebooks 01-03
Choose Your Path:
- 🏃 Fast track: Simple RAG → Contextual RAG → Your use case
- 🔬 Deep dive: Complete all 12 architectures
- 📊 Comparison: Jump to 11_comparison.ipynb
- 📈 Evaluation: Try 16_evaluation_ragas.ipynb

Total time:

Fast track: ~1-2 hours
Complete tutorial: ~5-7 hours
With multimodal + evaluation: ~7-9 hours
Production deployment: +1-2 hours

🤝 Contributing

Contributions welcome! See CONTRIBUTING.md for guidelines.

Ways to contribute:

🐛 Report bugs
✨ Suggest features
📝 Improve documentation
💻 Submit pull requests

🛠️ Development 🆕

Quick commands with Makefile:

make install      # Install all dependencies
make test         # Run test suite with coverage
make lint         # Run code quality checks
make format       # Auto-format code (black, isort)
make docker-build # Build Docker image
make docker-run   # Run full stack with Docker Compose
make clean        # Clean cache and build files

Testing:

# Run all tests with coverage
pytest tests/ -v --cov=shared --cov-report=html

# Run specific test file
pytest tests/test_utils.py -v

# Run with markers
pytest -m "not slow"  # Skip slow tests

Pre-commit hooks:

# Install hooks (runs on every commit)
pre-commit install

# Run manually
pre-commit run --all-files

CI/CD: Automated testing and linting on every push via GitHub Actions (Python 3.9, 3.10, 3.11)

📖 More details: docs/CONTRIBUTING.md

📜 License

MIT License - see LICENSE for details.

TL;DR: Free to use commercially, modify, and distribute. Just include the license.

🔗 Resources

📖 Documentation: docs/
🐛 Issues: GitHub Issues
💬 Discussions: GitHub Discussions
🌐 LangChain Docs: python.langchain.com

💬 Getting Help

📖 Check FAQ first
🔍 Search existing issues
🐛 Open a new issue
💬 Ask in Discussions

⭐ If this helps you, please star the repo!

Latest: v1.2.1 - Critical import fixes, Python 3.9 compatibility, accelerate support | Made with ❤️ using Claude Code | View Changelog
