RAG Project

This project implements a Retrieval Augmented Generation (RAG) pipeline that allows you to ask questions about PDF documents.

Project Structure

rag_pipeline.py - Python script implementing the RAG pipeline
rag_pipeline.ipynb - Jupyter Notebook version of the RAG pipeline
rag_api_helper.py - Helper script for the API
api/ - Express API server that exposes the RAG pipeline via HTTP
chroma_db_meta/ - Directory containing the ChromaDB vector database (created after running the pipeline)

Getting Started

Prerequisites

Python 3.10+ with pip
Node.js 16+ with npm
Ollama installed and running with the mistral model

Setup

Create a Python virtual environment and install dependencies:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install langchain langchain-community langchain-classic chromadb pypdf ollama ipywidgets

Make sure you have a PDF document named ai.pdf in the project root directory.
Run the Jupyter notebook to build the RAG pipeline and create the ChromaDB vector database:

jupyter notebook rag_pipeline.ipynb

Run all cells in the notebook

Using the RAG Pipeline

There are three ways to use the RAG pipeline:

1. Jupyter Notebook

Open rag_pipeline.ipynb and use the interactive Q&A cells at the bottom of the notebook.

2. Python Script

Run the Python script from the command line:

python rag_pipeline.py

3. API Server

Start the Express API server:

cd api
npm install
npm start

Then query the API:

curl -X POST http://localhost:3000/ask -H "Content-Type: application/json" -d '{"question":"What is the main concept of the document?"}'

Or use the test script:

node api/test-api.js "Your question here"

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
api		api
.gitignore		.gitignore
README.md		README.md
ai.pdf		ai.pdf
rag_api_helper.py		rag_api_helper.py
rag_pipeline.ipynb		rag_pipeline.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RAG Project

Project Structure

Getting Started

Prerequisites

Setup

Using the RAG Pipeline

1. Jupyter Notebook

2. Python Script

3. API Server

About

Uh oh!

Releases

Packages

Languages

muthutm1701/pdf-qa-rag-langchain

Folders and files

Latest commit

History

Repository files navigation

RAG Project

Project Structure

Getting Started

Prerequisites

Setup

Using the RAG Pipeline

1. Jupyter Notebook

2. Python Script

3. API Server

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages