A Retrieval-Augmented Generation (RAG) system for querying documents using vector embeddings and LLMs.
This project demonstrates a simple implementation of a RAG system that:
- Processes and extracts data from PDF documents
- Splits them into chunks and generates vector embeddings
- Stores these embeddings in a Chroma vector database
- Allows users to query the system in natural language
- Retrieves relevant context and generates accurate answers
## Requirements

- Python 3.8+
- Ollama (for embeddings and inference)
## Installation

- Clone this repository:

```bash
git clone https://github.com/emarashliev/simple-rag.git
cd simple-rag
```

- Install the required packages:

```bash
pip install -r requirements.txt
```

- Make sure Ollama is installed and running on your system. Visit Ollama's website for installation instructions.

```bash
ollama pull mistral
ollama serve
```

## Usage

### Populating the Database

Before querying, you need to populate the vector database with your documents:
```bash
python populate_database.py --data_path data --chunk_size 500 --chunk_overlap 50
```

Options:

- `--data_path`: Directory containing PDF documents (default: `data`)
- `--chunk_size`: Size of text chunks (default: 500)
- `--chunk_overlap`: Overlap between chunks (default: 50)
- `--clear_db`: Clear the existing database before adding new documents
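For illustration, the chunking parameters above can be sketched in plain Python. This is a simplified stand-in for the splitter inside `populate_database.py`; the `split_text` helper is hypothetical:

```python
from typing import List

def split_text(text: str, chunk_size: int = 500, chunk_overlap: int = 50) -> List[str]:
    """Split text into fixed-size chunks where consecutive chunks
    share `chunk_overlap` characters (hypothetical stand-in)."""
    if chunk_overlap >= chunk_size:
        raise ValueError("chunk_overlap must be smaller than chunk_size")
    step = chunk_size - chunk_overlap  # distance between chunk start positions
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

chunks = split_text("a" * 1200)
print(len(chunks))  # → 3 (chunks start at offsets 0, 450, and 900)
```

A larger overlap reduces the chance that a sentence is cut in half at a chunk boundary, at the cost of storing more redundant text.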
### Querying

To ask questions:

```bash
python query_data.py --query "How many players can play Monopoly?" --model mistral
```

Options:

- `--query`: The question you want to ask
- `--model`: The Ollama model to use (default: `mistral`)
- `--k`: Number of similar documents to retrieve (default: 4)
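The `--k` option controls how many chunks are pulled back by similarity search. Chroma performs this internally; as a conceptual sketch (with hypothetical helper functions), top-k retrieval by cosine similarity looks like:

```python
import math
from typing import List

def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Cosine of the angle between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def top_k(query_vec: List[float], doc_vecs: List[List[float]], k: int = 4) -> List[int]:
    """Indices of the k document vectors most similar to the query."""
    ranked = sorted(range(len(doc_vecs)),
                    key=lambda i: cosine_similarity(query_vec, doc_vecs[i]),
                    reverse=True)
    return ranked[:k]

docs = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7], [-1.0, 0.0]]
print(top_k([1.0, 0.1], docs, k=2))  # → [0, 2]
```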
## Testing

To run the test suite:

```bash
pytest test_rag.py
```

## Project Structure

- `data/`: Contains PDF documents
- `chroma/`: Vector database storage (created during database population)
- `populate_database.py`: Script to process documents and populate the database
- `query_data.py`: Script to query the RAG system
- `get_embedding_function.py`: Provides embedding functionality using Ollama
- `test_rag.py`: Tests for validating the RAG system's responses
## How It Works

- Document Processing: PDFs are loaded, parsed, and split into chunks with overlaps
- Embedding Generation: Text chunks are converted to vector embeddings
- Vector Storage: Embeddings are stored in a Chroma vector database
- Retrieval: When a query is received, the system finds semantically similar chunks
- Response Generation: Retrieved context is sent to an LLM along with the query to generate a response
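The last two steps — combining the retrieved chunks and the question into a single prompt — can be sketched as follows. The actual template in `query_data.py` may differ; `build_prompt` is illustrative only:

```python
from typing import List

PROMPT_TEMPLATE = """Answer the question based only on the following context:

{context}

---
Question: {question}
"""

def build_prompt(question: str, retrieved_chunks: List[str]) -> str:
    """Join retrieved chunks and interpolate them with the question
    (hypothetical stand-in for the prompt assembly in query_data.py)."""
    context = "\n\n".join(retrieved_chunks)
    return PROMPT_TEMPLATE.format(context=context, question=question)

print(build_prompt("How many players can play Monopoly?",
                   ["Monopoly is played by 2 to 8 players.",
                    "Players take turns rolling two dice."]))
```

Grounding the LLM in the retrieved context this way is what lets it answer questions about documents it was never trained on.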
## Adding More Data

To add more data:

- Add PDFs to the `data/` directory
- Run `populate_database.py` with the `--clear_db` flag if you want to rebuild the entire database
## License

MIT