This project implements a Retrieval-Augmented Generation (RAG) chatbot capable of interacting with PDF documents. It leverages LlamaIndex for document indexing and retrieval, and Ollama for running embedding and generation models locally.
| 🤖 Model Support | Implemented | Description |
|---|---|---|
| Ollama (e.g. Llama3) | ✅ | Local Embedding and Generation Models powered by Ollama |
| OpenAI (e.g. GPT-4) | ✅ | Embedding and Generation Models by OpenAI |
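
Both backends plug into LlamaIndex as interchangeable LLMs. A minimal sketch using LlamaIndex's official integrations (model names and the timeout are illustrative; the repo's actual wiring may differ):

```python
from llama_index.llms.ollama import Ollama
from llama_index.llms.openai import OpenAI

# Local generation via the Ollama server (default endpoint: http://localhost:11434)
llm = Ollama(model="llama3", request_timeout=120.0)

# Or hosted generation via OpenAI (requires an API key in the environment)
llm = OpenAI(model="gpt-4")
```
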
| 🤖 Embedding Support | Implemented | Description |
|---|---|---|
| Ollama | ✅ | Local Embedding Models powered by Ollama |
| OpenAI | ✅ | Embedding Models by OpenAI |
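
Embedding models are configured the same way; a sketch with the two supported backends (parameters shown are assumptions, not this repo's exact settings):

```python
from llama_index.embeddings.ollama import OllamaEmbedding
from llama_index.embeddings.openai import OpenAIEmbedding

# Local embeddings served by Ollama
embed_model = OllamaEmbedding(model_name="llama3", base_url="http://localhost:11434")

# Or OpenAI-hosted embeddings
embed_model = OpenAIEmbedding(model="text-embedding-3-small")
```
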
| 📁 Data Support | Implemented | Description |
|---|---|---|
| PDF Ingestion | ✅ | Import PDF |
| CSV/XLSX Ingestion | planned ⏱️ | Import Table Data |
| .DOCX | planned ⏱️ | Import .docx files |
| Multi-Modal | planned ⏱️ | Import and Transcribe Audio through AssemblyAI |
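
PDF ingestion in LlamaIndex typically goes through `SimpleDirectoryReader`; a minimal sketch (the `data/` directory is an assumed location):

```python
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex

# Load only PDFs from the data directory and build a vector index over them
documents = SimpleDirectoryReader(input_dir="data", required_exts=[".pdf"]).load_data()
index = VectorStoreIndex.from_documents(documents)
```
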
| ✨ RAG Features | Implemented | Description |
|---|---|---|
| Hybrid Search | ✅ | Semantic Search combined with Keyword Search |
| Router | ✅ | Routes retrieval based on your query (summary vs. specific contexts) |
| Query Transformations | planned ⏱️ | Enhance retrieval by refining queries for improved relevance and accuracy. |
| Filtering | ✅ | Apply filters (e.g. by document or document type) before performing RAG |
| Reranking | ✅ | Rerank retrieved results based on context for improved relevance |
| RAG Evaluation | ✅ | Interface for Evaluating RAG pipelines |
| Agentic RAG | out of scope ❌ | Agentic RAG pipelines |
| Graph RAG | out of scope ❌ | Graph-based RAG pipelines |
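
To illustrate how hybrid search and reranking fit together, here is a hedged sketch built from standard LlamaIndex components (top-k values, the query string, and the reranker model are illustrative; `index` is the vector index from the ingestion sketch above, and the repo's actual pipeline may differ):

```python
from llama_index.core.retrievers import QueryFusionRetriever
from llama_index.core.postprocessor import SentenceTransformerRerank
from llama_index.retrievers.bm25 import BM25Retriever

# Semantic (vector) retrieval combined with keyword (BM25) retrieval
vector_retriever = index.as_retriever(similarity_top_k=10)
bm25_retriever = BM25Retriever.from_defaults(docstore=index.docstore, similarity_top_k=10)
hybrid = QueryFusionRetriever(
    [vector_retriever, bm25_retriever], similarity_top_k=10, num_queries=1
)

# Rerank the fused candidates with a cross-encoder for better final ordering
reranker = SentenceTransformerRerank(
    model="cross-encoder/ms-marco-MiniLM-L-6-v2", top_n=3
)
query = "your question"
nodes = reranker.postprocess_nodes(hybrid.retrieve(query), query_str=query)
```
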
| 🗡️ Chunking Techniques | Implemented | Description |
|---|---|---|
| Sentence | ✅ | Chunk by Sentence |
| Semantic | ✅ | Chunk and group by semantic sentence similarity |
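
Both techniques map to LlamaIndex node parsers; a minimal sketch (chunk sizes and thresholds are illustrative assumptions, and `documents` comes from the ingestion sketch above):

```python
from llama_index.core.node_parser import SentenceSplitter, SemanticSplitterNodeParser
from llama_index.embeddings.ollama import OllamaEmbedding

# Sentence chunking: split on sentence boundaries up to a token budget
sentence_parser = SentenceSplitter(chunk_size=512, chunk_overlap=50)

# Semantic chunking: group adjacent sentences by embedding similarity
semantic_parser = SemanticSplitterNodeParser(
    buffer_size=1,
    breakpoint_percentile_threshold=95,
    embed_model=OllamaEmbedding(model_name="llama3"),
)

nodes = sentence_parser.get_nodes_from_documents(documents)
```
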
```bash
git clone https://github.com/johnPa02/local-rag-chat.git
cd local-rag-chat
pip install poetry
poetry install
cp .env.example .env
```
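If you plan to use the OpenAI models listed above, set your OpenAI API key in the newly created `.env`; the exact variable name to use is defined in `.env.example`.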
This project supports Ollama models. Download and install Ollama on your device (https://ollama.com/download) and make sure to install your preferred LLM with `ollama run <model>`.

Tested with `llama3`, `llama3:70b`, and `mistral`. Bigger models generally perform better but need more computational power.
Make sure the Ollama server is running in the background, and don't ingest documents with different Ollama models: their vector dimensions can vary, which will lead to errors. You can verify your setup by running the following command:

```bash
ollama run llama3
```
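
The dimension warning above can also be checked directly; a small sketch (model names are examples) that prints each model's embedding dimension:

```python
from llama_index.embeddings.ollama import OllamaEmbedding

# An index built with one embedding model cannot be queried with another
# whose vectors have a different dimension.
for model in ("llama3", "mistral"):
    dim = len(OllamaEmbedding(model_name=model).get_text_embedding("hello"))
    print(f"{model}: {dim} dimensions")
```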
- Before running the application, change the `OLLAMA_BASE_URL` in the `configs.py` file to `http://ollama_server:11434`, then start the app:

```bash
python app.py
```
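
For reference, the change amounts to pointing the client at the Docker service instead of localhost; a sketch of the relevant line (the actual `configs.py` layout may differ):

```python
# configs.py
# Use the Docker service name when Ollama runs inside the compose network;
# for a host-local Ollama install this is typically http://localhost:11434.
OLLAMA_BASE_URL = "http://ollama_server:11434"
```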
Run the following command to deploy the application using Docker:

```bash
docker-compose up --build
```
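The compose file is expected to define the `ollama_server` service that `OLLAMA_BASE_URL` points to, so the app container can reach Ollama over the compose network.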
