EVRAG: Retrieval-Augmented Generation (RAG) with Automated Evaluation using DeepEval

End-to-end RAG pipeline with document ingestion, retrieval, LLM answer generation, and quantitative evaluation (Context Recall, Answer Relevancy, Faithfulness, Precision).

Overview

This project implements a Retrieval-Augmented Generation (RAG) system that answers domain-specific questions about EV charging from relevant PDF documents (e.g., reports). It includes an evaluation pipeline powered by DeepEval to measure:

Answer Relevancy — how semantically correct the model’s answer is,
Faithfulness — whether the answer is grounded in the retrieved context (no hallucinations),
Contextual Precision & Recall — how good the retriever is at finding relevant chunks.

The project is modular, reproducible, and structured for portfolio presentation.

Architecture

Project Structure

Features

Component	Description
PDFLoader	Extracts and preprocesses text from PDFs.
TextChunker	Splits text into overlapping semantic chunks for retrieval.
VectorStore (FAISS)	Stores embeddings for fast semantic similarity search.
LLMEngine	Generates final answers using a Google Gemini model.
DeepEval Integration	Provides automated evaluation metrics: Relevancy, Faithfulness, Precision, Recall.
Validation Reports	Produces `evaluation_results.xlsx` with all metric scores.
CLI Interface	Allows users to run inference or validation using command-line arguments.

Installation

1. Clone the repo

git clone https://github.com/hrz1365/EVRAG.git
cd EVRAG

2. Create and activate a virtual environment

python -m venv venv
source venv/bin/activate      # on macOS/Linux
venv\Scripts\activate         # on Windows

3. Install dependencies

pip install -r requirements.txt

4. Set your environment variable for Gemini

export GEMINI_API_KEY="your_api_key_here"     # macOS/Linux
setx GEMINI_API_KEY "your_api_key_here"       # Windows

Usage

1. Ask questions interactively

python main.py --mode query --query "What type of facility has power demand equal to around 5 MW?"

Answer: an outdoor professional sports stadium

2. Run full evaluation

python main.py --mode 'validate' --pdf './data/' --query 'What type of facility has power demand euqal to around 40 MW?' --validation './data/validation_set.xlsx' --report './data/validation_results.xlsx'

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github/workflows		.github/workflows
data		data
docs		docs
src		src
vector_store.faiss		vector_store.faiss
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
docker-compose.yml		docker-compose.yml
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EVRAG: Retrieval-Augmented Generation (RAG) with Automated Evaluation using DeepEval

Overview

Architecture

Project Structure

Features

Installation

1. Clone the repo

2. Create and activate a virtual environment

3. Install dependencies

4. Set your environment variable for Gemini

Usage

1. Ask questions interactively

2. Run full evaluation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

EVRAG: Retrieval-Augmented Generation (RAG) with Automated Evaluation using DeepEval

Overview

Architecture

Project Structure

Features

Installation

1. Clone the repo

2. Create and activate a virtual environment

3. Install dependencies

4. Set your environment variable for Gemini

Usage

1. Ask questions interactively

2. Run full evaluation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages