A Retrieval-Augmented Generation (RAG) document assistant built with Flask, Pinecone, Gemini Embeddings, Groq API, and Google Gemini. Upload PDFs, DOCX, TXT, or MD files and intuitively chat with them using modern AI models.
- **Multi-format Uploads**: Support for PDF, DOCX, TXT, and Markdown files.
- **Interactive Chat**: Query and chat with your documents using AI.
- **RAG-based Retrieval**: Fast and accurate semantic search using the Pinecone vector database.
- **Multiple LLM Support**: Powered by the Groq API (Llama 3) and Google Gemini.
- **Robust Authentication**: Supports Google OAuth as well as standard email/password login.
- **User Profiles**: Custom profile picture uploads and Google profile picture sync.
- **Data Isolation**: Per-user namespaces in Pinecone for complete privacy.
- **Admin Dashboard**: Admin panel to monitor users and uploaded files.
- **Data Management**: Intuitive UI to delete files and clear vector stores.
- **Responsive UI**: Minimal and modern front-end for a seamless user experience.
- **Lightweight & Cloud-Native**: Zero local ML models; all embeddings and LLM calls are cloud-based API calls, requiring minimal server RAM.
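As a minimal sketch of how the multi-format upload check might work (the helper name is illustrative, not the app's actual code):

```python
from pathlib import Path

# Formats the assistant accepts for ingestion.
ALLOWED_EXTENSIONS = {".pdf", ".docx", ".txt", ".md"}

def is_supported(filename: str) -> bool:
    """Return True if the uploaded file has a supported extension."""
    return Path(filename).suffix.lower() in ALLOWED_EXTENSIONS

assert is_supported("report.PDF")
assert not is_supported("archive.zip")
```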
| Layer | Technology |
|---|---|
| Backend | Flask (Python) |
| Authentication | Flask-Login + Flask-Dance (Google OAuth) |
| Embeddings | Google Gemini (gemini-embedding-001) |
| Vector Store | Pinecone (Serverless) |
| LLMs | Groq API (Llama 3.3 70B) & Google Gemini |
| User Database | MongoDB Atlas |
| Frontend | HTML, CSS, Vanilla JS |
```
RAG_App/
├── app.py               # Main Flask application & routes
├── models.py            # MongoDB user model & encrypted key storage
├── config.py            # Configuration & env variables
├── requirements.txt     # Python dependencies
├── render.yaml          # Render deployment blueprint
├── Dockerfile           # Docker containerization
├── .env.example         # Environment variable template
├── rag/
│   ├── chunker.py       # Document parsing & chunking logic
│   ├── embeddings.py    # Gemini embeddings + Pinecone upsert
│   ├── retriever.py     # Pinecone semantic search & retrieval
│   └── generator.py     # LLM integration for answer generation
├── templates/
│   ├── index.html       # File management & upload dashboard
│   ├── chat.html        # RAG chat interface
│   ├── login.html       # User login page
│   ├── register.html    # User registration page
│   ├── admin.html       # Admin dashboard
│   └── profile.html     # User profile & API key settings
├── static/              # Static assets (CSS, JS, profile_pics)
├── uploads/             # User-uploaded files (isolated per user)
└── .github/workflows/
    ├── devsecops.yml    # Security scanning pipeline
    └── deploy.yml       # Docker build & GHCR push pipeline
```
```bash
git clone https://github.com/param20h/PDF-Assistant-RAG.git
cd PDF-Assistant-RAG
python -m venv .venv

# Windows
.venv\Scripts\activate
# Linux/Mac
source .venv/bin/activate

pip install -r requirements.txt
```

Create a `.env` file using the template:

```bash
cp .env.example .env
```

Fill in the required server-side variables:
```env
SECRET_KEY=<your-secret-key>
ENCRYPTION_KEY=<your-fernet-key>
MONGO_URI=<your-mongodb-atlas-uri>
GOOGLE_CLIENT_ID=<your-google-client-id>
GOOGLE_CLIENT_SECRET=<your-google-client-secret>
```

Generate keys:
```bash
# SECRET_KEY
python -c "import secrets; print(secrets.token_hex(32))"
# ENCRYPTION_KEY
python -c "from cryptography.fernet import Fernet; print(Fernet.generate_key().decode())"
```
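The `ENCRYPTION_KEY` is a Fernet key used by `models.py` for encrypted key storage. A minimal sketch of how per-user API keys could be encrypted before being written to MongoDB (helper names are illustrative; in the real app the key would come from the `ENCRYPTION_KEY` env variable):

```python
from cryptography.fernet import Fernet

# Stand-in for the ENCRYPTION_KEY env variable.
key = Fernet.generate_key()
fernet = Fernet(key)

def encrypt_api_key(plaintext: str) -> bytes:
    """Encrypt a user-supplied API key before storing it."""
    return fernet.encrypt(plaintext.encode())

def decrypt_api_key(token: bytes) -> str:
    """Decrypt a stored API key when it is needed for an API call."""
    return fernet.decrypt(token).decode()

token = encrypt_api_key("sk-example-user-key")
assert decrypt_api_key(token) == "sk-example-user-key"
```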
```bash
python app.py
```

Visit http://localhost:5000 in your web browser.
After registering/logging in, each user must add their own API keys on the Profile page:
| Service | Required? | Where to Get | Notes |
|---|---|---|---|
| Gemini API Key | Required | aistudio.google.com | Free; used for embeddings & chat |
| Pinecone API Key | Required | app.pinecone.io | Free tier available |
| Pinecone Index Name | Required | Pinecone Dashboard | Create: dim 3072, metric cosine |
| Groq API Key | Optional | console.groq.com | For Llama 3 chat generation |
- Create a free account at pinecone.io
- Create a Serverless index with:
  - Dimension: `3072`
  - Metric: `cosine`
- Copy your API key and index name into the Profile page
- Go to the Google Cloud Console: console.cloud.google.com
- Create a new project and navigate to APIs & Services → Credentials
- Click Create Credentials → OAuth Client ID
- Set the Authorized redirect URI to: `http://localhost:5000/login/google/authorized`
- Copy your `Client ID` and `Client Secret` into the `.env` file
- Upload: User uploads a document (PDF, DOCX, TXT, or MD).
- Chunking: The document is parsed and split into manageable textual chunks.
- Embedding: Chunks are converted to 3072-dimensional vectors using `gemini-embedding-001`.
- Vector Storage: Vectors are stored in the user's Pinecone namespace.
- Querying: The user submits a question.
- Retrieval: Pinecone retrieves the most semantically relevant chunks.
- Generation: The retrieved context is passed to the selected LLM (Groq or Gemini) to generate an accurate, grounded answer.
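The chunking and retrieval steps above can be sketched in pure Python (toy 3-d vectors stand in for the real 3072-d Gemini embeddings, and the helper names are illustrative, not the app's actual code):

```python
import math

def chunk_text(text: str, size: int = 200, overlap: int = 50) -> list[str]:
    """Step 2: split a document into overlapping character chunks."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity: the metric the Pinecone index is configured with."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def top_k(query_vec, chunk_vecs, k=2):
    """Step 6: return indices of the k most similar chunks (done by Pinecone in the app)."""
    ranked = sorted(range(len(chunk_vecs)),
                    key=lambda i: cosine(query_vec, chunk_vecs[i]),
                    reverse=True)
    return ranked[:k]

# Toy vectors in place of real embeddings.
vecs = [[1, 0, 0], [0, 1, 0], [0.9, 0.1, 0]]
assert top_k([1, 0, 0], vecs, k=2) == [0, 2]
```

In the app itself, the embedding and search are performed by Gemini and Pinecone; this only illustrates the shape of the pipeline.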
- Push your code to GitHub
- Go to Render → New → Web Service
- Connect your GitHub repository
- Render auto-detects `render.yaml` and configures everything
- Add environment variables: `SECRET_KEY`, `ENCRYPTION_KEY`, `MONGO_URI`, `GOOGLE_CLIENT_ID`, `GOOGLE_CLIENT_SECRET`
- Update the Google OAuth redirect URI to: `https://your-app.onrender.com/login/google/authorized`
- Deploy!
```bash
docker build -t rag-app .
docker run -p 5000:5000 --env-file .env rag-app
```

| Tool | Purpose |
|---|---|
| GitHub Actions | CI/CD pipeline |
| Bandit | SAST: Python security vulnerability scanning |
| Gitleaks | Hardcoded secret and credential detection |
| Trivy | Container and dependency vulnerability checking |
| Snyk | Advanced dependency vulnerability scanning |
| OWASP ZAP | DAST: dynamic web security scanning |
| SonarCloud | Overall code quality and security analysis |
| GHCR | Docker image hosting via GitHub Container Registry |
- Name: Paramjit Singh (param20h)
This project is licensed under the MIT License. Check the LICENSE file for more details.
If you found this project helpful or inspiring, please give it a ⭐!