🧠 RAG Knowledge Base

A production-ready Retrieval-Augmented Generation (RAG) system that allows users to upload PDFs and chat with them using AI. Built with 100% free and open-source technologies.

✨ Features

📄 PDF Upload: Upload and process multiple PDF documents
💬 AI Chat Interface: Ask questions and get intelligent answers
🔍 Vector Search: Semantic search using ChromaDB
📚 Source Citations: See which documents answers come from
⚡ Streaming Responses: Real-time AI responses with typewriter effect
🎨 Modern UI: Beautiful glassmorphism design with dark theme
🆓 100% Free: Uses free tiers and open-source models

🛠️ Tech Stack

Frontend: Streamlit
LLM: Google Gemini 1.5 Flash (Free Tier)
Embeddings: Sentence Transformers (all-MiniLM-L6-v2)
Vector DB: ChromaDB (Local)
Framework: LangChain
PDF Processing: PyPDF2

🚀 Quick Start

Prerequisites

Python 3.9 or higher
Google API Key (free from Google AI Studio)

Installation

Clone or download the project:
```
cd rag-knowledge-base
```

Create a virtual environment:

python -m venv venv

# On Windows
venv\Scripts\activate

# On macOS/Linux
source venv/bin/activate

Install dependencies:
```
pip install -r requirements.txt
```
Set up environment variables:
- Copy .env.example to .env
- Add your Google API key:
```
GOOGLE_API_KEY=your_api_key_here
```
Run the application:
```
streamlit run app.py
```
Open your browser to http://localhost:8501

📖 Usage

Upload PDFs: Click the upload button and select your PDF files
Process Documents: Click "Process Documents" to analyze your PDFs
Start Chatting: Ask questions about your documents in the chat interface
View Sources: Expand source citations to see where answers came from
Manage Documents: Use the sidebar to view and delete documents

🏗️ Project Structure

rag-knowledge-base/
├── app.py                          # Main Streamlit application
├── requirements.txt                # Python dependencies
├── .env.example                    # Environment variables template
├── .streamlit/
│   └── config.toml                 # Streamlit configuration
├── src/
│   ├── components/
│   │   ├── chat_interface.py       # Chat UI component
│   │   ├── pdf_uploader.py         # PDF upload component
│   │   └── sidebar.py              # Sidebar component
│   ├── core/
│   │   ├── embeddings.py           # Embedding generation
│   │   ├── pdf_processor.py        # PDF processing
│   │   ├── rag_pipeline.py         # RAG logic
│   │   └── vector_store.py         # ChromaDB integration
│   └── utils/
│       ├── config.py               # Configuration
│       └── session_state.py        # State management
└── assets/
    └── styles.py                   # Custom CSS styles

🎨 UI Features

Glassmorphism Effects: Modern semi-transparent UI elements
Smooth Animations: Fade-ins, hover effects, and transitions
Gradient Accents: Beautiful color gradients throughout
Dark Theme: Professional dark mode with proper contrast
Responsive Design: Works on desktop, tablet, and mobile

🚀 Deployment

Deploy to Streamlit Cloud (Free)

Push to GitHub:

git init
git add .
git commit -m "Initial commit"
git remote add origin your-repo-url
git push -u origin main

Deploy on Streamlit Cloud:
- Go to share.streamlit.io
- Connect your GitHub repository
- Select app.py as the main file
- Add GOOGLE_API_KEY in Secrets Management
- Click Deploy!
Share your URL with recruiters!

🔑 Getting API Keys

Google Gemini API (Free)

Visit Google AI Studio
Click "Create API Key"
Copy and paste into your .env file

Free Tier Limits:

15 requests per minute
1,500 requests per day
Perfect for demos and personal projects!

💡 Tips for YC Recruiters

This project demonstrates:

✅ Modern AI Engineering: RAG, vector databases, LLM integration
✅ Production-Ready Code: Clean architecture, error handling, caching
✅ UX Excellence: Streaming responses, source citations, beautiful UI
✅ Full-Stack Skills: Frontend (Streamlit), backend (Python), AI/ML
✅ Deployment: Cloud-ready with free hosting

🤝 Contributing

Feel free to fork, improve, and submit pull requests!

📝 License

MIT License - feel free to use this project for your portfolio!

🌟 Showcase

Add screenshots or a demo video here after deployment!

Built with ❤️ for YC Recruiters

Powered by Google Gemini, ChromaDB, and LangChain

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.streamlit		.streamlit
assets		assets
src		src
.env.example		.env.example
.gitignore		.gitignore
DEPLOYMENT.md		DEPLOYMENT.md
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
start.bat		start.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🧠 RAG Knowledge Base

✨ Features

🛠️ Tech Stack

🚀 Quick Start

Prerequisites

Installation

📖 Usage

🏗️ Project Structure

🎨 UI Features

🚀 Deployment

Deploy to Streamlit Cloud (Free)

🔑 Getting API Keys

Google Gemini API (Free)

💡 Tips for YC Recruiters

🤝 Contributing

📝 License

🌟 Showcase

About

Uh oh!

Releases

Packages

Languages

Poi5eN/rag-knowledge-base

Folders and files

Latest commit

History

Repository files navigation

🧠 RAG Knowledge Base

✨ Features

🛠️ Tech Stack

🚀 Quick Start

Prerequisites

Installation

📖 Usage

🏗️ Project Structure

🎨 UI Features

🚀 Deployment

Deploy to Streamlit Cloud (Free)

🔑 Getting API Keys

Google Gemini API (Free)

💡 Tips for YC Recruiters

🤝 Contributing

📝 License

🌟 Showcase

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages