A powerful Retrieval Augmented Generation (RAG) application that combines real-time web search, Wikipedia knowledge, and local language models to provide comprehensive answers to user questions.
- Multi-Source Knowledge Retrieval: Integrates Google Search and Wikipedia for comprehensive information gathering
- Local LLM Processing: Uses Ollama with Llama 3.1 for intelligent response generation
- Vector Database Storage: Implements Chroma DB with NVIDIA embeddings for semantic search
- Real-time Streaming: Provides live response generation with custom UI feedback
- Persistent Knowledge Base: Maintains conversation history and learned information
- User-Friendly Interface: Clean Streamlit web application with expandable output containers
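At a high level, these components wire together roughly as follows. This is a minimal sketch, not the exact code in RAG.py; the import paths and the NVIDIA embedding model name are assumptions based on common LangChain usage.

```python
# Minimal sketch of the component wiring (not the exact RAG.py code).
from langchain_community.utilities import GoogleSerperAPIWrapper    # real-time web search
from langchain_community.retrievers import WikipediaRetriever       # Wikipedia articles
from langchain_community.vectorstores import Chroma                 # persistent vector store
from langchain_community.chat_models import ChatOllama              # local Llama 3.1 via Ollama
from langchain_nvidia_ai_endpoints import NVIDIAEmbeddings          # NVIDIA embedding endpoint

search = GoogleSerperAPIWrapper()                    # reads SERPER_API_KEY from the environment
wikipedia = WikipediaRetriever(top_k_results=2)      # top-k Wikipedia pages per query
embeddings = NVIDIAEmbeddings(model="NV-Embed-QA")   # embedding model name is an assumption
vectordb = Chroma(persist_directory="./chroma_db", embedding_function=embeddings)
llm = ChatOllama(model="llama3.1", temperature=0, streaming=True)
```

Retrieved snippets are embedded into Chroma, and the most relevant chunks are passed to the local model as context for each answer.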
Before running this application, ensure you have:
- Python 3.8 or higher
- Anaconda distribution installed
- Ollama installed and running
- Google Serper API key
- NVIDIA API key
- At least 8GB RAM (recommended for local LLM)
```bash
git clone https://github.com/yourusername/rag-qa-system.git
cd rag-qa-system
```

```bash
conda create -n env_name python=3.8
conda activate env_name
```

```bash
pip install -r requirements.txt
```

```bash
# Install Ollama (visit https://ollama.ai/ for OS-specific instructions)
# After installation on your machine, run
ollama --version

# Pull the required model
ollama pull llama3.1
```
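To confirm that Ollama and the model are reachable from Python, a quick check like the following can help (a sketch, assuming the packages from requirements.txt are installed):

```python
# Quick sanity check: ask the locally pulled llama3.1 model for a one-line reply.
from langchain_community.chat_models import ChatOllama

llm = ChatOllama(model="llama3.1", temperature=0)
print(llm.invoke("Reply with OK if you can read this.").content)
```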
Edit the file named `id.py` in the project root directory:

```python
# id.py
SERPER_API_KEY = "your_serper_api_key_here"
nvapi_key = "your_nvidia_api_key_here"
```

To get a Google Serper API key:
- Visit Serper.dev
- Sign up for a free account
- Get your API key from the dashboard
- Add it to your `id.py` file
To get an NVIDIA API key:
- Visit NVIDIA AI Foundation Models
- Sign up and get your API key
- Add it to your `id.py` file
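Inside the application, these keys are typically exported as environment variables so the LangChain clients can pick them up. The exact mechanism in RAG.py may differ; the snippet below is a hedged sketch, and the environment variable names are assumptions.

```python
# Sketch: export the keys from id.py for the Serper and NVIDIA clients to read.
import os
from id import SERPER_API_KEY, nvapi_key

os.environ["SERPER_API_KEY"] = SERPER_API_KEY   # read by GoogleSerperAPIWrapper
os.environ["NVIDIA_API_KEY"] = nvapi_key        # read by NVIDIAEmbeddings
```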
```bash
streamlit run RAG.py
```

Open your browser and navigate to http://localhost:8501
- Enter your question in the text input field
- Watch as the system retrieves information and generates answers in real-time
- Expand the output container to see the complete response
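The live output described above follows the standard Streamlit streaming pattern. The snippet below is a simplified sketch of that UI loop, not the exact code in RAG.py:

```python
# Sketch of the streaming answer UI: tokens are appended inside an expandable container.
import streamlit as st
from langchain_community.chat_models import ChatOllama

llm = ChatOllama(model="llama3.1", temperature=0, streaming=True)
question = st.text_input("Ask a question")
if question:
    with st.expander("Answer", expanded=True):
        placeholder = st.empty()
        answer = ""
        for chunk in llm.stream(question):   # stream tokens as they are generated
            answer += chunk.content
            placeholder.markdown(answer)
```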
```
rag-qa-system/
├── RAG.py               # Main application file
├── id.py                # API keys configuration
├── requirements.txt     # Python dependencies
├── README.md            # Project documentation
├── chroma_db/           # Vector database storage (created automatically)
└── .gitignore           # Git ignore file
```
To use a different Ollama model, modify the llm initialization in RAG.py:

```python
llm = ChatOllama(model="your-preferred-model", temperature=0, streaming=True)
```

Modify the text splitter parameters for different chunk sizes:

```python
splitter = CharacterTextSplitter(chunk_size=1500, chunk_overlap=300)
```

Change the persistence directory for the Chroma database:

```python
persist_directory = "./your-custom-db-path"
```
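For reference, the customizations above typically come together when the vector store is rebuilt. A minimal sketch, assuming `Chroma.from_texts` and the NVIDIA embedding model name used earlier in this README:

```python
# Sketch: split retrieved text with custom chunk sizes and persist it to a custom path.
from langchain.text_splitter import CharacterTextSplitter
from langchain_community.vectorstores import Chroma
from langchain_nvidia_ai_endpoints import NVIDIAEmbeddings

retrieved_text = "...search and Wikipedia results concatenated here..."  # placeholder
splitter = CharacterTextSplitter(chunk_size=1500, chunk_overlap=300)
chunks = splitter.split_text(retrieved_text)

vectordb = Chroma.from_texts(
    chunks,
    embedding=NVIDIAEmbeddings(model="NV-Embed-QA"),   # model name is an assumption
    persist_directory="./your-custom-db-path",
)
```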
1. Ollama Connection Error

```bash
# Ensure Ollama is running
ollama serve

# Pull the required model
ollama pull llama3.1
```

2. API Key Errors
- Verify your API keys are correctly set in `id.py`
- Check API key quotas and permissions
3. ChromaDB Persistence Issues
- Delete the `chroma_db` folder and restart the application
- Ensure proper write permissions in the project directory
4. Memory Issues
- Reduce chunk size in the text splitter
- Close other memory-intensive applications
- Never commit your `id.py` file to version control
- Add `id.py` to your `.gitignore` file
- Use environment variables in production deployments
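For the last point, a hedged sketch of an environment-variable fallback (the variable names are assumptions, not necessarily what RAG.py currently expects):

```python
# Sketch: read API keys from the environment in production instead of id.py.
import os

SERPER_API_KEY = os.environ.get("SERPER_API_KEY", "")
nvapi_key = os.environ.get("NVIDIA_API_KEY", "")
if not SERPER_API_KEY or not nvapi_key:
    raise RuntimeError("Set SERPER_API_KEY and NVIDIA_API_KEY before starting the app.")
```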
- Fork the repository
- Create a feature branch (`git checkout -b feature/amazing-feature`)
- Commit your changes (`git commit -m 'Add some amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
- LangChain for the RAG framework
- Ollama for local LLM hosting
- Streamlit for the web interface
- ChromaDB for vector storage
- NVIDIA for embedding models
Note: Make sure to keep your API keys secure and never share them publicly!