This project implements a Retrieval-Augmented Generation (RAG) system that combines OpenAI's GPT-4 for answer generation, Pinecone for vector storage, and LangChain for document retrieval and query processing.
- ✅ Document Processing: Supports TXT, PDF, and CSV files.
- ✅ Embeddings with OpenAI: Converts text into vector embeddings.
- ✅ Efficient Search: Uses Pinecone to store and retrieve relevant information.
- ✅ Modular Architecture: Well-structured codebase for easy scalability and maintenance.
- ✅ Logging & Error Handling: Helps identify issues efficiently.
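As a rough sketch of the document-processing feature (the function name and dispatch logic here are illustrative, not the project's actual code), text extraction for TXT and CSV files might look like this; PDF extraction would typically delegate to a library such as `pypdf`:

```python
import csv
from pathlib import Path


def load_document(path: str) -> str:
    """Extract plain text from a supported file.

    TXT and CSV are handled with the standard library; a real pipeline
    would add a PDF branch (e.g. via pypdf).
    """
    p = Path(path)
    if p.suffix == ".txt":
        return p.read_text(encoding="utf-8")
    if p.suffix == ".csv":
        with p.open(newline="", encoding="utf-8") as f:
            # Flatten rows into lines of space-separated cells.
            return "\n".join(" ".join(row) for row in csv.reader(f))
    raise ValueError(f"Unsupported file type: {p.suffix}")
```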
```
git clone https://github.com/patrick-cuppi/rag-system
cd rag-system
pip install -r requirements.txt
```

Create a `.env` file in the root directory and add the following:
```
OPENAI_API_KEY=your_openai_api_key
PINECONE_API_KEY=your_pinecone_api_key
PINECONE_ENV=your_pinecone_environment
PINECONE_INDEX=your_pinecone_index_name
```

Place your TXT, PDF, or CSV files inside the `data/documents/` folder.
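These variables are presumably read into the environment at startup (commonly via `python-dotenv`'s `load_dotenv()`). A minimal stdlib sketch of that idea, with the parsing logic shown for illustration rather than taken from the project's code:

```python
import os
from pathlib import Path


def load_env(path: str = ".env") -> None:
    """Populate os.environ from KEY=value lines in a .env file.

    Blank lines and '#' comments are skipped; variables already set
    in the environment are NOT overwritten.
    """
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        os.environ.setdefault(key.strip(), value.strip())
```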
```
python main.py
```

You'll be prompted to enter a question based on the stored documents.
Loads Documents → Extracts text from supported formats.
Embeds the Content → Converts documents into vector representations using OpenAI.
Stores in Pinecone → Enables fast and efficient retrieval.
Retrieves & Generates Answers → Finds relevant information and uses GPT-4 to generate a response.
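Conceptually, the retrieval step finds the stored vectors closest to the query embedding — Pinecone does this at scale over an index. A toy in-memory sketch using cosine similarity, with made-up two-dimensional embeddings standing in for OpenAI's:

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def retrieve(query_vec, store, top_k=1):
    """Return the top_k (doc_id, score) pairs ranked by similarity."""
    scored = [(doc_id, cosine(query_vec, vec)) for doc_id, vec in store.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy "index": doc_id -> embedding (real embeddings come from OpenAI).
store = {"ml_doc": [0.9, 0.1], "cooking_doc": [0.1, 0.9]}
print(retrieve([0.8, 0.2], store))  # ml_doc ranks first
```

The retrieved passages are then placed into the GPT-4 prompt as context for answer generation.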
```
Enter your question (or type 'exit' to quit): What is the main topic of the document?

🔹 Response: "The document discusses advanced machine learning techniques for image processing."
```
If you encounter any issues:

- Ensure your API keys in `.env` are correct.
- Verify that the Pinecone index exists.
- Run `pip install -r requirements.txt` to reinstall dependencies.
- Check the logs produced by `rag_pipeline.py` for detailed errors.
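The logging mentioned in the features list might be configured along these lines (the logger name and format string are illustrative; see `rag_pipeline.py` for the project's actual setup):

```python
import logging


def get_logger(name: str = "rag_pipeline") -> logging.Logger:
    """Return a logger that writes timestamped records to stderr."""
    logger = logging.getLogger(name)
    if not logger.handlers:  # avoid attaching duplicate handlers
        handler = logging.StreamHandler()
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(name)s: %(message)s")
        )
        logger.addHandler(handler)
        logger.setLevel(logging.INFO)
    return logger


log = get_logger()
log.info("Pinecone index ready")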
This project is licensed under the MIT License.
Pull requests and improvements are welcome! Feel free to submit issues or enhancements.
