A flexible Retrieval-Augmented Generation (RAG) chat system that supports multiple document retrieval sources (Vectorize, Pinecone, or none) combined with OpenAI's GPT-4o-mini for intelligent answer generation.
- Multiple RAG Sources: Choose between Vectorize, Pinecone, or no external knowledge base
- Interactive CLI: Beautiful command-line interface with loading animations and colored output
- Context-Aware Responses: Generates answers based on retrieved documents and conversation context
- Extensible Architecture: Easy to add new RAG sources by implementing the base interface
- Environment-Aware: Automatically checks for required environment variables based on selected source
1. Install dependencies:

   ```bash
   uv sync
   ```

2. Set up environment variables (see the Configuration section below)

3. Run the application:

   ```bash
   uv run main.py
   ```
Edit `main.py` and modify the `RAG_SOURCE` variable:
```python
from rag_source_base import RAGSourceType

# Choose one of:
RAG_SOURCE = RAGSourceType.NONE       # No document retrieval (OpenAI only)
RAG_SOURCE = RAGSourceType.VECTORIZE  # Use Vectorize.io for retrieval
RAG_SOURCE = RAGSourceType.PINECONE   # Use Pinecone (mock implementation)
```
Create a `.env` file in the project root with the variables required by your selected source.

No external source (`RAGSourceType.NONE`):

```
OPENAI_API_KEY=your-openai-api-key-here
```

Vectorize (`RAGSourceType.VECTORIZE`):

```
OPENAI_API_KEY=your-openai-api-key-here
VECTORIZE_PIPELINE_ACCESS_TOKEN=your-vectorize-token
VECTORIZE_ORGANIZATION_ID=your-organization-id
VECTORIZE_PIPELINE_ID=your-pipeline-id
```

Pinecone (`RAGSourceType.PINECONE`):

```
OPENAI_API_KEY=your-openai-api-key-here
PINECONE_API_KEY=your-pinecone-api-key
PINECONE_ENVIRONMENT=your-pinecone-environment
PINECONE_INDEX_NAME=your-pinecone-index-name
```
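At startup the application verifies that the variables for the selected source are present. A minimal sketch of such a check, assuming python-dotenv; the `check_env_vars` helper name is illustrative, not the exact function in this repo:

```python
import os
from typing import List

from dotenv import load_dotenv

def check_env_vars(required: List[str]) -> List[str]:
    """Return the names of required variables that are missing."""
    load_dotenv()  # reads .env from the project root
    return [name for name in required if not os.getenv(name)]

# Example: fail fast if the Pinecone configuration is incomplete.
missing = check_env_vars(["OPENAI_API_KEY", "PINECONE_API_KEY",
                          "PINECONE_ENVIRONMENT", "PINECONE_INDEX_NAME"])
if missing:
    raise SystemExit("Missing environment variables: " + ", ".join(missing))
```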
OpenAI (required for all configurations):
- Go to the OpenAI API keys page at platform.openai.com
- Create a new API key
- Add billing information to your OpenAI account
Vectorize:
- Sign up at Vectorize.io
- Create a pipeline for your documents
- Get your organization ID, pipeline ID, and access token from the dashboard
Pinecone:
- Sign up at Pinecone
- Create a new index
- Get your API key and environment from the dashboard
- Note: the current wrapper is a mock implementation; replace it with the actual Pinecone client (see the sketch below)
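When you do swap in the real client, here is a hedged sketch of what `retrieve_documents` might look like using the official `pinecone` package and an OpenAI embedding model called through litellm. The embedding model and the `text` metadata field are assumptions about how your index was built:

```python
import os
from typing import List

from litellm import embedding
from pinecone import Pinecone

def retrieve_documents(question: str, num_results: int = 5) -> List[dict]:
    # Embed the question with the same model used when indexing your documents.
    emb = embedding(model="text-embedding-3-small", input=[question])
    vector = emb.data[0]["embedding"]

    pc = Pinecone(api_key=os.environ["PINECONE_API_KEY"])
    index = pc.Index(os.environ["PINECONE_INDEX_NAME"])
    results = index.query(vector=vector, top_k=num_results, include_metadata=True)

    # Assumes each vector was upserted with a "text" metadata field.
    return [{"text": match.metadata.get("text", ""), "score": match.score}
            for match in results.matches]
```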
```
rag-python/
├── main.py                 # Main application entry point
├── rag_chat.py             # Core RAG chat logic
├── rag_source_base.py      # Base interface for RAG sources
├── vectorize_wrapper.py    # Vectorize.io implementation
├── pinecone_wrapper.py     # Pinecone mock implementation
├── cli_interface.py        # Command-line interface and styling
├── example_usage.py        # Programmatic usage examples
├── pyproject.toml          # Project dependencies
├── uv.lock                 # Dependency lock file
└── .env                    # Environment variables (create this)
```
Run the main application:

```bash
uv run main.py
```
The application will:
- Check your environment variables
- Initialize the selected RAG source
- Start an interactive chat session
- Display retrieved documents (if using an external source)
- Generate and show AI responses
You can also use the components programmatically (see `example_usage.py` for more):

```python
from rag_chat import RAGChat
from cli_interface import CLIInterface
from vectorize_wrapper import VectorizeWrapper

# With Vectorize
cli = CLIInterface("My RAG App")
vectorize_source = VectorizeWrapper()
rag = RAGChat(cli, rag_source=vectorize_source)
answer = rag.chat("What is machine learning?")
print(answer)

# Without an external source
rag_no_source = RAGChat(cli, rag_source=None)
answer = rag_no_source.chat("What is machine learning?")
print(answer)
```
With an external RAG source (Vectorize or Pinecone):

1. User asks a question
2. The system queries the RAG source for relevant documents
3. Retrieved documents are formatted as context
4. The context and question are sent to OpenAI GPT-4o-mini
5. The model generates an answer based on the provided context
6. The answer is displayed with source information

Without an external source (`RAGSourceType.NONE`):

1. User asks a question
2. The question is sent directly to OpenAI GPT-4o-mini
3. The model generates an answer from its general knowledge
4. The answer is displayed
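Both flows funnel into a single model call through litellm's OpenAI-compatible `completion` API. A simplified sketch of that step; the prompt wording and document shape are illustrative, not the exact code in `rag_chat.py`:

```python
from litellm import completion

def generate_answer(question: str, documents=None) -> str:
    messages = []
    if documents:
        # Format retrieved documents as context for the model.
        context = "\n\n".join(doc["text"] for doc in documents)
        messages.append({"role": "system",
                         "content": "Answer using this context:\n" + context})
    messages.append({"role": "user", "content": question})

    response = completion(model="gpt-4o-mini", messages=messages)
    return response.choices[0].message.content
```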
To add a new RAG source (e.g., Chroma, Weaviate):

1. Create a wrapper class:

```python
from typing import List

from rag_source_base import RAGSourceBase

class MyNewWrapper(RAGSourceBase):
    def retrieve_documents(self, question: str, num_results: int = 5) -> List[dict]:
        # Implement your retrieval logic here
        documents = []
        return documents

    def get_required_env_vars(self) -> List[str]:
        return ["MY_API_KEY", "MY_INDEX_NAME"]
```
2. Add the new value to the `RAGSourceType` enum in `rag_source_base.py`:

```python
class RAGSourceType(Enum):
    NONE = "none"
    VECTORIZE = "vectorize"
    PINECONE = "pinecone"
    MY_NEW_SOURCE = "my_new_source"  # Add this
```
3. Update `get_rag_source()` in `main.py`:

```python
elif RAG_SOURCE == RAGSourceType.MY_NEW_SOURCE:
    wrapper = MyNewWrapper()
    return wrapper, wrapper.get_required_env_vars()
```
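4. Select the new source in `main.py`, just like the built-in options:

```python
RAG_SOURCE = RAGSourceType.MY_NEW_SOURCE
```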
Missing environment variables:
- Check that your `.env` file exists and contains the correct variables
- Ensure there are no extra spaces around the `=` in your `.env` file
API key errors:
- Verify your OpenAI API key is valid and has billing enabled
- Check Vectorize/Pinecone credentials are correct
Import errors:
- Run `uv sync` to install all dependencies
- Ensure you're using Python 3.8+
No documents found:
- Verify your Vectorize pipeline has documents indexed
- Check your Pinecone index has embeddings
General debugging:
- Check the application logs for specific error messages
- Verify all environment variables are set correctly
- Test with `RAG_SOURCE = RAGSourceType.NONE` to isolate issues
- Ensure your API keys have the necessary permissions
- Python >= 3.8
- `litellm` - Multi-provider LLM interface
- `vectorize-client` - Vectorize.io Python client
- `python-dotenv` - Environment variable management
See `pyproject.toml` for the complete dependency list.