The Hidden Parrot: RAG Poisoning POC

The Hidden Parrot: Stealthy Prompt Injection and Poisoning in RAG Systems via Vector Database Embeddings

Brought to you by Prompt Security, the Complete Platform for GenAI Security

Abstract

Retrieval Augmented Generation (RAG) systems are increasingly popular for enhancing Large Language Models (LLMs) with external, up-to-date knowledge. These systems typically rely on vector databases to store and retrieve relevant document embeddings that augment user prompts. This paper demonstrates a critical vulnerability in common RAG architectures: the potential for stealthy prompt injection and data poisoning through seemingly benign embeddings. We show that by embedding malicious instructions within documents ingested into the vector database, an attacker can manipulate the downstream behavior of the LLM. Our Proof-of-Concept (PoC), utilizing open-source RAG stacks (e.g., LangChain, Chroma/Weaviate), successfully demonstrates how a RAG system can be coerced into adopting a specific persona (e.g., "answering like a pirate") by retrieving a poisoned document. This research highlights a significant, yet easily exploitable, attack surface in RAG deployments and calls for urgent attention to mitigation strategies.

Technical Stack

flowchart TD
    %% Application Layer
    A[Application Layer<br/>rag_poisoning_demo.py] --> B[Orchestration<br/>LangChain v0.1.0]
    
    %% Core Components
    B --> C[Language Model<br/> Phi-3.5-mini-instruct]
    B --> D[Embedding Model<br/>all-MiniLM-L6-v2]
    B --> E[Vector Database<br/>ChromaDB v0.4.24]
    
    %% Data Flow
    F[Documents] --> G[Text Splitting]
    G --> D
    D --> H[Embeddings]
    H --> E
    
    %% Query Flow
    I[User Query] --> D
    D --> J[Query Embedding]
    J --> E
    E --> K[Retrieved Docs]
    K --> C
    C --> L[Response]
    
    %% Styling
    style A fill:#e3f2fd
    style B fill:#e8f5e9
    style C fill:#fff3e0
    style D fill:#fff3e0
    style E fill:#ffebee
    style F fill:#f3e5f5
    style I fill:#f3e5f5
    style L fill:#e3f2fd

Component Details

Language Model: Phi-3.5-mini-instruct (Q4_K_M quantization, 4096 token context)
Embedding Model: sentence-transformers/all-MiniLM-L6-v2 (384-dimensional vectors)
Vector Database: ChromaDB with SQLite backend, similarity search (top-k=3)
Orchestration: LangChain RetrievalQA chain with "stuff" chain type
Environment: Python 3.11+ with uv package manager

Files Structure

ragpoc/
├── README.md                       # This file
├── requirements.txt                # Python dependencies
├── setup.sh                       # Environment setup script
├── test_setup.py                  # Setup verification script
├── src/                            # Source code
│   ├── config.py                   # Configuration
│   ├── utils.py                    # Utilities
│   ├── llm_factory.py              # LLM creation
│   ├── rag_system.py               # RAG components
│   ├── attack_demo.py              # Attack logic
│   ├── rag_poisoning_demo.py       # Main orchestration (refactored)
│   └── rag_poisoning_corpus.py     # Additional corpus utilities
├── data/                           # Data storage
│   └── chroma_db/                  # Vector database storage
├── logs/                           # Application logs
└── models/                         # Downloaded Model storage
    ├── embedding/                  # Downloaded Embedding models
    └── llm/                        # Downloaded Language models

Quick Start

For Local LLM Inference (Full Setup + LlamaCpp Inference)

# Make setup script executable and run
chmod +x setup.sh
./setup.sh

# Activate virtual environment
source .venv/bin/activate

# Test the setup (supports --no-local for remote inference only)
python3 test_setup.py

For Remote Inference Only (DeepSeek/Ollama)

If you plan to use only DeepSeek or Ollama for inference and don't need the local LLM model:

# Skip local LLM download to save ~4GB disk space
chmod +x setup.sh
./setup.sh --no-local

# Activate virtual environment
source .venv/bin/activate

# Test the setup (supports --no-local for remote inference only)
python3 test_setup.py --no-local

3. Run the Hidden Parrot Attack Demo

Local LLM Inference

# Run with local Phi-3.5-mini-instruct model (default)
python3 src/rag_poisoning_demo.py

Remote Inference Options

Using Ollama:

# Ollama configuration is in .env file (URL and model names)
# No additional setup needed - just run:
python3 src/rag_poisoning_demo.py --infer ollama

Using DeepSeek API:

# Copy the example keys and env file and add your API keys/configurations
cp .keys.example .keys
cp .env.example .env
# Edit .keys file and add your DeepSeek API key

# Run the demo with DeepSeek
python3 src/rag_poisoning_demo.py --infer deepseek

Platform-specific Inference

# Force specific platform/device
python3 src/rag_poisoning_demo.py --infer cpu     # Force CPU
python3 src/rag_poisoning_demo.py --infer cuda    # Force CUDA (if available)
python3 src/rag_poisoning_demo.py --infer darwin  # Force Apple Silicon (MPS)

Key Research Contributions

Attack Vector: First demonstration of prompt injection via vector database embeddings
Practical Implementation: Working proof-of-concept using LangChain and Chroma
Security Analysis: Comprehensive threat model and mitigation strategies
Reproducible Results: Complete experimental setup and code availability

Document Sections

The complete research paper is contained in research_paper/index.qmd and organized into chapters under research_paper/chapters/. The paper includes:

Abstract & Introduction: Problem motivation and research overview
Background: RAG architectures, vector databases, and related work
Threat Model: Attacker capabilities and goals
Methodology: Experimental design and "Pirate Attack" implementation
Results: Attack success metrics and analysis
Discussion: Security implications and detection challenges
Mitigation: Defensive strategies and countermeasures
Future Work: Research directions and open problems
Appendix: PoC implementation reference and document examples

Requirements

For Code Execution

Python 3.11+
uv Virtual environment support
~2GB disk space (for models and dependencies)
Additional for local LLM: ~4GB additional space for Phi-3.5-mini-instruct model

For Remote Inference

Ollama: Ollama server (local or remote) with URL and model configured in .env
DeepSeek: Valid API key configured in .keys file (copy from .keys.example)
Both options significantly reduce local storage requirements

⚠️ Responsible Research Notice: This work is intended for legitimate security research and educational purposes. Please use these techniques responsibly and in accordance with applicable laws and ethical guidelines.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

The Hidden Parrot: RAG Poisoning POC

The Hidden Parrot: Stealthy Prompt Injection and Poisoning in RAG Systems via Vector Database Embeddings

Brought to you by Prompt Security, the Complete Platform for GenAI Security

Abstract

Technical Stack

Component Details

Files Structure

Quick Start

For Local LLM Inference (Full Setup + LlamaCpp Inference)

For Remote Inference Only (DeepSeek/Ollama)

3. Run the Hidden Parrot Attack Demo

Local LLM Inference

Remote Inference Options

Platform-specific Inference

Key Research Contributions

Document Sections

Requirements

For Code Execution

For Remote Inference

About

Uh oh!

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
resources		resources
src		src
.env.example		.env.example
.gitignore		.gitignore
.keys.example		.keys.example
README.md		README.md
requirements.txt		requirements.txt
setup.sh		setup.sh
test_setup.py		test_setup.py

prompt-security/RAG_Poisoning_POC

Folders and files

Latest commit

History

Repository files navigation

The Hidden Parrot: RAG Poisoning POC

The Hidden Parrot: Stealthy Prompt Injection and Poisoning in RAG Systems via Vector Database Embeddings

Brought to you by Prompt Security, the Complete Platform for GenAI Security

Abstract

Technical Stack

Component Details

Files Structure

Quick Start

For Local LLM Inference (Full Setup + LlamaCpp Inference)

For Remote Inference Only (DeepSeek/Ollama)

3. Run the Hidden Parrot Attack Demo

Local LLM Inference

Remote Inference Options

Platform-specific Inference

Key Research Contributions

Document Sections

Requirements

For Code Execution

For Remote Inference

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages