RAG PoC

Overview and Demo

This repository is a small Retrieval-Augmented Generation (RAG) proof-of-concept.

It demonstrates how a RAG app can answer questions using extra information that you provide and that public apps like ChatGPT or Gemini would not otherwise know.

For example, suppose you have a private document containing lore about a fictional character called Zorvath. That document might look like:

Zorvath the Perpetually Confused is a semi-corporeal librarian...
He was sentenced by the Interdimensional Council
to wander realities cataloging paradoxes that may or may not exist...
Zorvath cannot remember his own name without consulting a
laminated card he keeps losing...

Full lore text: Source

You could then use this app to ask questions about it:

Tap "Browse files", select tests/data/lore-data.txt, then tap "Ingest Document".

Enter a question like "Why is Zorvath always writing things down?" and tap "Ask".

Review the answer and sources.

Running this application locally

Prerequisites

Linux (or WSL, e.g. \\wsl$\Ubuntu\home\$USER\...)
Docker
git
An OpenAI API key (set via OPENAI_API_KEY in your .env)

One-time setup

Copy the example environment and fill it in:

cp .env.example .env

Edit .env, then set OPENAI_API_KEY and any DB credentials you want to use.

Build and run locally

Set env vars:

source ./set-env.sh

Build and start the stack:

docker compose up -d --build

First time only: Initialize the database (create pgvector extension and documents table)

Run these commands after the db container is running.

docker compose exec db psql -U $POSTGRES_USER -d $POSTGRES_DB -c "CREATE EXTENSION IF NOT EXISTS vector;"
docker compose exec db psql -U $POSTGRES_USER -d $POSTGRES_DB -c 'CREATE TABLE IF NOT EXISTS documents (id TEXT PRIMARY KEY, content TEXT NOT NULL, embedding VECTOR(1536), metadata JSONB);'

Smoke test the backend and frontend

Backend health check: curl http://localhost:8000/health

Frontend: open http://localhost:8501 in a browser

Running tests

Option A — run tests inside a container (matches CI)

This mounts the repository into a temporary backend container and runs pytest with the correct PYTHONPATH so the backend package is importable.

# from repo root (Bash)
docker compose run --rm -v "$PWD:/workspace" -w /workspace backend bash -lc "PYTHONPATH=/workspace pytest -q tests/backend -vv"

Option B — run tests locally in a venv

python3 -m venv .venv
source .venv/bin/activate
pip install -r backend/requirements.txt
pytest -q tests/backend -vv

Using the app

You can now follow the demo steps in "Overview and Demo" (at the top of this document) yourself!

Appendix: Developer notes on how it works

Tech stack

The project includes:

FastAPI backend
Streamlit frontend
OpenAI for embeddings and completion (API key supplied via the OPENAI_API_KEY environment variable)
Postgres + pgvector for vector storage
Docker Compose for local development and easy EC2 deployment

Backend API implementation

When the /ingest API is called, it does this:

calls emb = openai.embed_texts([text]) to convert the document text into an embedding vector.
calls vector_store.upsert(doc_id, text, emb[0], metadata) to store the text, its embedding, and the metadata in the vector db.
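
The ingest steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the repository's code: embed_texts here is a hypothetical stand-in that hashes text into a tiny vector (the real call goes to OpenAI), and InMemoryVectorStore is a hypothetical replacement for the Postgres + pgvector store. Only the call shapes (embed_texts([text]) and upsert(doc_id, text, emb[0], metadata)) come from the description above.

```python
import hashlib
import math
from typing import Any, Optional

EMBED_DIM = 8  # toy size for this sketch; the documents table uses VECTOR(1536)


def embed_texts(texts: list[str]) -> list[list[float]]:
    """Hypothetical stand-in for openai.embed_texts: one unit-length vector per text."""
    vectors = []
    for text in texts:
        digest = hashlib.sha256(text.encode("utf-8")).digest()
        # Centre the bytes around zero, then normalise to unit length.
        raw = [b - 127.5 for b in digest[:EMBED_DIM]]
        norm = math.sqrt(sum(x * x for x in raw))
        vectors.append([x / norm for x in raw])
    return vectors


class InMemoryVectorStore:
    """Hypothetical stand-in for the pgvector-backed store, keyed by document id."""

    def __init__(self) -> None:
        self.rows: dict[str, dict[str, Any]] = {}

    def upsert(self, doc_id: str, content: str, embedding: list[float],
               metadata: Optional[dict] = None) -> None:
        # Insert a new row, or overwrite the existing row with the same id.
        self.rows[doc_id] = {
            "content": content,
            "embedding": embedding,
            "metadata": metadata or {},
        }


def ingest(store: InMemoryVectorStore, doc_id: str, text: str,
           metadata: Optional[dict] = None) -> None:
    """Mirror the two /ingest steps: embed the text, then upsert it."""
    emb = embed_texts([text])  # a list with one vector per input text
    store.upsert(doc_id, text, emb[0], metadata)


store = InMemoryVectorStore()
ingest(store, "lore-1", "Zorvath keeps losing his laminated card.",
       {"source": "lore-data.txt"})
```

Because upsert is keyed by doc_id, ingesting the same id again replaces the stored row instead of adding a duplicate, which matches the id TEXT PRIMARY KEY column in the documents table.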

When the /query API is called, it does this:

calls q_emb = openai.embed_texts([question]) to convert the question into an embedding vector.
calls rows = vector_store.query(q_emb, top_k) to fetch the top_k most relevant rows from the vector db.
creates a prompt like:

Use the following context to answer the question:
Context: (the text content of the rows returned by vector_store.query)
Question: (the original question text)
Answer:

calls openai.generate_answer(prompt) to produce the answer.
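
The query steps above can be sketched as follows. This is a minimal illustration, not the repository's implementation: top_k_rows and build_prompt are hypothetical helper names, the ranking uses cosine similarity in plain Python as an assumption (in the real app the ranking happens inside Postgres via pgvector), and the embeddings are hand-written 2-D toy vectors rather than model output. The final openai.generate_answer(prompt) call is left out because it needs an API key.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_k_rows(rows: list[dict], query_embedding: list[float], top_k: int) -> list[dict]:
    """Rank stored rows by similarity to the query, most similar first."""
    ranked = sorted(
        rows,
        key=lambda row: cosine_similarity(row["embedding"], query_embedding),
        reverse=True,
    )
    return ranked[:top_k]


def build_prompt(question: str, rows: list[dict]) -> str:
    """Fill the prompt template with retrieved text and the question text."""
    context = "\n".join(row["content"] for row in rows)
    return (
        "Use the following context to answer the question:\n"
        f"Context: {context}\n"
        f"Question: {question}\n"
        "Answer:"
    )


# Toy data: hand-written embeddings, assumed for illustration only.
rows = [
    {"content": "Zorvath keeps losing his laminated name card.", "embedding": [1.0, 0.0]},
    {"content": "The Interdimensional Council sentenced him.", "embedding": [0.0, 1.0]},
]
question = "Why is Zorvath always writing things down?"
q_emb = [0.9, 0.1]  # pretend this came from the embedding call

prompt = build_prompt(question, top_k_rows(rows, q_emb, top_k=1))
print(prompt)
```

Note that the prompt carries plain text: embeddings are used only to decide which rows to retrieve, and the completion model never sees the vectors themselves.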
