tinyRAG 🍜

Local-first RAG assistant with:

Next.js frontend (frontend/)
FastAPI backend (api.py)
ChromaDB vector store (data/chroma)
SQLite app database (data/app.db)

Demo

More screenshots

What's New in v0.2.0

Added true hybrid retrieval:
- dense vector retrieval (Chroma)
- sparse lexical retrieval (BM25)
- rank fusion with RRF
Added optional HyDE retrieval branch for ambiguous queries.
Added page metadata support for chunk citations.
Expanded retrieval testing with RFC2812 benchmark scenarios.

Retrieval Modes

Vector only
Vector + BM25 (default recommended)
Vector + BM25 + HyDE (optional, best for some vague queries)

Features

Workspace and chat management
PDF upload and background indexing
Hybrid search + HyDE
JSON chat exporting
Streaming chat responses (SSE)
All core data stays on your machine: SQLite + ChromaDB
Markdown/LaTeX rendering in chat
Use it on LAN by pointing clients to your backend host

Architecture

Frontend: Next.js + React + shadcn/ui
Backend: FastAPI + aiohttp + sentence-transformers
Retrieval: Chroma vector search (workspace-scoped)
Persistence: SQLite for app entities and metadata

Requirements

Python 3.11+
uv
Node.js 20+
pnpm 9+
Ollama running with your selected model

Quick Start (Windows)

Setup requirements

Fast API uv sync
Next.JS pnpm install (inside frontend directory)

Start backend + frontend:

.\Start.ps1

Open app:

Frontend: http://localhost:3000
Backend: http://localhost:8000

Configuration

Backend

File: CONFIG.toml
Main options:
- server upload path and model name
- Chroma persistence path/collection
- embedding model

Frontend API base

Set in frontend/.env:

NEXT_PUBLIC_API_BASE_URL=http://localhost:8000

For LAN usage, set it to your backend machine IP:

NEXT_PUBLIC_API_BASE_URL=http://192.168.x.x:8000

Important: restart frontend after changing .env.

Build Instructions

Frontend production build

cd frontend
pnpm build
pnpm start -- --hostname 0.0.0.0 --port 3000

Backend production run

uv run uvicorn api:app --host 0.0.0.0 --port 8000

Project Structure

src/
  api.py
  chroma.py
  database.py
  CONFIG.toml
  shared.py
  hybrid_search.py
Start.ps1
frontend/
  app/
  components/
  package.json
frontend_lite
testing/
  tinyAgent

Testing features

frontend_lite - small frontend version without framework dependecies
tinyAgent - my approach to make tiny codex like agent (very early stage)

Roadmap

Hybrid retrieval (vector + keyword)
Config/settings UI
Improved evaluation and observability
Optional packaging for non-dev users

License

This project is licensed under the GNU General Public License v3.0 (GPL-3.0). See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.vscode		.vscode
docs		docs
frontend		frontend
frontend_lite		frontend_lite
src		src
testing/tinyAgent		testing/tinyAgent
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
Start.ps1		Start.ps1
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tinyRAG 🍜

Demo

What's New in v0.2.0

Retrieval Modes

Features

Architecture

Requirements

Quick Start (Windows)

Configuration

Backend

Frontend API base

Build Instructions

Frontend production build

Backend production run

Project Structure

Testing features

Roadmap

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

tinyRAG 🍜

Demo

What's New in v0.2.0

Retrieval Modes

Features

Architecture

Requirements

Quick Start (Windows)

Configuration

Backend

Frontend API base

Build Instructions

Frontend production build

Backend production run

Project Structure

Testing features

Roadmap

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages