mini_rag — small local RAG demo

A compact Retrieval-Augmented-Generation (RAG) app for local Q&A over PDFs / text files.
User uploads PDFs / text files → text is extracted & split → embeddings are created → a vector index is built and queried via CLI or FastAPI.

Features

Document loading
- PDF parsing via pypdf.
- Safe text chunking with overlap (memory-safe implementation).
Embeddings
- Local embeddings with sentence-transformers (all-MiniLM-L6-v2 by default).
- OpenAI embeddings supported if API key is provided.
Vector search
- FAISS-based index (with NumPy fallback on Windows).
Interfaces
- CLI entrypoints (query_cli.py, python -m mini_rag.app).
- FastAPI server for indexing and querying.
Testing
- Pytest-based test suite (src/mini_rag/tests).
- Parallel execution with pytest-xdist.
- Unit tests for loaders, chunking, indexing, and the pipeline.
CI/CD
- GitHub Actions workflow (.github/workflows/ci.yml) runs tests on push/PR.
- Split requirements:
  - requirements.txt — runtime dependencies.
  - requirements-dev.txt — runtime + test/dev dependencies.

Project Structure

mini\_rag/
├── src/
│   └── mini\_rag/
│       ├── app.py               # CLI + FastAPI entrypoint
│       ├── docs\_loader.py       # PDF/text loading + chunking
│       ├── embed.py / embedders # Embedding utilities
│       ├── indexer.py           # Embedding + vector index build
│       ├── ingest.py            # Document ingestion
│       ├── query\_cli.py         # CLI for queries
│       ├── retrieval.py         # Retrieval helpers
│       ├── utils.py             # Utility functions
│       └── tests/               # pytest tests
│           ├── test\_docs\_loader.py
│           ├── test\_indexer.py
│           ├── test\_pipeline.py
│           └── conftest.py
├── notebooks/
│   └── demo.md
├── requirements.txt             # runtime dependencies
├── requirements-dev.txt         # runtime + dev/test deps
├── run\_sanity.py                # basic sanity check script
└── .github/workflows/ci.yml     # GitHub Actions CI config

Quick start (local)

Install runtime dependencies

python -m pip install -r requirements.txt

(Optional) Dev setup with tests

python -m pip install -r requirements-dev.txt

Run CLI
```
python -m mini_rag.app
```
Run FastAPI server
```
uvicorn src.mini_rag.app:app --reload
```

Run tests

pytest -n auto --maxfail=1 --showlocals --durations=10

Notes

On Windows, faiss-cpu may be tricky to install; if missing, a NumPy-based fallback indexer is used.
Large PDFs are streamed into overlapping chunks using the safe chunk_text implementation.
PyPDF2 is deprecated → this project uses pypdf.

Roadmap

Add coverage reports to CI.
Expand FastAPI endpoints with file upload support.
Experiment with hybrid indexes (BM25 + embeddings).
Optional Dockerfile for reproducible setup.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.github		.github
notebooks		notebooks
src/mini_rag		src/mini_rag
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
run_sanity.py		run_sanity.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

mini_rag — small local RAG demo

Features

Project Structure

Quick start (local)

Notes

Roadmap

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

Raydir27/mini_rag

Folders and files

Latest commit

History

Repository files navigation

mini_rag — small local RAG demo

Features

Project Structure

Quick start (local)

Notes

Roadmap

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages