🔬 Quaero

High-Performance, Local-First RAG Document Assistant

Transform your local documents into an intelligent, queryable knowledge base—without the bloat.

🎯 Overview

Quaero is a streamlined, local-first Retrieval-Augmented Generation (RAG) engine. Built for developers, researchers, and engineers, it completely bypasses heavy frameworks like LangChain in favor of a custom, memory-flat ingestion pipeline and blazing-fast vector search via LanceDB.

Your data never leaves your machine.

✨ The Engineering Edge

Tiered Ingestion Router: Automatically routes files to the most efficient parser (e.g., C-bound PyMuPDF for PDFs, native python-docx for Word, and raw streaming for code/text) while bouncing binary executables at the door.
Memory-Flat Processing: Reads and hashes massive files (like 1,000-page textbooks) using lazy generators, keeping your RAM usage practically at zero during ingestion.
State Reconciliation: Native sync tracking detects when you modify or delete a physical file and automatically purges or updates the orphaned vectors via relational metadata.
Zero-Config Vector Search: Powered by LanceDB's PyArrow backend for native, sub-millisecond Cosine distance retrieval.

🚀 Quick Start

Prerequisites

Python 3.11+
Ollama installed and running locally.

1. Installation

Install directly via pip (or pipx for isolated environments):

pip install quaero

2. Initial Setup

Run the interactive wizard to configure your models and chunk sizes:

quaero setup

(We recommend embeddinggemma for embeddings and a fast, instruction-tuned model like gemma or llama3 for inference).

3. Build Your Knowledge Base

Point Quaero at a single file or an entire directory. It will recursively crawl and index supported formats.

quaero ingest /path/to/your/documents/

4. Start Querying

Launch the interactive terminal UI to chat with your documents:

quaero chat

Or execute a single-shot query:

quaero chat "What are the main persistence mechanisms described in the malware textbook?"

💻 CLI Command Reference

Quaero features a modern, Rich-powered CLI.

quaero status - View database health and vector counts.
quaero ingest - Ingest a file or directory.
quaero sync - Reconcile the vector database with your physical filesystem (purges orphans, updates modifications).
quaero config show - Display active thresholds, models, and chunk parameters.
quaero config set - Tune the engine on the fly (e.g., quaero config set score_threshold 0.6).
quaero db reset - Nuke the database and start fresh.

🏗️ Architecture

graph TD
    A[Local Filesystem] -->|quaero sync / ingest| B[Tiered Extraction Router]
    B --> C[Memory-Flat Text Splitter]
    C --> D[Ollama Embedding Engine]
    D --> E[(LanceDB Vector Store)]
    
    F[User Query] --> G[Cosine Similarity Search]
    G --> E
    E --> H[Context Assembly]
    H --> I[Ollama Inference]
    I --> J[Grounded Terminal Response]

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.github/workflows		.github/workflows
src/quaero		src/quaero
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔬 Quaero

🎯 Overview

✨ The Engineering Edge

🚀 Quick Start

Prerequisites

1. Installation

2. Initial Setup

3. Build Your Knowledge Base

4. Start Querying

💻 CLI Command Reference

🏗️ Architecture

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🔬 Quaero

🎯 Overview

✨ The Engineering Edge

🚀 Quick Start

Prerequisites

1. Installation

2. Initial Setup

3. Build Your Knowledge Base

4. Start Querying

💻 CLI Command Reference

🏗️ Architecture

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages