RAG-lite TS

Simple by default, powerful when needed

A local-first TypeScript retrieval engine for semantic search over static documents. Built to be simple to use, lightweight, and hackable with zero external run-time dependencies.

Quick Start

Installation

npm install -g rag-lite-ts

Basic Usage

# Ingest documents
raglite ingest ./docs/

# Search your documents
raglite search "machine learning concepts"

# Get more results with reranking
raglite search "API documentation" --top-k 10 --rerank

Using Different Models

# Use higher quality model (auto-rebuilds if needed)
raglite ingest ./docs/ --model Xenova/all-mpnet-base-v2 --rebuild-if-needed

# Search automatically uses the correct model
raglite search "complex query"

Programmatic Usage

import { SearchEngine, IngestionPipeline } from 'rag-lite-ts';

// Initialize and ingest documents
const ingestion = new IngestionPipeline('./db.sqlite', './vector-index.bin');
await ingestion.ingestDirectory('./docs/');

// Search your documents
const search = new SearchEngine('./vector-index.bin', './db.sqlite');
const results = await search.search('machine learning', { top_k: 10 });

Configuration Options

import { SearchEngine, IngestionPipeline } from 'rag-lite-ts';

// Custom model configuration
const search = new SearchEngine('./vector-index.bin', './db.sqlite', {
  embeddingModel: 'Xenova/all-mpnet-base-v2',
  enableReranking: true,
  topK: 15
});

// Ingestion with custom settings
const ingestion = new IngestionPipeline('./db.sqlite', './vector-index.bin', {
  embeddingModel: 'Xenova/all-mpnet-base-v2',
  chunkSize: 400,
  chunkOverlap: 80
});

→ Complete CLI Reference | API Documentation

Features

📝 Simple: Get started with just new SearchEngine() - no complex setup required
🏠 Local-first: All processing happens offline on your machine
🚀 Fast: Sub-100ms queries for typical document collections
🔍 Semantic: Uses embeddings for meaning-based search, not just keywords
🛠️ Flexible: Simple constructors for basic use, advanced options when you need them
📦 Complete: CLI, programmatic API, and MCP server in one package
🎯 TypeScript: Full type safety with modern ESM architecture
🧠 Smart: Automatic model management and compatibility checking

How It Works

RAG-lite TS follows a simple pipeline:

Document Ingestion: Reads .md, .txt, .mdx, .pdf, and .docx files
Preprocessing: Cleans content (JSX components, Mermaid diagrams, code blocks)
Semantic Chunking: Splits documents at natural boundaries with token limits
Embedding Generation: Uses transformers.js models for semantic vectors
Vector Storage: Fast similarity search with hnswlib-wasm
Metadata Storage: SQLite for document info and model compatibility
Search: Embeds queries and finds similar chunks using cosine similarity
Reranking (optional): Cross-encoder models for improved relevance

Architecture

Documents → Preprocessor → Chunker → Embedder → Vector Index
                                        ↓
Query → Embedder → Vector Search → SQLite Lookup → Results

→ Document Preprocessing Guide | Model Management Details

Supported Models

RAG-lite TS supports multiple embedding models with automatic optimization:

Model	Dimensions	Speed	Use Case
`sentence-transformers/all-MiniLM-L6-v2`	384	Fast	General purpose (default)
`Xenova/all-mpnet-base-v2`	768	Slower	Higher quality, complex queries

Model Features:

Automatic downloads: Models cached locally on first use
Smart compatibility: Detects model changes and prompts rebuilds
Offline support: Pre-download for offline environments
Reranking: Optional cross-encoder models for better relevance

→ Complete Model Guide | Performance Benchmarks

Documentation

📚 Getting Started

CLI Reference - Installation and basic usage
API Reference - Simple constructors and programmatic usage

🔧 Customization & Advanced Usage

Configuration Guide - Custom settings and options
Model Selection Guide - Choose the right model for your needs
Path Storage Strategies - Document path management
Document Preprocessing - Content processing options

🛠️ Support

Troubleshooting Guide - Common issues and solutions

📊 Technical References

Embedding Models Comparison - Detailed benchmarks
Documentation Hub - Complete documentation index

Quick Links by User Type

User Type	Start Here	Next Steps
New Users	CLI Reference	API Reference
App Developers	API Reference	Configuration Guide
Performance Optimizers	Model Guide	Performance Benchmarks
Production Deployers	Configuration Guide	Path Strategies
Troubleshooters	Troubleshooting Guide	Preprocessing Guide

MCP Server Integration

RAG-lite TS includes a Model Context Protocol (MCP) server for integration with AI agents.

# Start MCP server
raglite-mcp

MCP Configuration:

{
  "mcpServers": {
    "rag-lite": {
      "command": "raglite-mcp",
      "args": []
    }
  }
}

Available Tools: search_documents, ingest_documents, rebuild_index, get_stats

→ Complete MCP Integration Guide

Development

Building from Source

# Clone and setup
git clone https://github.com/your-username/rag-lite-ts.git
cd rag-lite-ts
npm install

# Build and link for development
npm run build
npm link  # Makes raglite/raglite-mcp available globally

# Run tests
npm test
npm run test:integration

Project Structure

src/
├── index.ts              # Main exports and factory functions
├── search.ts             # Public SearchEngine API
├── ingestion.ts          # Public IngestionPipeline API
├── core/                 # Model-agnostic core layer
│   ├── search.ts         # Core search engine
│   ├── ingestion.ts      # Core ingestion pipeline
│   ├── db.ts             # SQLite operations
│   ├── config.ts         # Configuration system
│   └── types.ts          # Core type definitions
├── factories/            # Factory functions for easy setup
│   └── text-factory.ts   # Text-specific factories
├── text/                 # Text-specific implementations
│   ├── embedder.ts       # Text embedding generation
│   ├── reranker.ts       # Text reranking
│   └── tokenizer.ts      # Text tokenization
├── cli.ts                # CLI interface
├── mcp-server.ts         # MCP server
└── preprocessors/        # Content type processors

dist/                     # Compiled output

Design Philosophy

Simple by default, powerful when needed:

✅ Simple constructors work immediately with sensible defaults
✅ Configuration options available when you need customization
✅ Advanced patterns available for complex use cases
✅ Clean architecture with minimal dependencies
✅ No ORMs or heavy frameworks - just TypeScript and SQLite
✅ Extensible design for future capabilities

This approach ensures that basic usage is effortless while providing the flexibility needed for advanced scenarios.

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests for new functionality
Ensure all tests pass
Submit a pull request

We welcome contributions that maintain our clean architecture principles while enhancing functionality and developer experience.

License

MIT License - see LICENSE file for details.

Related Projects

transformers.js - Client-side ML models
hnswlib - Fast approximate nearest neighbor search

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.github/workflows		.github/workflows
docs		docs
examples		examples
models		models
src		src
.gitignore		.gitignore
.npmignore		.npmignore
CHANGELOG.md		CHANGELOG.md
DOCUMENTATION_UPDATES.md		DOCUMENTATION_UPDATES.md
EXTENSIBILITY_VALIDATION.md		EXTENSIBILITY_VALIDATION.md
LICENSE		LICENSE
PERFORMANCE_VALIDATION_RESULTS.md		PERFORMANCE_VALIDATION_RESULTS.md
README.md		README.md
package.json		package.json
tsconfig.json		tsconfig.json
tsconfig.test.json		tsconfig.test.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RAG-lite TS

Table of Contents

Quick Start

Installation

Basic Usage

Using Different Models

Programmatic Usage

Configuration Options

Features

How It Works

Architecture

Supported Models

Documentation

📚 Getting Started

🔧 Customization & Advanced Usage

🛠️ Support

📊 Technical References

Quick Links by User Type

MCP Server Integration

Development

Building from Source

Project Structure

Design Philosophy

Contributing

License

Related Projects

About

Uh oh!

Releases

Packages

Languages

License

raglite/rag-lite-ts

Folders and files

Latest commit

History

Repository files navigation

RAG-lite TS

Table of Contents

Quick Start

Installation

Basic Usage

Using Different Models

Programmatic Usage

Configuration Options

Features

How It Works

Architecture

Supported Models

Documentation

📚 Getting Started

🔧 Customization & Advanced Usage

🛠️ Support

📊 Technical References

Quick Links by User Type

MCP Server Integration

Development

Building from Source

Project Structure

Design Philosophy

Contributing

License

Related Projects

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages