rlm-rs

Recursive Language Model (RLM) CLI for Claude Code - handles long-context tasks via chunking and recursive sub-LLM calls.

Based on the RLM pattern from arXiv:2512.24601, enabling analysis of documents up to 100x larger than typical context windows.

Features

Hybrid Semantic Search: Combined semantic + BM25 search with RRF fusion
Auto-Embedding: Embeddings generated automatically during load
Pass-by-Reference: Retrieve chunks by ID for efficient subagent processing
Multiple Chunking Strategies: Fixed, semantic, and parallel chunking
SQLite State Persistence: Reliable buffer management across sessions
Regex Search: Fast content search with context windows
Memory-Mapped I/O: Efficient handling of large files
JSON Output: Machine-readable output for integration

How It Works

Installation

Via Cargo (Recommended)

cargo install rlm-rs

Via Homebrew

brew tap zircote/tap
brew install rlm-rs

From Source

git clone https://github.com/zircote/rlm-rs.git
cd rlm-rs
make install

Quick Start

# Initialize the database
rlm-rs init

# Load a large document (auto-generates embeddings)
rlm-rs load document.md --name docs --chunker semantic

# Search with hybrid semantic + BM25
rlm-rs search "your query" --buffer docs --top-k 10

# Retrieve chunk by ID (pass-by-reference)
rlm-rs chunk get 42

# Check status
rlm-rs status

# Regex search content
rlm-rs grep docs "pattern" --max-matches 20

# View content slice
rlm-rs peek docs --start 0 --end 3000

Commands

Command	Description
`init`	Initialize the RLM database
`status`	Show current state (buffers, chunks, DB info)
`load`	Load a file into a buffer with chunking (auto-embeds)
`search`	Hybrid semantic + BM25 search across chunks
`chunk get`	Retrieve chunk by ID (pass-by-reference)
`chunk list`	List chunks for a buffer
`chunk embed`	Generate embeddings (or re-embed with --force)
`chunk status`	Show embedding status
`list`	List all buffers
`show`	Show buffer details
`delete`	Delete a buffer
`peek`	View a slice of buffer content
`grep`	Search buffer content with regex
`write-chunks`	Write chunks to individual files
`add-buffer`	Add text to a new buffer
`export-buffers`	Export all buffers to JSON
`var`	Get/set context variables
`global`	Get/set global variables
`reset`	Delete all RLM state

Chunking Strategies

Strategy	Best For	Description
`semantic`	Markdown, code, JSON	Splits at natural boundaries (headings, paragraphs)
`fixed`	Logs, plain text	Splits at exact byte boundaries
`parallel`	Large files (>10MB)	Multi-threaded fixed chunking

# Semantic chunking (default)
rlm-rs load doc.md --chunker semantic

# Fixed chunking with overlap
rlm-rs load logs.txt --chunker fixed --chunk-size 150000 --overlap 1000

# Parallel chunking for speed
rlm-rs load huge.txt --chunker parallel --chunk-size 100000

Claude Code Integration

rlm-rs is designed to work with the rlm-rs Claude Code plugin, implementing the RLM architecture:

RLM Concept	Implementation
Root LLM	Main Claude Code conversation (Opus/Sonnet)
Sub-LLM	`rlm-subcall` agent (Haiku)
External Environment	`rlm-rs` CLI + SQLite

Development

Prerequisites

Rust 1.88+ (2024 edition)
cargo-deny for supply chain security

Build

# Using Makefile
make build          # Debug build
make release        # Release build
make test           # Run tests
make check          # Format + lint + test
make ci             # Full CI check
make install        # Install to ~/.cargo/bin

# Or using Cargo directly
cargo build --release
cargo test
cargo clippy --all-targets --all-features

Project Structure

src/
├── lib.rs           # Library entry point
├── main.rs          # CLI entry point
├── error.rs         # Error types
├── core/            # Core types (Buffer, Chunk, Variable)
├── chunking/        # Chunking strategies
├── storage/         # SQLite persistence
├── io/              # File I/O with mmap
└── cli/             # Command implementations

tests/
└── integration_test.rs

MSRV Policy

The Minimum Supported Rust Version (MSRV) is 1.88.

License

MIT License - see LICENSE for details.

Documentation

RLM-Inspired Design - How rlm-rs builds on the RLM paper
Plugin Integration - Integration with Claude Code, Copilot, Codex, and more
Architecture - Internal architecture and design
CLI Reference - Complete command documentation
API Reference - Rust library documentation

Acknowledgments

claude_code_RLM - Original Python RLM implementation by Brainqub3 that inspired the creation of this project
RLM Paper (arXiv:2512.24601) - Recursive Language Model pattern by Zhang, Kraska, and Khattab (MIT CSAIL)
Claude Code - AI-powered development environment

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.claude		.claude
.github		.github
.vscode		.vscode
Formula		Formula
docs		docs
src		src
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CODEOWNERS		CODEOWNERS
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
clippy.toml		clippy.toml
deny.toml		deny.toml
rustfmt.toml		rustfmt.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rlm-rs

Features

How It Works

Installation

Via Cargo (Recommended)

Via Homebrew

From Source

Quick Start

Commands

Chunking Strategies

Claude Code Integration

Development

Prerequisites

Build

Project Structure

MSRV Policy

License

Documentation

Acknowledgments

About

Uh oh!

Releases 8

Packages

Contributors 2

Uh oh!

Languages

License

zircote/rlm-rs

Folders and files

Latest commit

History

Repository files navigation

rlm-rs

Features

How It Works

Installation

Via Cargo (Recommended)

Via Homebrew

From Source

Quick Start

Commands

Chunking Strategies

Claude Code Integration

Development

Prerequisites

Build

Project Structure

MSRV Policy

License

Documentation

Acknowledgments

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 8

Packages 0

Contributors 2

Uh oh!

Languages

Packages