
vectorless

RAG without vector embeddings - tree-based retrieval powered by LLM navigation


vectorless is a Rust-based RAG (Retrieval-Augmented Generation) system that uses tree-based indexing and LLM navigation instead of traditional vector embeddings. No vector database required.

🌟 Features

  • Tree-Based Indexing - Documents are organized as hierarchical trees with summaries at each node
  • LLM Navigation - Intelligent traversal using an LLM to find relevant content
  • No Vector Database - Eliminates infrastructure complexity and costs
  • Built in Rust - Blazing fast performance with memory safety
  • HTTP API - Simple RESTful API for easy integration
  • Multiple LLM Support - Pluggable LLM providers (ZAI, OpenAI, etc.)

🚀 Quick Start

Installation

# Clone the repository
git clone https://github.com/zTgx/vectorless
cd vectorless

# Build the project
cargo build --release

Run the RAG Service

# Set your LLM API credentials
export ZAI_API_KEY="your-api-key"
export ZAI_ENDPOINT="https://api.z.ai/api/coding/paas/v4"

# Start the HTTP server
cargo run -p vectorless-rag

The server will start on http://localhost:8080

Basic Usage

use vectorless_core::*;
use vectorless_llm::openai::OpenAIClient;

#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Initialize LLM client (OpenAI)
    let llm = OpenAIClient::new("your-api-key");

    // Or use ZAI
    // let llm = vectorless_llm::zai::ZaiClient::new("your-api-key");

    // Configure indexer
    let config = IndexerConfig::builder()
        .subsection_threshold(200)
        .max_segment_tokens(4000)
        .summary_model("gpt-4o")
        .max_summary_tokens(200)
        .build();

    // Load the document to index (the path here is illustrative)
    let document_text = std::fs::read_to_string("document.txt")?;

    // Parse document into a hierarchical tree
    let root = parse_document_with_config(&llm, &document_text, &config).await?;

    // Build summaries
    build_summaries_with_config(&llm, &root, &config).await?;

    // Save index
    save(&root, "index.json")?;

    // Query
    let answer = retrieve(&llm, "What is the main topic?", &root).await?;
    println!("Answer: {}", answer);

    Ok(())
}

📚 Architecture

vectorless/
├── core/      # Core indexing and retrieval logic
├── llm/       # LLM abstraction layer
├── rag/       # HTTP RAG service
├── agent/     # Agent framework
├── cli/       # Command-line interface
└── sdk/       # Client SDK

How It Works

  1. Parse - Documents are segmented into sections based on structure
  2. Index - A hierarchical tree is built with summaries for each node
  3. Retrieve - The LLM navigates the tree to find relevant content
  4. Generate - Results are used for RAG generation
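The index in step 2 and the navigation in step 3 can be sketched with a toy structure. This is illustrative only, not the actual vectorless types: a naive keyword-overlap score stands in for the LLM's branch choice, and all names are hypothetical.

```rust
// Toy sketch of a summary tree: every node carries a summary, and
// retrieval descends toward the child whose summary best matches the query.
#[derive(Debug)]
struct Node {
    summary: String,
    content: Option<String>, // leaf nodes hold the actual text
    children: Vec<Node>,
}

// Score a summary by how many query words it contains (LLM stand-in).
fn score(summary: &str, query: &str) -> usize {
    let lower = summary.to_lowercase();
    query
        .split_whitespace()
        .filter(|w| lower.contains(w.to_lowercase().as_str()))
        .count()
}

// Walk the tree, following the best-scoring child until no child matches.
fn navigate<'a>(node: &'a Node, query: &str) -> &'a Node {
    match node.children.iter().max_by_key(|c| score(&c.summary, query)) {
        Some(best) if score(&best.summary, query) > 0 => navigate(best, query),
        _ => node,
    }
}

fn main() {
    let root = Node {
        summary: "A book about the Rust programming language".into(),
        content: None,
        children: vec![
            Node {
                summary: "Chapter on ownership and borrowing".into(),
                content: Some("Ownership rules govern memory...".into()),
                children: vec![],
            },
            Node {
                summary: "Chapter on error handling".into(),
                content: Some("Result and Option types...".into()),
                children: vec![],
            },
        ],
    };
    let hit = navigate(&root, "how does ownership work");
    println!("{}", hit.summary); // prints "Chapter on ownership and borrowing"
}
```

The real system replaces `score` with an LLM call at each level, which is what lets navigation work semantically rather than by keyword overlap.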

🔌 HTTP API

Documents

# Create a document
POST /documents
{"title": "My Document"}

# Upload content
POST /documents/{id}/content
{"content": "Document content here..."}

# List documents
GET /documents

# Get document
GET /documents/{id}

# Delete document
DELETE /documents/{id}

Query

# Query the knowledge base
POST /query
{
  "query": "What is the main point?",
  "max_results": 3
}

Health

# Check service health
GET /health

⚙️ Configuration

Configuration is done via environment variables:

LLM Provider

| Variable | Description | Default |
|---|---|---|
| `OPENAI_API_KEY` | OpenAI API key | - |
| `OPENAI_ENDPOINT` | OpenAI endpoint | `https://api.openai.com/v1` |
| `OPENAI_MODEL` | Model name | `gpt-4o` |
| `ZAI_API_KEY` | ZAI API key | - |
| `ZAI_ENDPOINT` | ZAI endpoint | `https://api.z.ai/api/paas/v4` |
| `ZAI_MODEL` | Model name | `glm-5` |

Server

| Variable | Description | Default |
|---|---|---|
| `RAG_HOST` | Server host | `0.0.0.0` |
| `RAG_PORT` | Server port | `8080` |
| `RAG_DATA_DIR` | Data directory | `./data` |
| `RAG_INDEX_DIR` | Index directory | `./indices` |
| `RAG_SUBSECTION_THRESHOLD` | Subsection token threshold | `200` |
| `RAG_MAX_SEGMENT_TOKENS` | Max segment tokens | `4000` |
| `RAG_MAX_SUMMARY_TOKENS` | Max summary tokens | `200` |
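For example, to run the service on a different port with OpenAI as the provider, export overrides before starting the server (values here are illustrative):

```shell
# Provider settings (names from the tables above)
export OPENAI_API_KEY="your-api-key"
export OPENAI_MODEL="gpt-4o"

# Server overrides
export RAG_PORT=9090
export RAG_DATA_DIR=./my-data

cargo run -p vectorless-rag
```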

🆚 Comparison

| Vector RAG | Vectorless | Keyword Search |
|---|---|---|
| Requires embedding model | ✅ No embeddings | No semantic understanding |
| Vector database costs | ✅ Zero extra costs | Keyword matching only |
| Approximate results | ✅ Precise retrieval | Limited relevance |
| Complex infrastructure | ✅ Simple deployment | No context awareness |

📖 Example

cargo run --package basic --bin basic

Output:

> Hello! How can I help you today?

Building index...
Parsing document...
Building summaries...
Saving index to index.json
Index built successfully!

Query: What is this document about?
Answer: This document is an introductory book about the Rust programming language.

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

📄 License

This project is licensed under either of:

  • Apache License, Version 2.0 (LICENSE-APACHE)
  • MIT license (LICENSE-MIT)

at your option.
