Iris is a search core library written in Rust, designed for Information Retrieval with Semantics.
Iris provides the foundational mechanisms essential for advanced search capabilities:
- Lexical search primitives for precise, exact-match retrieval
- Vector-based similarity search for deep semantic understanding
- Hybrid scoring and ranking to synthesize multiple signals into coherent results
Rather than functioning as a monolithic search engine, Iris is architected as a composable search core — a suite of modular building blocks designed to be embedded into applications, extended with custom logic, or orchestrated within distributed systems.
Comprehensive documentation is available in the docs/ directory and online at https://mosuka.github.io/iris/:
- Getting Started: Installation and basic usage.
- Core Concepts: Architecture, Lexical Search, and Vector Search.
- Advanced Features: ID Management, Persistence, and Deletions.
- API Reference
Key features:
- Pure Rust Implementation: Memory safety and high performance with zero-cost abstractions.
- Hybrid Search: Seamlessly combine BM25 lexical search with HNSW vector search using configurable fusion strategies (a standalone sketch of the fusion math follows this list).
- Multimodal Search: Native support for text-to-image and image-to-image search via CLIP embeddings.
- Rich Query DSL: Term, phrase, boolean, fuzzy, wildcard, range, and geographic queries.
- Flexible Analysis: Configurable pipelines for tokenization, normalization, and stemming (including CJK support).
- Pluggable Storage: Interfaces for in-memory, file-system, and memory-mapped storage backends.
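To make the fusion step concrete, here is a minimal, self-contained sketch of Reciprocal Rank Fusion (RRF), the well-known algorithm behind the `FusionAlgorithm::RRF` setting used in the quick start below. The `rrf_fuse` helper and the two hard-coded rankings are illustrative only, not part of the Iris API:

```rust
use std::collections::HashMap;

/// Reciprocal Rank Fusion: each ranking contributes 1 / (k + rank) per document,
/// so documents ranked highly by several signals float to the top.
fn rrf_fuse(rankings: &[Vec<&str>], k: f32) -> Vec<(String, f32)> {
    let mut scores: HashMap<String, f32> = HashMap::new();
    for ranking in rankings {
        for (i, doc_id) in ranking.iter().enumerate() {
            // Ranks are 1-based; k (commonly 60) damps the dominance of top ranks.
            *scores.entry(doc_id.to_string()).or_insert(0.0) += 1.0 / (k + (i as f32 + 1.0));
        }
    }
    let mut fused: Vec<(String, f32)> = scores.into_iter().collect();
    fused.sort_by(|a, b| b.1.partial_cmp(&a.1).unwrap());
    fused
}

fn main() {
    // Hypothetical rankings from a BM25 pass and a vector-similarity pass.
    let lexical_ranking = vec!["doc1", "doc3", "doc2"];
    let vector_ranking = vec!["doc2", "doc1", "doc4"];
    for (id, score) in rrf_fuse(&[lexical_ranking, vector_ranking], 60.0) {
        println!("{id}: {score:.5}"); // doc1 scores highest: ranked well by both signals
    }
}
```

With `k = 60.0`, doc1 wins because both signals rank it near the top, which is exactly the behavior selected by `FusionAlgorithm::RRF { k: 60.0 }` in the quick start.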
Here is a minimal end-to-end example:

```rust
use iris::{Document, Engine, FieldOption, FusionAlgorithm, Schema, SearchRequestBuilder};
use iris::analysis::analyzer::standard::StandardAnalyzer;
use iris::lexical::{FieldOption as LexicalFieldOption, TextOption, TermQuery};
use iris::vector::{FlatOption, VectorOption, VectorSearchRequestBuilder};
use iris::storage::{StorageConfig, StorageFactory};
use iris::storage::memory::MemoryStorageConfig;
use std::sync::Arc;
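
// NOTE: `MyEmbedder` below is a hypothetical stand-in. The engine needs an
// embedder that maps text to vectors matching the schema dimension (384),
// but the exact Iris embedder trait and its signature are not shown here,
// so this stub is a sketch only; see the Candle and OpenAI embedder examples.
struct MyEmbedder;
// impl /* Iris embedder trait */ for MyEmbedder {
//     /* e.g. fn embed(&self, text: &str) -> iris::Result<Vec<f32>> { Ok(vec![0.0; 384]) } */
// }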

fn main() -> iris::Result<()> {
    // 1. Create an in-memory storage backend
    let storage = StorageFactory::create(StorageConfig::Memory(MemoryStorageConfig::default()))?;

    // 2. Define a schema with separate lexical and vector fields
    let schema = Schema::builder()
        .add_field("content", FieldOption::Lexical(LexicalFieldOption::Text(TextOption::default())))
        .add_field("content_vec", FieldOption::Vector(VectorOption::Flat(FlatOption { dimension: 384, ..Default::default() })))
        .build();

    // 3. Create the engine with an analyzer and an embedder
    let engine = Engine::builder(storage, schema)
        .analyzer(Arc::new(StandardAnalyzer::default()))
        .embedder(Arc::new(MyEmbedder)) // Your embedder implementation
        .build()?;

    // 4. Index documents into both the lexical and vector fields
    engine.put_document("doc1",
        Document::new()
            .add_text("content", "Rust is a systems programming language")
            .add_text("content_vec", "Rust is a systems programming language")
    )?;
    engine.put_document("doc2",
        Document::new()
            .add_text("content", "Python is great for machine learning")
            .add_text("content_vec", "Python is great for machine learning")
    )?;
    engine.commit()?;

    // 5. Hybrid search: combine lexical keyword matching with semantic similarity
    let results = engine.search(
        SearchRequestBuilder::new()
            .with_lexical(Box::new(TermQuery::new("content", "programming")))
            .with_vector(VectorSearchRequestBuilder::new().add_text("content_vec", "systems language").build())
            .fusion(FusionAlgorithm::RRF { k: 60.0 })
            .build()
    )?;

    // 6. Display each hit with its document content and fused score
    for hit in results {
        let content = hit.document.as_ref()
            .and_then(|doc| doc.fields.get("content"))
            .and_then(|v| v.as_text())
            .unwrap_or("");
        println!("[{}] {} (score={:.4})", hit.id, content, hit.score);
    }

    Ok(())
}
```

You can find usage examples in the examples/ directory:
- Unified Search - Lexical, Vector, and Hybrid search in one cohesive example
- Multimodal Search - Text-to-image and image-to-image search
- Term Query - Basic keyword search
- Boolean Query - Complex boolean expressions (AND, OR, NOT)
- Phrase Query - Exact phrase matching
- Fuzzy Query - Approximate string matching
- Wildcard Query - Pattern-based search
- Range Query - Numeric and date range queries
- Geo Query - Geographic search
- Span Query - Positional queries
- Candle Embedder - Local BERT embeddings
- OpenAI Embedder - Cloud-based embeddings
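
The hybrid request in the quick start is composable: either signal can also be used on its own. Below is a minimal sketch of a lexical-only search, reusing the `engine` from the quick start and assuming the vector clause and fusion step can simply be omitted from the builder:

```rust
// Lexical-only search (sketch): no vector clause, no fusion.
// Assumes `with_vector`/`fusion` are optional on SearchRequestBuilder.
let results = engine.search(
    SearchRequestBuilder::new()
        .with_lexical(Box::new(TermQuery::new("content", "rust")))
        .build()
)?;
for hit in results {
    println!("[{}] score={:.4}", hit.id, hit.score);
}
```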
We welcome contributions! Please see our Contributing Guidelines for details.
- Fork the repository
- Create a feature branch (`git checkout -b feature/amazing-feature`)
- Commit your changes (`git commit -m 'Add some amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
This project is licensed under either of
- Apache License, Version 2.0 (LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0)
- MIT License (LICENSE-MIT or http://opensource.org/licenses/MIT)
at your option.