Skip to content

v1.5.1 — Performance & Search Overhaul

Choose a tag to compare

@gambletan gambletan released this 18 Mar 12:31
· 186 commits to main since this release

What's New

Performance

  • 2x faster search — HNSW rebuild debounce, read-lock search, parking_lot::Mutex
  • Half memory usage — f16 vector quantization
  • SIMD dot product for similarity search
  • Read/write connection split + WAL tuning
  • Batch operations — batch delete, batch ingest with dedup, batch salience updates
  • Prepared statement caching for all reads and writes

Features

  • FTS5 full-text search — replaces regex-based search
  • Plugin system — configurable consolidation and index strategies
  • Lazy HNSW index — builds on demand, not on startup
  • CLI tools and install script
  • Namespace isolation for multi-tenant deployments
  • Event system and observability hooks
  • Docker image on GHCR (ghcr.io/gambletan/cortex-http)

Testing

  • 74 new tests for full coverage of perf optimizations
  • 131+ total tests

Benchmarks

  • Ingest: 62µs
  • Search: <100µs
  • 1,182x faster than Mem0 cloud

Full Changelog: v1.4.0...v1.5.1