Releases: weaviate/weaviate

v1.23.6 - Support for OpenAI's V3 embedding models, gRPC nested objects missing value and shard and replica selection Fixes

26 Jan 18:25

Breaking Changes

none

New Features

none

Fixes

  • Refine shard and replica selection to distribute replicas across more nodes by @redouan-rhazouani in #4077
  • Fix nested objects with missing values by @dirkkul in #4072
  • Run CI (incl docker-push) after new commits have been merged into stable by @dirkkul in #4081
  • Add support for OpenAI's new V3 embedding models by @antas-marcin in #4085

Full Changelog: v1.23.5...v1.23.6

v1.23.5 - AWS module gRPC headers, OpenAI error response, Hybrid vector search and Keyword search with special characters Fixes

25 Jan 17:00

Breaking Changes

none

New Features

none

Fixes

Full Changelog: v1.23.4...v1.23.5

v1.22.11 - Regex error in keyword search when passing special characters Fix

23 Jan 19:38

Breaking Changes

none

New Features

none

Fixes

  • Fix regex error in keyword search when passing special characters in query by @antas-marcin in #4061

Full Changelog: v1.22.10...v1.22.11

v1.23.4 - gRPC API Fixes, Added support for SageMaker in text2vec-aws module, Sharded locks lock contention Fix

22 Jan 13:34

Breaking Changes

none

New Features

none

Fixes

Full Changelog: v1.23.3...v1.23.4

v1.22.10 - Sharded locks lock contention Fix

22 Jan 11:55

Breaking Changes

none

New Features

none

Fixes

Full Changelog: v1.22.9...v1.22.10

v1.23.3 - gRPC API improvements

15 Jan 08:44

Breaking Changes

none

New Features

none

Fixes

  • Add support for returning phone number properties in gRPC queries by @tsmith023 in #3961
  • async: fix double close of chunk channel by @asdine in #3998
  • Unify text2vec module components by @antas-marcin in #3990
  • Fix uuid casing for references added with object, single refs and bat… by @dirkkul in #4001
  • Unify text2vec-contextionary module by @antas-marcin in #4004
  • [GRPC] Add structured references for filters + GRPC batch delete endpoint by @dirkkul in #3994
  • Allow discerning between nil and [] in ref props by @tsmith023 in #4006

Full Changelog: v1.23.2...v1.23.3

v1.23.2 - gRPC API generative search Fix, support for baseURL in Azure OpenAI endpoints in text2vec-openai module

08 Jan 12:56

Breaking Changes

none

New Features

none

Fixes

New Contributors

Full Changelog: v1.23.1...v1.23.2

v1.23.1 - gRPC API enhancements, PQ stability Fixes, Cycle Manager Improvements

05 Jan 14:20

Breaking Changes

none

New Features

none

Fixes

Full Changelog: v1.23.0...v1.23.1

v1.22.9 - PQ stability Fixes, Cycle Manager Improvements

05 Jan 09:34

Breaking Changes

none

New Features

none

Fixes

  • [v1.22.x] Fix issue with deleted node before PQ fitting started by @etiennedi in #3955
  • [v1.22.x] Handle vector cache miss during PQ fitting by @etiennedi in #3958
  • [v1.22.x] Fix broken PQ empty checks by @etiennedi in #3954
  • cyclemanager: abort routine if running while attempting to unregister by @asdine in #3964

Full Changelog: v1.22.8...v1.22.9

v1.23.0 - Binary quantization support, Startup time improvements, New Generative Anyscale module, gRPC API performance improvements

18 Dec 18:27

Breaking Changes

Nodes Status Response Verbosity

Getting the status for all nodes in a cluster can be a very expensive query when each node contains a large number of shards, because the metadata for each shard is included in the response. A new output verbosity option changes this: the default level is now minimal, which omits individual shard metadata. The new verbose level includes shard metadata and returns a response body identical to the nodes status response from before this release. Clients that rely on shard metadata in the nodes status response must now request the verbose level explicitly.
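
As a rough illustration of the two verbosity levels, the sketch below builds the request URL for each. The /v1/nodes path and the output query parameter name are assumptions that these notes do not spell out, and nodes_status_url is a hypothetical helper, so check the API reference before relying on it.

```python
from urllib.parse import urlencode


def nodes_status_url(base_url: str, verbose: bool = False) -> str:
    """Build a nodes status URL for the requested verbosity level.

    Assumption: the endpoint is /v1/nodes and verbosity is selected with an
    `output` query parameter taking "minimal" or "verbose".
    """
    query = urlencode({"output": "verbose" if verbose else "minimal"})
    return f"{base_url.rstrip('/')}/v1/nodes?{query}"


# Default: minimal output, no per-shard metadata.
print(nodes_status_url("http://localhost:8080"))
# Opt in to the pre-1.23 response shape, including shard metadata.
print(nodes_status_url("http://localhost:8080", verbose=True))
```

Passing the level explicitly, even when it matches the default, keeps client behaviour stable if the default changes again.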

New Features

Binary Quantization / Brute Force Index

Exciting news! We've added a brute force search feature that runs efficiently straight from disk. You can search over the original vectors, or enable binary compression for faster processing and fewer disk reads. Currently, compression works best for specific data types, but stay tuned! We're working towards an update where, with compression enabled, we'll mix disk and memory operations for even better performance. This enhancement is on the horizon, so keep an eye out! 🚀✨

  • Binary quantization & brute force index type by @abdelr in #3690
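
To make the idea concrete, here is a minimal sketch of binary quantization combined with a brute force scan. It assumes the common sign-based scheme (one bit per dimension, set when the component is positive) and Hamming distance for ranking. The release notes do not describe Weaviate's internal encoding, so every name below is illustrative rather than the actual implementation.

```python
from typing import List, Sequence, Tuple


def binarize(vector: Sequence[float]) -> int:
    """Pack one bit per dimension into an int: 1 if the component is positive."""
    bits = 0
    for value in vector:
        bits = (bits << 1) | (1 if value > 0 else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Count the bit positions in which two codes differ."""
    return bin(a ^ b).count("1")


def brute_force_search(
    query: Sequence[float], codes: List[int], k: int
) -> List[Tuple[int, int]]:
    """Scan every code and return the k closest as (index, distance) pairs."""
    query_code = binarize(query)
    scored = sorted((hamming(query_code, code), idx) for idx, code in enumerate(codes))
    return [(idx, dist) for dist, idx in scored[:k]]


vectors = [[0.9, -0.1, 0.3], [-0.5, -0.2, 0.1], [0.8, 0.2, -0.4]]
codes = [binarize(v) for v in vectors]
print(brute_force_search([1.0, -1.0, 1.0], codes, k=2))
```

Each vector shrinks to one bit per dimension, which is why a full scan over compressed codes needs far fewer disk reads than a scan over the original floats.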

Startup Time / MTTR Improvements

Previously our mean time to recovery (MTTR) / node startup time was significantly impacted by nodes which contain a large number of shards or tenants. This is because the database had to synchronously load each shard from disk before startup was complete and the requests were ready to be served. Well, say goodbye to those days!

We've introduced an amazing new feature: a lazy-loaded shard abstraction layer. This game-changer drastically speeds up node startup and recovery times. How? By loading shards in the background without blocking startup. This means your nodes are up and running almost instantly - talk about efficiency!

And here's the best part: if a request arrives for a shard that's not yet loaded, Weaviate loads that shard on demand and serves the request right away. The rest of the shards continue to load in the background. This is a huge leap forward in performance and responsiveness. So, gear up to experience a smoother, faster, and more efficient DB.
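
The behaviour described above can be sketched as follows. This is a simplified model, not Weaviate's actual code: LazyShard, ShardIndex, and the loader callback are hypothetical names, and a shard is represented by a plain dictionary.

```python
import threading
from typing import Callable, Dict, Iterable, Optional


class LazyShard:
    """Wraps a shard whose data is loaded on first use (hypothetical model)."""

    def __init__(self, name: str, loader: Callable[[str], Dict[str, str]]):
        self.name = name
        self._loader = loader
        self._lock = threading.Lock()
        self._data: Optional[Dict[str, str]] = None

    def ensure_loaded(self) -> Dict[str, str]:
        # The lock guarantees the loader runs once, even if a request and
        # the background loader reach the same shard at the same time.
        with self._lock:
            if self._data is None:
                self._data = self._loader(self.name)
            return self._data


class ShardIndex:
    def __init__(self, names: Iterable[str], loader: Callable[[str], Dict[str, str]]):
        self.shards = {name: LazyShard(name, loader) for name in names}

    def start_background_load(self) -> threading.Thread:
        """Load all shards without blocking the caller (node startup)."""
        def load_all() -> None:
            for shard in self.shards.values():
                shard.ensure_loaded()

        thread = threading.Thread(target=load_all, daemon=True)
        thread.start()
        return thread

    def get(self, shard_name: str, key: str) -> Optional[str]:
        """Serve a request, loading the target shard on demand if needed."""
        return self.shards[shard_name].ensure_loaded().get(key)
```

A request for a shard that has not been loaded pays the load cost once, and the background thread skips shards that requests have already pulled in.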

Auto-Compression

The introduction of Product Quantization (PQ) was a huge step forward in efficient vector operations. And now, we're pushing the envelope even further with auto-compression.

Here's the deal: when your in-memory vector index hits a certain threshold, PQ kicks in automatically, compressing the index. This means smarter, smoother, and super-efficient handling of your data without lifting a finger.

  • Automatically trigger index compression after threshold by @asdine in #3771
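
The trigger logic can be pictured with a small sketch. It assumes the threshold is an object count and treats the PQ fitting step as a placeholder. AutoCompressingIndex and its attributes are hypothetical and do not reflect Weaviate's configuration names or internals.

```python
from typing import List, Optional, Sequence


class AutoCompressingIndex:
    """Toy index that switches to compressed mode once a threshold is hit."""

    def __init__(self, threshold: int):
        self.threshold = threshold  # assumption: measured in number of vectors
        self.vectors: List[List[float]] = []
        self.compressed = False
        self.fitted_on: Optional[int] = None  # vectors available at fitting time

    def add(self, vector: Sequence[float]) -> None:
        self.vectors.append(list(vector))
        # Compression is triggered once, the first time the threshold is reached.
        if not self.compressed and len(self.vectors) >= self.threshold:
            self._compress()

    def _compress(self) -> None:
        # Placeholder: a real index would fit PQ codebooks on the stored
        # vectors here and re-encode them.
        self.fitted_on = len(self.vectors)
        self.compressed = True


index = AutoCompressingIndex(threshold=3)
for v in ([0.1, 0.2], [0.3, 0.4], [0.5, 0.6], [0.7, 0.8]):
    index.add(v)
print(index.compressed, index.fitted_on)
```

Waiting for a minimum amount of data before compressing gives the quantizer enough samples to fit on, which is the usual motivation for a threshold of this kind.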

Modules

Resource Guardrails

  • Set resource limits to prevent the server going OOM or swapping worker threads by @donomii in #3731

Performance Optimizations

  • Removes tombstones when merging with root segment by @aliszka in #3666
  • Add option to force compactions every cycle, where it is advantageous by @donomii in #3675
  • Improve Cursor Performance by reusing memory by @etiennedi in #3660
  • Improvement of stop condition within SearchByVectorDistance by @aliszka in #3742
  • Improved filtered flat search stop condition by @aliszka in #3753
  • Optional bloom filter and count net additions calculations by @aliszka in #3756
  • Implement pread for replace strategy cursor by @parkerduckworth in #3727
  • Setting optimal segments on default, based on the dimensions by @abdelr in #3790
  • Send uuid as a byte by @dirkkul in #3894

gRPC Improvements

  • Add config and support for gRPC TLS credentials by @mikewyer in #3794
  • Add filter for metadata to GRPC by @dirkkul in #3861
  • Introduce custom pb.Properties message to contain type-aware properties within search result by @tsmith023 in #3820
  • Add support for geo coordinates in gRPC by @dirkkul in #3883
  • Introduce ReturnAllNonrefProperties bool to PropertiesRequest by @tsmith023 in #3899

Nodes API

Internal System Restructuring

  • Restructure internal file structure to align with the class structure by @parkerduckworth in #3719
  • Migration of brute force index's buckets to main store by @aliszka in #3740
  • Unification of compressed vectors bucket names by @aliszka in #3865
  • Merge PQ bucket into shard store, migrate PQ files from flat file structure by @parkerduckworth in #3726

Other

Fixes

New Contributors

Full Changelog: v1.22.6...v1.23.0