New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Poor performance with scaling #4572
Comments
@rrajp : Thank you for reporting the issue. Could you provide more information on the type of query being used? Are you able to share code snippet here? |
Same problem. |
I did little bit of experiments and found that main cause of latency has been the hybrid search. I am using hybrid with BM25 (alpha=0.5). If I make only vector query, it usually returns in 10ms. Also, I query is not global but applicable to almost 5000 data points ( applied with where on document id). |
I'v tried vector search and hybrid search, both will cause timeout. I was using 1.23.10 earlier but it was very slow, normly it will costs 10 seconds or more with hybrid search, so I tried newest version, but I get timeout instead. I found some suggestion about setting timeout in python client, but it won't fix the slow response problem. |
FWIW we have same problems on bm25 search, super slow on moderately large index, and clearly second class citizen (no stemmer, etc). We just accepted the fact that we can't use it outside of creation of dataset and other offline scenario. |
@pommedeterresautee @scd10 @rrajp Until then, there are a couple of tips that can help:
About the scale of data, I have a couple of questions that may help me with further advise:
|
Tks for your answer. Our index is made of 40m of docs, each having 2 fields indexed for bm25 search. |
@amourao Thanks for the reply. For your questions:
|
Ok, both your setups are well though, the only thing that jumps to mind are the slightly large queries with 10~30 tokens. So, keyword search on Weaviate is quite disk-heavy, and relies on OS memory caching. |
How to reproduce this bug?
Ingested 18 million objects
Weaviate version 1.24.4 running on docker
Persistent data size ~180Gb on disk
Instance configuration: 256 Gb RAM , 128 Core
What is the expected behavior?
Average query latency should be < 1 sec
What is the actual behavior?
Every new query, It is taking anywhere between 5 to 15 sec per query.
Exact query second time hit take 3-4 sec
Supporting information
Server Version
1.24.4
Code of Conduct
The text was updated successfully, but these errors were encountered: