Skip to content

How to increase throughput for standalone instance? #25550

Discussion options

You must be logged in to vote

Reasons:

  • 1 million embeddings is a pretty small dataset. If you increase the data to 10 million, you will see higher CPU usage.
  • Standalone has only one query node, it searches one segment at the same time. If you use a cluster, and deploy several query nodes, they can search several segments parallelly. Large data set has more segments, suitable for parallel searching.
  • The client testing stress is not enough. Use multiple processes to call collection.search(), each process opens a connection. More intensive, more queries.

Replies: 3 comments 7 replies

Comment options

You must be logged in to vote
7 replies
@shauryagoel
Comment options

@yhmo
Comment options

@yhmo
Comment options

@shivanshu20
Comment options

@yhmo
Comment options

Answer selected by shauryagoel
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
4 participants