questions on milvus connection, load collection and search #40390
Unanswered
ranjith502
asked this question in
Q&A and General discussion
Replies: 1 comment 2 replies
-
Index data is stored in S3/minio, load_collection() is to read the index data from S3/minio to querynode's memory. I believe the fluctuation of "Connection Time" is mainly caused by the network. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello Team,
I have set up a Milvus server using Docker, which is running on a Virtual Machine (VM). From Databricks, I am performing data insertion and search operations.
Questions on Collection Loading & Indexing
I understand that before performing a search, we need to load the collection into memory. However, I have a question regarding index storage:
I am using HNSW, which is an in-memory index. If the index is already stored in memory, why do we still need to load the entire collection into memory before searching?
Fluctuations in Execution Time
I have observed significant fluctuations in execution time across different operations. Below are the recorded times for 10 queries:
Connection Time (ms):
[6, 77, 6, 83, 107, 89, 83, 7, 5, 6]
Load Collection Time (ms):
[107, 12, 11, 11, 97, 11, 15, 98, 100, 10]
Search Time (ms):
[13, 87, 12, 88, 11, 10, 82, 11, 12, 10]
I would like to understand:
What could be the main reasons for these fluctuations in execution time?
Is this expected behavior due to system resource allocation, network latency, or some internal optimizations in Milvus?
Are there any best practices to stabilize query and load times?
Memory Management Questions
Once a collection is loaded into memory, how long does it remain in memory? (e.g., 5 minutes, 10 minutes, or indefinitely until manually released?)
Do we need to manually release the collection after every query, or does Milvus handle memory management automatically?
Beta Was this translation helpful? Give feedback.
All reactions