-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Chaos] Create collection and search get stuck during datacoord is killed #5985
Comments
Actually, one more side effect is that the other operations such as create, insert, search are also blocked or performance degraded by flush hang |
I'm looking into this right now. |
Client retried too many times to connect with datacoord so the flush request blocked. Proxy handle request sequentially and other requests will wait util the previous request returns. IMHO, requests should be handled concurrently unless they actually have internal relations. This will be fixed after 2.0-RC1. @yanliang567 @czs007 |
logs from build master-20210703-111a24a: ERROR ci_test:api_request.py:25 (api_response) [Milvus API Exception]<function api_request at 0x7fd091091280>: <BaseException: (code=1, message=getSegmentsOfCollection, err:rpc error: code = DeadlineExceeded desc = context deadline exceeded)> |
/assign @ThreadDao |
Fault injection time period
The datacoord pod name before killed All pods name after chaos deleted
|
/unassign |
I didn't reproduce on commit |
Flush hang if datacoord deleted and restarted
Describe the bug
Flush hang until timeout if datacoord deleted
Steps/Code to reproduce behavior
Expected behavior
Actual
Method of installation
Docker/cpu
Docker/gpu
Build from source
Milvus version (master or released version)
Milvus 2.0 with cluster deployment
Built: Tue Jun 22 03:36:05 UTC 2021
GitCommit: 438e7fb
Additional context
The text was updated successfully, but these errors were encountered: