-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
insert data too quickly causing datanode crash #25680
Comments
however, if i change insert_params: batch: 1000 to 100, everything will be ok |
One or more of the specified parts could not be found. |
@iytprince2 if you want to benchmark milvus, please try the new repo: VectorDBBench /assign @iytprince2 |
@yanliang567 |
so i don't think it's problem of minio, i use the same one |
the error comes form S3 service, with status code 400. if you can offer details logs for datacoord and datanode we can help to check if there are more details. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
finally,i prove this to be the minio proxy problem, the proxy and backend nodes have too much delay, but i still have no idea of the mechanism. |
/reopen |
@iytprince2: Reopened this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
you may want to increase flush routine number or limit insert throughput? What is the expected throughput of your insertion |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Is there an existing issue for this?
Environment
Current Behavior
when i start command
python3 main.py recall --host xxx--engine milvus --dataset-name glove-200-angular --config-name milvus_recall_k8s.yaml
datanode start to fail
pod/my-release-milvus-datanode-6b885d46-cvjzv 0/1 Running 2 11m
pod/my-release-milvus-datanode-6b885d46-s6vj2 0/1 CrashLoopBackOff 2 11m
Expected Behavior
i can insert what i want
Steps To Reproduce
Milvus Log
[2023/07/17 13:12:52.433 +00:00] [WARN] [datanode/flush_task.go:230] ["flush task error detected"] [error="All attempts results:\nattempt #1:All attempts results:\nattempt #1:One or more of the specified parts could not be found. The part may not have been uploaded, or the specified entity tag may not match the part's entity tag.\n\nattempt #2:All attempts results:\nattempt #1:One or more of the specified parts could not be found. The part may not have been uploaded, or the specified entity tag may not match the part's entity tag.\n\nattempt #3:All attempts results:\nattempt #1:One or more of the specified parts could not be found. The part may not have been uploaded, or the specified entity tag may not match the part's entity tag.\n\nattempt #4:All attempts results:\nattempt #1:One or more of the specified parts could not be found. The part may not have been uploaded, or the specified entity tag may not match the part's entity tag.\n\nattempt #5:All attempts results:\nattempt #1:One or more of the specified parts could not be found. The part may not have been uploaded, or the specified entity tag may not match the part's entity tag.\n\nattempt #6:All attempts results:\nattempt #1:One or more of the specified parts could not be found. The part may not have been uploaded, or the specified entity tag may not match the part's entity tag.\n\nattempt #7:All attempts results:\nattempt #1:One or more of the specified parts could not be found. The part may not have been uploaded, or the specified entity tag may not match the part's entity tag.\n\nattempt #8:All attempts results:\nattempt #1:One or more of the specified parts could not be found. The part may not have been uploaded, or the specified entity tag may not match the part's entity tag.\n\nattempt #9:All attempts results:\nattempt #1:One or more of the specified parts could not be found. The part may not have been uploaded, or the specified entity tag may not match the part's entity tag.\n\nattempt #10:All attempts results:\nattempt #1:One or more of the specified parts could not be found. The part may not have been uploaded, or the specified entity tag may not match the part's entity tag.\n\n"] []
[2023/07/17 13:12:52.433 +00:00] [ERROR] [datanode/flush_manager.go:775] ["flush pack with error, DataNode quit now"] [error="execution failed"] [stack="github.com/milvus-io/milvus/internal/datanode.flushNotifyFunc.func1\n\t/go/src/github.com/milvus-io/milvus/internal/datanode/flush_manager.go:775\ngithub.com/milvus-io/milvus/internal/datanode.(*flushTaskRunner).waitFinish\n\t/go/src/github.com/milvus-io/milvus/internal/datanode/flush_task.go:204"]
panic: execution failed
goroutine 877 [running]:
github.com/milvus-io/milvus/internal/datanode.flushNotifyFunc.func1(0xc062ade550)
/go/src/github.com/milvus-io/milvus/internal/datanode/flush_manager.go:777 +0x1611
github.com/milvus-io/milvus/internal/datanode.(*flushTaskRunner).waitFinish(0xc00138c180, 0xc001b81b00, 0xc000651430)
/go/src/github.com/milvus-io/milvus/internal/datanode/flush_task.go:204 +0xdb
created by github.com/milvus-io/milvus/internal/datanode.(*flushTaskRunner).init.func1
/go/src/github.com/milvus-io/milvus/internal/datanode/flush_task.go:121 +0xf1
Anything else?
No response
The text was updated successfully, but these errors were encountered: