-
Notifications
You must be signed in to change notification settings - Fork 159
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The node is already decommissioned
error after second decommission of the same Scylla member which was recreated
#1293
Comments
In the second run of the same scenario the bug was not reproduced: https://jenkins.scylladb.com/view/staging/job/scylla-staging/job/valerii/job/vp-longevity-scylla-operator-3h-gke/100/consoleFull |
To me it looks like the ScyllaCluster's |
Ran 2 more test runs and both didn't face this bug. |
I believe the scenario you mentioned in the original post wasn't exactly followed here. Although you can see in
It doesn't look like the Pod was ever actually deleted.
and then the opposite. Neither is present in the logs. So what actually happened, and what reproduces the issue is:
The e2e implementing this scenario:
Nevertheless I believe this case should be handled by the operator. Once we've started decommissioning the node, it shall be deleted. |
Em, what then the following means?
|
Read the rest of the comment. Printing the log doesn't magically delete the Pod. As to what exactly happened there - I have no clue. All I can see from the operator logs is that the StatefulSet wasn't scaled down/up. |
I read everything. |
I understand that. All I'm saying is that it doesn't look like the Pod was actually deleted. It doesn't imply your test was wrong or anything. Without the audit logs I can't be a hundred percent sure that the Pod wasn't deleted, but looking at the operator logs it definitely wasn't triggered by it. The logs of the Scylla node don't look like the Pod was deleted either. |
Describe the bug
The following scenario was ran twice:
ScyllaCluster
object from3
to2
ScyllaCluster
object from2
to3
And on the second loop we get following error after the decomission step:
To Reproduce
Steps to reproduce the behavior:
Expected behavior
The mentioned scenario must work any number of times.
Logs
Environment:
The text was updated successfully, but these errors were encountered: