New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cassandra timeout during SIMPLE write query at consistency ALL in the AddRemoveDc nemesis #10274
Comments
Installation detailsKernel Version: 5.13.0-1019-aws Scylla version (or git commit hash): Cluster size: 4 nodes (i3.2xlarge) Scylla Nodes used in this run:
OS / Image: Test: Test id: Test name: Test config file(s): Issue descriptionAlmost same issue but the c-s command failed with different error:
Logs:
|
I expect this is the same issue as #10296 gossip not in synch with the nodes will cause this failure as well. |
I did not check the complete sequence but you can - I expect a simple reproducer is also possible to create for this case |
@asias can we verify from the logs somehow if this is indeed the same cause (old nodes not leaving the cluster)? |
@asias ^^ |
@eliransin / @asias ping |
I don't think this issue still happens, since we run this longevity and this nemesis many times since and I don't see more mentions of the reproducers. |
I am closing this. In #10296, we requested to get more info when the issue happens.
|
Installation details
Kernel Version: 5.13.0-1017-aws
Scylla version (or git commit hash):
5.1.dev-20220317.c45050895403
with build-id076f513b6143670def988c7626389b270411f8f7
Cluster size: 4 nodes (i3.2xlarge)
Scylla Nodes used in this run:
OS / Image:
ami-0f50a374dd30afb62
(aws: eu-north-1)Test:
longevity-lwt-3h-test
Test id:
e59d59d8-3892-4fda-8e50-e651387aff96
Test name:
longevity/longevity-lwt-3h-test
Test config file(s):
Issue description
Before the nemesis AddRemoveDc had started we made sure the cluster was up and running
During the nemesis the new DC (eu-north_nemesis_dc) was created and a new node (Node-5) was added to it.
Rebuild operation was performed on new DC
Repair operation was performed for each node
All the nodes had had the same schema version before the c-s was started
Finally, the following c-s command started
And it almost immediately returned the critical error that failed the whole test:
$ hydra investigate show-monitor e59d59d8-3892-4fda-8e50-e651387aff96
$ hydra investigate show-logs e59d59d8-3892-4fda-8e50-e651387aff96
Logs:
Jenkins job URL
The text was updated successfully, but these errors were encountered: