Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DocDB] During rolling upgrade with workload is running in parallel one of the node becomes unreachable. #22217

Open
1 task done
shantanugupta-yb opened this issue May 1, 2024 · 0 comments
Assignees
Labels
area/docdb YugabyteDB core features kind/bug This issue is a bug priority/high High Priority

Comments

@shantanugupta-yb
Copy link

shantanugupta-yb commented May 1, 2024

Jira Link: DB-11135

Description

During rolling upgrade with workload is running in parallel one of the node becomes unreachable.
Upgrade 2.18.7.0-b38 >> 2.21.0.0-b509
Upgrade 2.18.7.0-b38 >> 2.21.0.0-b504

On both the cluster, the N1 node is unreachable(Blue line in the attached snapshot). From the snapshot it can be seen that the rolling upgrade follows the sequence of N1>>N2>>N3. During rolling upgrade the connections of the node undergoing upgrade is distributed amongst the remaining nodes but when the final node N3 undergoes the rolling upgrade process it is seen that the distribution of connections is not equal amongst N1(229) and N2(75). Most likely due to 229 connections on N1 the node seems to be becoming unresponsive/unreachable.

Also observed that the unreachable AWS VM had CPU utilisation was pegged at 99+%.

image image

Issue Type

kind/bug

Warning: Please confirm that this issue does not contain any sensitive information

  • I confirm this issue does not contain any sensitive information.
@shantanugupta-yb shantanugupta-yb added area/docdb YugabyteDB core features priority/high High Priority status/awaiting-triage Issue awaiting triage labels May 1, 2024
@yugabyte-ci yugabyte-ci added the kind/bug This issue is a bug label May 1, 2024
@rthallamko3 rthallamko3 removed the status/awaiting-triage Issue awaiting triage label May 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/docdb YugabyteDB core features kind/bug This issue is a bug priority/high High Priority
Projects
None yet
Development

No branches or pull requests

4 participants