-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
EmergencyReparentShard still trying to get replication status from broken primary VTTablet #7559
Comments
Adding my work notes , getting same issue "failed to get replication status from" failed master. Control Panel shows two masters after the error. vttablet.out:
|
One more note from vttablet.out
|
@vkozjak which version of vitess did you run this with? |
Same issue here, and I'm using vitess-v11.0.0-aa798b8.tar.gz. |
Since, |
Closing the issue for now. Please reopen if further information or discussion is required |
Overview of the Issue
During network failure/corruption on VTTablet pods, it is possible to enter an unrecoverable state for a keyspace. This happens when primary VTTablet has lost its
mysql
container andEmergencyReparent
command still attempts to get replication status from a failed primary.Potentially related: #7523
Reproduction Steps
Steps to reproduce this issue, example:
EmergencyReparentShard
is failing, and trying toignore_replicas
on the broken primary still fails:Binary version
Example: Built from v9.0 release SHA
Version: daa608598 (Git branch 'HEAD') built on Wed Jan 27 22:05:48 UTC 2021 by vitess@d00a879dec03 using go1.15.6 linux/amd64
Operating system and Environment details
OS, Architecture, and any other information you can provide
about the environment.
cat /etc/os-release
):uname -sr
):uname -m
):Log Fragments
(see above)
The text was updated successfully, but these errors were encountered: