Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix for connector losing tablet list upon restart and RPC failures. #270

Open
wants to merge 4 commits into
base: main
Choose a base branch
from

Conversation

vaibhav-yb
Copy link
Collaborator

Problem

  1. If the connector restarts just one task, and if any tablet has been split, then chances are that we lose track of the tablet list after restart and we might start getting RuntimeException with a log message of the workaround saying that restarting the connector would solve it.
  2. Intermittently, we get a tablet=null error while calling RPCs.

Solution

  1. Now if we see that any tablet is missing from the list, we simply call the API GetTabletListToPollForCDC again to get its children and we add them to the list. By moving this way, we recreate the proper tablet list to be polled for.
  2. If we get a tablet=null error, we recreate YBClient and move ahead.

@vaibhav-yb vaibhav-yb self-assigned this Sep 25, 2023
@Sumukh-Phalgaonkar Sumukh-Phalgaonkar self-assigned this Oct 4, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants