Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cluster streaming: stream client should be resilient to source cluster topology changes #66722

Open
pbardea opened this issue Jun 22, 2021 · 4 comments
Labels
A-disaster-recovery A-tenant-streaming Including cluster streaming C-enhancement Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception) T-cdc
Milestone

Comments

@pbardea
Copy link
Contributor

pbardea commented Jun 22, 2021

Today in cluster to cluster streaming, the destination cluster communicates with a single node on the source cluster. However, since this is a long-lived job that node may be removed from the cluster. To be resilient to topology changes in the source cluster, the stream client needs to be able to maintain a list of active nodes it can reach.

This will likely be exposed as a service that the stream client (running on the destination cluster) will either periodically poll, or register to receive notifications of node additions/removals from the cluster.

The set of nodes that the destination cluster thinks is active in the source cluster should be persisted in the job so that it can be resumed.

Epic CRDB-18753

Jira issue: CRDB-8201

@pbardea pbardea added C-enhancement Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception) A-disaster-recovery T-disaster-recovery labels Jun 22, 2021
@pbardea pbardea added this to Triage in Disaster Recovery Backlog via automation Jun 22, 2021
@pbardea pbardea moved this from Triage to Cluster Streaming in Disaster Recovery Backlog Jun 22, 2021
@livlobo livlobo added this to Triage in [DEPRECATED] CDC via automation Sep 29, 2021
@blathers-crl blathers-crl bot added the T-cdc label Sep 29, 2021
@livlobo livlobo removed this from Cluster Streaming in Disaster Recovery Backlog Sep 29, 2021
@amruss amruss moved this from Triage to Cluster Streaming in [DEPRECATED] CDC Oct 4, 2021
@amruss
Copy link
Contributor

amruss commented Oct 4, 2021

As of now (sept 2021) this isn't high priority work for C2C streaming migrations, but something we want to come back to

@blathers-crl
Copy link

blathers-crl bot commented Apr 5, 2022

cc @cockroachdb/cdc

@kenliu-crl
Copy link
Contributor

manually reviewed and brought up to date

@stevendanna stevendanna added the A-tenant-streaming Including cluster streaming label Jul 1, 2022
@blathers-crl
Copy link

blathers-crl bot commented Jul 1, 2022

cc @cockroachdb/tenant-streaming

@shermanCRL shermanCRL added this to the 22.2 milestone Jul 7, 2022
@miretskiy miretskiy removed this from Cluster Streaming in [DEPRECATED] CDC Jan 4, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-disaster-recovery A-tenant-streaming Including cluster streaming C-enhancement Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception) T-cdc
Projects
None yet
Development

No branches or pull requests

6 participants