cluster streaming: stream client should be resilient to source cluster topology changes #66722

pbardea · 2021-06-22T15:57:11Z

Today in cluster to cluster streaming, the destination cluster communicates with a single node on the source cluster. However, since this is a long-lived job that node may be removed from the cluster. To be resilient to topology changes in the source cluster, the stream client needs to be able to maintain a list of active nodes it can reach.

This will likely be exposed as a service that the stream client (running on the destination cluster) will either periodically poll, or register to receive notifications of node additions/removals from the cluster.

The set of nodes that the destination cluster thinks is active in the source cluster should be persisted in the job so that it can be resumed.

Epic CRDB-18753

Jira issue: CRDB-8201

amruss · 2021-10-04T13:42:16Z

As of now (sept 2021) this isn't high priority work for C2C streaming migrations, but something we want to come back to

blathers-crl · 2022-04-05T20:11:20Z

cc @cockroachdb/cdc

kenliu-crl · 2022-05-24T04:39:37Z

manually reviewed and brought up to date

blathers-crl · 2022-07-01T09:48:51Z

cc @cockroachdb/tenant-streaming

pbardea added C-enhancement Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception) A-disaster-recovery T-disaster-recovery labels Jun 22, 2021

pbardea added this to Triage in Disaster Recovery Backlog via automation Jun 22, 2021

pbardea moved this from Triage to Cluster Streaming in Disaster Recovery Backlog Jun 22, 2021

livlobo added this to Triage in [DEPRECATED] CDC via automation Sep 29, 2021

blathers-crl bot added the T-cdc label Sep 29, 2021

livlobo removed this from Cluster Streaming in Disaster Recovery Backlog Sep 29, 2021

amruss moved this from Triage to Cluster Streaming in [DEPRECATED] CDC Oct 4, 2021

exalate-issue-sync bot removed the T-cdc label Feb 1, 2022

exalate-issue-sync bot assigned gh-casper Apr 5, 2022

exalate-issue-sync bot added T-cdc and removed T-disaster-recovery labels Apr 5, 2022

stevendanna added the A-tenant-streaming Including cluster streaming label Jul 1, 2022

shermanCRL added this to the 22.2 milestone Jul 7, 2022

jlinder unassigned gh-casper Sep 9, 2022

miretskiy removed this from Cluster Streaming in [DEPRECATED] CDC Jan 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cluster streaming: stream client should be resilient to source cluster topology changes #66722

cluster streaming: stream client should be resilient to source cluster topology changes #66722

pbardea commented Jun 22, 2021 •

edited by exalate-issue-sync bot

amruss commented Oct 4, 2021

blathers-crl bot commented Apr 5, 2022

kenliu-crl commented May 24, 2022

blathers-crl bot commented Jul 1, 2022

cluster streaming: stream client should be resilient to source cluster topology changes #66722

cluster streaming: stream client should be resilient to source cluster topology changes #66722

Comments

pbardea commented Jun 22, 2021 • edited by exalate-issue-sync bot

amruss commented Oct 4, 2021

blathers-crl bot commented Apr 5, 2022

kenliu-crl commented May 24, 2022

blathers-crl bot commented Jul 1, 2022

pbardea commented Jun 22, 2021 •

edited by exalate-issue-sync bot