Refactor ndc history resender to handle multiple remote clusters #2866

yycptt · 2022-05-18T23:45:21Z

What changed?

Refactor ndc history resender to handle multiple remote clusters

Why?

Existing history resend is bind to particular remote cluster and can't be used for resending history from other remote clusters. This means we need to create one resender for each remote cluster and also worry about cluster metadata update.
Needed by Guarantee history task execution #2864, so that we don't need to refactor existing standby task executor implementation (or creating multiple standby executors and add callback for cluster metadata update) but still be able to handle standby tasks which need to resend history from different remote clusters.

How did you test it?

Existing tests

Potential risks

Is hotfix candidate?

wxing1292 · 2022-05-19T01:46:12Z

service/history/replication/task_executor.go

@@ -66,6 +67,7 @@ type (
 // NewTaskExecutor creates a replication task executor
 // The executor uses by 1) DLQ replication task handler 2) history replication task processor
 func NewTaskExecutor(
+	remoteCluster string,


should the remoteCluster be the active cluster from namespace cache?

If the question is why we still have one replication task executor for each remote cluster, I think we can refactor that too and have only one executor? cc @yux0

If it's about why not letting xdc resender itself figure out the remote cluster name from ns cache, then I thought about it as well. The main concern is that if the caller can't control which remote cluster the resender is talking to, there might be a mismatch between caller and resender. E.g. when caller is replication task executor for cluster A, the resend may resend from cluster B. Another example is in standby timer/transfer task executor, we may connect to cluster A to refresh task, but may resend history from cluster B. It's rare but will make behavior reasoning much harder.

If the question is why we still have one replication task executor for each remote cluster, I think we can refactor that too and have only one executor?

Yes. We can do this

…poralio#2866)

Refactor ndc history resender to handle multiple remote clusters

4657c2b

yycptt requested a review from yux0 May 18, 2022 23:45

yycptt requested a review from a team as a code owner May 18, 2022 23:45

wxing1292 reviewed May 19, 2022

View reviewed changes

wxing1292 approved these changes May 19, 2022

View reviewed changes

yux0 approved these changes May 19, 2022

View reviewed changes

Merge branch 'master' into refactor-history-resender

57f08cc

yux0 approved these changes May 19, 2022

View reviewed changes

yycptt enabled auto-merge (squash) May 19, 2022 21:20

yycptt disabled auto-merge May 19, 2022 21:20

fix tests

0727c6a

yycptt merged commit d36291f into temporalio:master May 19, 2022

yycptt deleted the refactor-history-resender branch May 19, 2022 23:36

Sushisource pushed a commit to Sushisource/temporal that referenced this pull request Jun 7, 2022

Refactor ndc history resender to handle multiple remote clusters (tem…

5a5e1bc

…poralio#2866)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor ndc history resender to handle multiple remote clusters #2866

Refactor ndc history resender to handle multiple remote clusters #2866

yycptt commented May 18, 2022 •

edited

wxing1292 May 19, 2022

yycptt May 19, 2022

yux0 May 19, 2022

Refactor ndc history resender to handle multiple remote clusters #2866

Refactor ndc history resender to handle multiple remote clusters #2866

Conversation

yycptt commented May 18, 2022 • edited

wxing1292 May 19, 2022

Choose a reason for hiding this comment

yycptt May 19, 2022

Choose a reason for hiding this comment

yux0 May 19, 2022

Choose a reason for hiding this comment

yycptt commented May 18, 2022 •

edited