Avoid change feed rewinds on shard moves #3711

nickva · 2021-08-24T22:59:03Z

When shards are moved to new nodes, and the user supplies a change sequence from the old shard map configuration, attempt to match missing nodes and ranges by inspecting current shard uuids in order to avoid rewinds.
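The matching step described above can be modelled roughly as follows. This is a minimal Python sketch for illustration only, not the Erlang code in `fabric_view_changes.erl`; the names `SeqEntry`, `ShardCopy`, and `match_moved_shard` are hypothetical, and it assumes each entry of the old sequence records the uuid of the shard copy it came from.

```python
from typing import NamedTuple, Optional, Sequence, Tuple

Range = Tuple[int, int]


class SeqEntry(NamedTuple):
    """One entry of a user-supplied change sequence (hypothetical model)."""
    node: str    # node that held the shard copy when the sequence was issued
    range: Range # shard range the entry covers
    uuid: str    # uuid of the shard copy, assumed to be recorded in the sequence
    seq: int     # update sequence on that copy


class ShardCopy(NamedTuple):
    """One shard copy in the current shard map, with its current uuid."""
    node: str
    range: Range
    uuid: str


def match_moved_shard(
    entry: SeqEntry, current: Sequence[ShardCopy]
) -> Optional[Tuple[str, Range, int]]:
    """Pick (node, range, start_seq) to resume the changes feed from."""
    # 1. Node and range are still in the shard map: resume as usual.
    for copy in current:
        if copy.node == entry.node and copy.range == entry.range:
            return (copy.node, copy.range, entry.seq)
    # 2. Node/range are missing: look for a copy with the same uuid,
    #    which indicates the shard was moved rather than recreated.
    for copy in current:
        if copy.uuid == entry.uuid:
            return (copy.node, copy.range, entry.seq)
    # 3. No uuid match: fall back to some copy of the range and rewind to 0.
    for copy in current:
        if copy.range == entry.range:
            return (copy.node, copy.range, 0)
    return None
```

For example, if the copy with uuid `"aaa"` moved from `node1` to `node4`, an old entry for `node1` resumes on `node4` at its recorded sequence instead of rewinding to 0.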

Previously, if a node and range were missing, we randomly picked a node in the appropriate range. Assuming three shard copies, 1/3 of the time we might have hit the exact node, but 2/3 of the time we would end up with a complete changes feed rewind to 0.

Unfortunately, this involves a fabric worker scatter-gather operation to all shard copies. This should only happen when we get an old sequence. We rely on that happening rarely, mostly right after the shards have moved; after that, users would get new sequences from the current shard map.
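The cost trade-off above can be sketched as: only contact the shard copies when the sequence refers to a node and range that the current shard map no longer has, and treat the lookup as best-effort. This is a hedged Python illustration, not the actual fabric worker code; `needs_uuid_lookup`, `gather_uuids`, and the `fetch_uuid` callback are hypothetical names.

```python
import logging
from typing import Callable, Dict, Iterable, Tuple

Copy = Tuple[str, Tuple[int, int]]  # (node, range)

log = logging.getLogger("moved_shards_sketch")


def needs_uuid_lookup(old_entries: Iterable[Copy], current_map: Iterable[Copy]) -> bool:
    """True only if the sequence names a (node, range) absent from the current map."""
    current = set(current_map)
    return any(entry not in current for entry in old_entries)


def gather_uuids(copies: Iterable[Copy], fetch_uuid: Callable[[Copy], str]) -> Dict[Copy, str]:
    """Ask every shard copy for its uuid, skipping copies that fail.

    Failures are logged and tolerated, since the match is best-effort
    (for instance, a node might not support the uuid request yet).
    """
    uuids: Dict[Copy, str] = {}
    for copy in copies:
        try:
            uuids[copy] = fetch_uuid(copy)
        except Exception as err:  # best-effort: log and continue
            log.warning("uuid lookup failed for %s: %s", copy, err)
    return uuids
```

A sequence issued from the current shard map never triggers the lookup, so the extra round trip is paid only by clients that still hold a sequence from before the move.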

jaydoane

LGTM, with just some minor formatting nits, typos, and a few questions about possible name improvements.

src/fabric/src/fabric_view_changes.erl

src/fabric/test/eunit/fabric_moved_shards_seq_tests.erl

src/fabric/src/fabric_db_uuids.erl


jaydoane · 2021-08-26T17:44:56Z

src/fabric/src/fabric_view_changes.erl

+            % Since we are doing a best-effort approach to match moved shards,
+            % tolerate and log errors. This should also handle cases when the
+            % cluster is partially upgraded, as some nodes will not have the
+            % newer get_uuid fabric_rpc handler.


this is a very nice feature!


nickva force-pushed the check-for-moved-shards branch 6 times, most recently from 6e42f23 to 21ca354 Compare August 25, 2021 23:28

nickva marked this pull request as ready for review August 25, 2021 23:29

nickva force-pushed the check-for-moved-shards branch 5 times, most recently from c617864 to b8e648d Compare August 26, 2021 16:22

jaydoane approved these changes Aug 26, 2021


nickva force-pushed the check-for-moved-shards branch 4 times, most recently from aa179e2 to 7945755 Compare August 26, 2021 20:47

nickva force-pushed the check-for-moved-shards branch from 7945755 to 6e33c46 Compare August 26, 2021 21:14

nickva merged commit e83935c into 3.x Aug 26, 2021

nickva deleted the check-for-moved-shards branch August 26, 2021 21:56

2 participants
