Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[xCluster]: Change xCluster Metrics Collection Interval from 1s to 15s #9417

Closed
rahuldesirazu opened this issue Jul 21, 2021 · 2 comments
Closed
Assignees
Labels
area/cdc Change Data Capture

Comments

@rahuldesirazu
Copy link
Contributor

We currently set --update_metrics_interval_ms = 1000, which means we collect metrics every 1s. This operation opens the cdc_state table and loops through all tablets under-replication, which can be an expensive operation. Since we don't need such fine-grained metrics, we can change this to 15000 to collect every 15s.

@rahuldesirazu rahuldesirazu added the area/cdc Change Data Capture label Jul 21, 2021
@rahuldesirazu rahuldesirazu self-assigned this Jul 21, 2021
@rahuldesirazu rahuldesirazu added this to To do in xCluster replication via automation Jul 21, 2021
@sanketkedia sanketkedia self-assigned this Jul 28, 2021
@sanketkedia
Copy link
Contributor

Taking this up.

@bmatican
Copy link
Contributor

bmatican commented Aug 9, 2021

talked to @sanketkedia offline, since he's short term focusing on PITR for the next 2.6, we can take this on in the short term xcluster work as well

rahuldesirazu added a commit that referenced this issue Aug 20, 2021
… and bump update_metrics_interval_ms to 15000

Summary:
#9662:
Currently a user has to set enable_log_retention_by_op_idx to true for xCluster setups. We can default this flag to true, since we only change WAL retention policy for entries in the cdc_state table. So for setups with no xCluster enabled, changing this flag will have no effect.

#9417:
We currently set --update_metrics_interval_ms = 1000, which means we collect metrics every 1s. This operation opens the cdc_state table and loops through all tablets under-replication, which can be an expensive operation. Since we don't need such fine-grained metrics, we can change this to 15000 to collect every 15s.

Test Plan: Jenkins

Reviewers: nicolas, jhe

Reviewed By: jhe

Subscribers: ybase, bogdan

Differential Revision: https://phabricator.dev.yugabyte.com/D12638
rahuldesirazu added a commit that referenced this issue Aug 31, 2021
… true by default and bump update_metrics_interval_ms to 15000

Summary:
#9662:
Currently a user has to set enable_log_retention_by_op_idx to true for xCluster setups. We can default this flag to true, since we only change WAL retention policy for entries in the cdc_state table. So for setups with no xCluster enabled, changing this flag will have no effect.

#9417:
We currently set --update_metrics_interval_ms = 1000, which means we collect metrics every 1s. This operation opens the cdc_state table and loops through all tablets under-replication, which can be an expensive operation. Since we don't need such fine-grained metrics, we can change this to 15000 to collect every 15s.

Original Commit : D12638 /2933bf506fbf7b6ac8fe6d6715fa4c03b8dcd550

Test Plan: Jenkins: rebase: 2.6

Reviewers: nicolas, jhe

Reviewed By: jhe

Subscribers: ybase, bogdan

Differential Revision: https://phabricator.dev.yugabyte.com/D12804
xCluster replication automation moved this from To do to Done Sep 9, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/cdc Change Data Capture
Development

No branches or pull requests

4 participants