Issue #1405: ReplicationWorker should back-off retrying. #1406

Closed
reddycharan wants to merge 3 commits into apache:master from reddycharan:fixreplicator

Conversation

@reddycharan
Contributor

Descriptions of the changes in this PR:

ReplicationWorker should back off after a ledger's replication
has failed a threshold number of times.

Currently, ReplicationWorker retries in a busy loop when
replication of a ledger fails. Instead, it should back off
once replication of that ledger has failed a threshold
number of times. This can be done by deferring the
release of the underreplicated-ledger lock by
'lockReleaseOfFailedLedgerGracePeriod'.

Master Issue: #1405
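
To illustrate the idea in the description, here is a minimal, self-contained sketch of the back-off decision. It is not the code from this PR: the class `FailedLedgerBackoff`, its methods, and the `failureThreshold` parameter are hypothetical names, and `gracePeriodMs` only stands in for 'lockReleaseOfFailedLedgerGracePeriod'. Resetting the failure count after each back-off is also an assumption.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

/**
 * Hypothetical helper (not part of BookKeeper): tracks replication failures
 * per ledger and reports how long the release of the underreplicated-ledger
 * lock should be deferred.
 */
class FailedLedgerBackoff {
    // Assumed name: failures tolerated before backing off.
    private final int failureThreshold;
    // Stands in for 'lockReleaseOfFailedLedgerGracePeriod', in milliseconds.
    private final long gracePeriodMs;
    // Failure count per ledger id.
    private final Map<Long, AtomicInteger> failures = new ConcurrentHashMap<>();

    FailedLedgerBackoff(int failureThreshold, long gracePeriodMs) {
        this.failureThreshold = failureThreshold;
        this.gracePeriodMs = gracePeriodMs;
    }

    /**
     * Records one failed replication attempt for the ledger.
     *
     * @return 0 if the lock can be released immediately, or the grace period
     *         if the failure count has reached the threshold
     */
    long onReplicationFailure(long ledgerId) {
        int count = failures
                .computeIfAbsent(ledgerId, id -> new AtomicInteger())
                .incrementAndGet();
        if (count < failureThreshold) {
            return 0L;
        }
        // Assumption: start counting afresh once a back-off has been applied.
        failures.remove(ledgerId);
        return gracePeriodMs;
    }

    /** Forgets any recorded failures once the ledger replicates successfully. */
    void onReplicationSuccess(long ledgerId) {
        failures.remove(ledgerId);
    }
}
```

A caller could pass the returned delay to something like `ScheduledExecutorService.schedule` so that the lock is released only after the grace period, which keeps the worker from picking up the same ledger again right away.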

Contributor

@eolivelli eolivelli left a comment

How about adding a counter? Not a blocker for me.

Member

@jiazhai jiazhai left a comment

+1

@reddycharan
Contributor Author

@eolivelli added relevant log and counter

Contributor

@eolivelli eolivelli left a comment

+1 looks good.
Thanks

@sijie sijie added this to the 4.8.0 milestone May 17, 2018
@sijie sijie closed this in 8c077a9 May 17, 2018
reddycharan added a commit to reddycharan/bookkeeper that referenced this pull request May 21, 2018
Master Issue: apache#1405

Author: cguttapalem <cguttapalem@salesforce.com>

Reviewers: Enrico Olivelli <eolivelli@gmail.com>, Jia Zhai <None>, Sijie Guo <sijie@apache.org>

This closes apache#1406 from reddycharan/fixreplicator, closes apache#1405
reddycharan added a commit to reddycharan/bookkeeper that referenced this pull request Jul 6, 2018
reddycharan added a commit to reddycharan/bookkeeper that referenced this pull request Jul 9, 2018