Add basic "batteries-included" `retry::Policy`s. #414

hdevalence · 2020-02-11T01:22:29Z

These are lifted out of the test and example code. Two policies are provided:

RetryLimit, with a bounded number of retry attempts;
RetryError, with unbounded retry attempts.

Both policies require Req: Clone as it's not possible to write a generic
retry policy without being able to clone requests.

The docs should be updated with a little blurb that explains when they should
not be used. Also, the Policy docs had an Attempts example corresponding
to RetryLimit (formerly Limit in the tests code); I removed it because
RetryLimit exists and can be view-sourced.

These are lifted out of the test and example code. Two policies are provided: * `RetryLimit`, with a bounded number of retry attempts; * `RetryError`, with unbounded retry attempts. Both policies require `Req: Clone` as it's not possible to write a generic retry policy without being able to clone requests. The docs should be updated with a little blurb that explains when they should not be used. Also, the `Policy` docs had an `Attempts` example corresponding to `RetryLimit` (formerly `Limit` in the tests code); I removed it because `RetryLimit` exists and can be view-sourced.

LucioFranco · 2020-02-11T01:43:15Z

tower-retry/src/policies.rs

+impl<Req: Clone, Res, E> Policy<Req, Res, E> for RetryLimit {
+    type Future = future::Ready<Self>;
+    fn retry(&self, _: &Req, result: Result<&Res, &E>) -> Option<Self::Future> {
+        if result.is_err() {


I wonder if there is a way to traitize this? For example, http retries you may want to retry 500s but not 400s. In this case, if I were to apply this to hyper I would only be able to retry system errors not http errors. What do you think about providing some method to allow users to specify what they want to retry?

One possibility would be to construct the RetryLimit with a closure F: FnMut(&E) -> bool; then callers who wanted to retry all errors could construct it as, e.g.,

let policy = RetryLimit::new(2, |_| true);

and callers who want to filter retries can stick whatever logic they want in the closure. (Does that seem like the right bound for the closure?)

Though since this only returns an error that wouldn't support retrying on http::Responses?

I think we could use some trait and a TraitFn adapter for a closure, what do you think?

Hmm, I'm not sure what kind of trait you're thinking of. I think I'm missing something: if the user-supplied logic has access to the whole response and error, how would that be different from just implementing Policy::retry directly?

This is closer to the related vector retrylogic thing I sent on discord, but the idea is that as a user I can just provide what i'd like to retry on instead of having to implement a retry policy myself.

I wouldn't copy this 1-1 but https://github.com/timberio/vector/blob/master/src/sinks/util/retries.rs#L25

I wonder if there is a way to traitize this? For example, http retries you may want to retry 500s but not 400s.

For example, finagle uses a ResponseClassifier in a similar way.

This should be removed when tower-rs/tower#414 lands but is good enough for our purposes for now.

LucioFranco

I would like to see a simple responseclassifer trait that could allow the user to pass in their own way to determine when to retry, otherwise this looks good to merge. Let me know if you have any questions.

hawkw · 2021-01-13T21:41:40Z

@hdevalence any interest in wrapping this up, or should we just close the PR?

hdevalence · 2021-02-02T23:39:45Z

I'd be happy to wrap this up, but I don't really know what the "good" design would be -- the current PR is just what worked for my needs. But, if there was clarity about what changes would be good, I'd be happy to make them, and I think it would be convenient if something like this was included in Tower.

hawkw · 2021-02-02T23:57:33Z

I'd be happy to wrap this up, but I don't really know what the "good" design would be -- the current PR is just what worked for my needs. But, if there was clarity about what changes would be good, I'd be happy to make them, and I think it would be convenient if something like this was included in Tower.

I wonder if @LucioFranco has any ideas about this.

LucioFranco · 2021-02-12T16:12:20Z

@rcoh maybe you can chime in here a bit?

rcoh · 2021-02-13T15:23:13Z

I don't necessarily want to impose our retry needs on others, but as a data point here are things we need:

Classifier interface that has access to the full Result<T, E> (and not just E)
The ability for the retry service to modify the responses, eg. set metadata about the number of retries required, set a custom error if you ran out of retries, etc. I sketched a possible design for this in Retry Policy POC #546

Our retry policy behavior (around number of retries, backoff length, etc.) is far too complex to be well served by something shared, unfortunately, so I don't think we have much need for batteries included policies.

hdevalence · 2021-02-13T19:43:14Z

Those things are all possible using the existing retry trait, right? And, if they're not, that's an issue with the trait itself, not with any default policies, correct?

My goal with this PR was just to include some basic implementations that are sufficient for simple retry logic — of course, if there are more complex use cases, nothing prevents a user from implementing the trait themselves, but I'm not sure that that should mean that it's a bad idea to include some simpler default behavior.

davidpdrsn

My 2 cents: I think this is worth merging (once the FIXMEs are fixed) as simple retry policies people can use if they want.

I also think some kind of response classifier is useful and we've actually been designing something over in tower-http https://github.com/tower-rs/tower-http/blob/classify-response/tower-http/src/classify.rs. The traits themselves aren't protocol specific so they could in theory live in tower. I think thats a separate discussion however.

davidpdrsn · 2021-05-04T20:10:13Z

@hdevalence sorry this has taken so long to land.

My opinion from 2 months still lands. If we fix the small docs things I would be okay with merging this. I think the discussion about whether or not to change Retry in a breaking way is a separate discussion.

Do you wanna drive this home or should I take over?

hdevalence · 2021-05-06T00:44:34Z

@davidpdrsn I think I've lost context on this one, so feel free to push it over the finish line.

hdevalence added 2 commits February 10, 2020 17:19

Make retry::Policy docs formatting consistent.

7ce4110

LucioFranco reviewed Feb 11, 2020

View reviewed changes

hdevalence added a commit to ZcashFoundation/zebra that referenced this pull request Feb 11, 2020

Add basic retry policies to zebra-network.

acccdda

This should be removed when tower-rs/tower#414 lands but is good enough for our purposes for now.

hdevalence mentioned this pull request Feb 11, 2020

Add basic retry policies to zebra-network. ZcashFoundation/zebra#246

Merged

dconnolly pushed a commit to ZcashFoundation/zebra that referenced this pull request Feb 11, 2020

Add basic retry policies to zebra-network.

abcc0a6

This should be removed when tower-rs/tower#414 lands but is good enough for our purposes for now.

LucioFranco requested changes Mar 23, 2020

View reviewed changes

jonhoo added A-retry Area: The tower "retry" middleware C-enhancement Category: A PR with an enhancement or a proposed on in an issue. S-waiting-on-author Status: awaiting some action (such as code changes) from the PR or issue author. labels Mar 31, 2020

davidpdrsn reviewed Feb 18, 2021

View reviewed changes

davidpdrsn mentioned this pull request May 4, 2021

improve flexiblity of retry policy #584

Merged

hdevalence closed this Oct 30, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add basic "batteries-included" `retry::Policy`s. #414

Add basic "batteries-included" `retry::Policy`s. #414

hdevalence commented Feb 11, 2020

LucioFranco Feb 11, 2020

hdevalence Feb 11, 2020

LucioFranco Feb 11, 2020

hdevalence Feb 11, 2020

LucioFranco Feb 11, 2020

LucioFranco Feb 11, 2020

seanmonstar Feb 11, 2020

LucioFranco left a comment

hawkw commented Jan 13, 2021

hdevalence commented Feb 2, 2021

hawkw commented Feb 2, 2021

LucioFranco commented Feb 12, 2021

rcoh commented Feb 13, 2021

hdevalence commented Feb 13, 2021

davidpdrsn left a comment •

edited

Loading

davidpdrsn commented May 4, 2021

hdevalence commented May 6, 2021

Add basic "batteries-included" retry::Policys. #414

Add basic "batteries-included" retry::Policys. #414

Conversation

hdevalence commented Feb 11, 2020

LucioFranco Feb 11, 2020

Choose a reason for hiding this comment

hdevalence Feb 11, 2020

Choose a reason for hiding this comment

LucioFranco Feb 11, 2020

Choose a reason for hiding this comment

hdevalence Feb 11, 2020

Choose a reason for hiding this comment

LucioFranco Feb 11, 2020

Choose a reason for hiding this comment

LucioFranco Feb 11, 2020

Choose a reason for hiding this comment

seanmonstar Feb 11, 2020

Choose a reason for hiding this comment

LucioFranco left a comment

Choose a reason for hiding this comment

hawkw commented Jan 13, 2021

hdevalence commented Feb 2, 2021

hawkw commented Feb 2, 2021

LucioFranco commented Feb 12, 2021

rcoh commented Feb 13, 2021

hdevalence commented Feb 13, 2021

davidpdrsn left a comment • edited Loading

Choose a reason for hiding this comment

davidpdrsn commented May 4, 2021

hdevalence commented May 6, 2021

Add basic "batteries-included" `retry::Policy`s. #414

Add basic "batteries-included" `retry::Policy`s. #414

davidpdrsn left a comment •

edited

Loading