Flags for customizing retry behaviour #82

hansmi · 2020-06-14T18:31:59Z

I have a usecase where a retry every 5 seconds is excessive and would like to contribute customizations for the retries. The PushProx client currently hardcodes the timings for connection retries:

PushProx/cmd/client/main.go

Lines 211 to 217 in eeadbe7

    
           func newJitter() decorrelatedJitter { 
        
           	rand.Seed(time.Now().UnixNano()) 
        
           	return decorrelatedJitter{ 
        
           		min: 50 * time.Millisecond, 
        
           		cap: 5 * time.Second, 
        
           	} 
        
           }

The github.com/cenkalti/backoff package (documentation) provides an exponential backoff algorithm. I'd use that instead of the custom implementation and provide flags:

--retry.initial-wait=<duration>, default 50ms
--retry.multiplier=<float>, default 1.5
--retry.max-wait=<duration>, default 5s
--retry.max-elapsed=<duration>, default 0 (keep retrying forever, otherwise give up and terminate after given duration)

The default behaviour would be comparable to what's currently implemented while permitting customization.

@SuperQ @brian-brazil Would you be happy with such flags?

The text was updated successfully, but these errors were encountered:

SuperQ · 2020-06-14T19:05:19Z

Yes, this seems reasonable to me.

SuperQ · 2020-06-15T15:29:12Z

One thing about the backoff options. I'd like to keep the flags to a minimum to start, and set sane defaults for the rest. If you have suggestions for better defaults than the current ones, we can discuss changing them.

For now, I think the only real flag we would need is --retry.max-wait. The rest can stay hard coded.

I prefer to not over-flag things.

hansmi · 2020-06-15T15:54:20Z

I agree with you on not exposing too many flags. My primary concern are metered connections, i.e. mobile data, in case the server is not reachable. As such I'd strongly prefer to have at least the minimum and maximum wait duration configurable (--retry.initial-wait, --retry.max-wait). The multiplier can be hardcoded, or we could make it a hidden flag (Kingpin supports them). The max-elapsed is not necessary for my usecase and came from looking at the options the backoff package provides.

As for the flag prefix I'm not sure whether --retry. is good. It could be misinterpreted as "retries against the exporters" when it's actually retries for the proxy connection. I guess this can be addressed by mentioning it in the flag description.

SuperQ · 2020-06-15T21:09:39Z

initial-wait and max-wait sound fine to me. We can leave out max-elapsed and see if anyone really needs it.

For the naming convention, I would put this under the --proxy prefix. How about --proxy.retry.initial-wait. I'd also like to deprecate --proxy-url in favor of --proxy.url so that we have consistent naming.

hansmi · 2020-06-15T21:22:22Z

#83 implements the flags as discussed. Implementing unittests will take a few more changes which I'd like to do separately (experimental code exists already).

This is a quick fix until PR prometheus-community#82 [0] has landed upstream. [0]: prometheus-community#82

hansmi mentioned this issue Jun 15, 2020

Implement flags to control retry delays #83

Merged

ecksun pushed a commit to AiflooAB/PushProx that referenced this issue Sep 3, 2020

Increase jitter cap to 30 seconds

c5951a8

This is a quick fix until PR prometheus-community#82 [0] has landed upstream. [0]: prometheus-community#82

hansmi closed this as completed Mar 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flags for customizing retry behaviour #82

Flags for customizing retry behaviour #82

hansmi commented Jun 14, 2020

SuperQ commented Jun 14, 2020

SuperQ commented Jun 15, 2020

hansmi commented Jun 15, 2020

SuperQ commented Jun 15, 2020

hansmi commented Jun 15, 2020

Flags for customizing retry behaviour #82

Flags for customizing retry behaviour #82

Comments

hansmi commented Jun 14, 2020

SuperQ commented Jun 14, 2020

SuperQ commented Jun 15, 2020

hansmi commented Jun 15, 2020

SuperQ commented Jun 15, 2020

hansmi commented Jun 15, 2020