Interval of update loop varies and can slow down on slow backend #53

ties · 2021-10-31T09:52:53Z

As stayrtr operator I want stayrtr to keep fetching updates if the backend system is slow or not responsive.

If I want updates every 10 minutes, and a update takes 5 minutes, I want the next update to run 10 minutes after the previous one started. Not 15 minutes after (10 minutes after the previous finished).

Context

When running stayrtr from a slow connection (4G was not cooperating) I noticed that the update loop does not have a set interval but has a set delay. If the response of SLURM or the JSON are slow the loop takes (much) longer.

Root cause

Handling slow responses is a hard problem. It ends up being a tradeoff between liveliness of the whole system or getting all information.

For example, in my rpki-client wrapped I found that some repositories were so slow that they prevented me from updating on time. I decided to add a utility to timeout/abort fetching from slow repos. There I decided finishing an update was more important than having all information.

Desired behaviour

first of all:

exponential backoff on errors
have basic metrics for http behaviour. We have part of this, but last succesful response for url/response size/duration/status code should be tracked. And some metrics can be moved: RefreshStatusCode etc could be tracked from the http util.
make both updates (slurm + vrp-json) asynchronous, they can be performed in parallel.

then:

abort connection if retrieving the response takes longer than [timelimit] to send the response
schedule updates at set interval: "a update happens every interval". Not "interval after the previous update finishes"

The text was updated successfully, but these errors were encountered:

ties · 2021-10-31T09:57:21Z

It could also be that performing the update interval after the previous one finished is the desired behaviour. In that case this one can be closed (and I'll make a separate issue for the http metrics part).

randomthingsandstuff · 2022-01-31T19:10:10Z

I agree with your view on the matter and noticed this working on VRP expiry stuff.

That whole refresh/VRP expiry piece (in #15) needs to be broken out to accomplish this and test it properly. So I should be able to address this as part of that work.

When I push them, we should discuss the default timer values.

Previously if you had a very slow backend, the refresh timer for a reload would only start after the current refresh has finished. Now the timer will run after the timer fires for the last one. This helps avoid the client being torpedod by very slow backends Tag: #53

Tag: #53

benjojo · 2023-01-26T00:00:11Z

I split two of the subpoints into their own tickets, since they are worth their own investigations for now.

But the update loop now happens consistently, even if the backend is slow.

And VRP+SLURM updates are done in parallel

randomthingsandstuff added the release-blocker label Jan 31, 2022

randomthingsandstuff self-assigned this Jan 31, 2022

This was referenced Jan 25, 2023

exponential backoff on fetch errors #77

Closed

Expose better file fetch metrics #78

Open

benjojo added a commit that referenced this issue Jan 25, 2023

Make slurm and vrp-json updates happen in parallel

ba724ad

Tag: #53

benjojo closed this as completed Jan 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Interval of update loop varies and can slow down on slow backend #53

Interval of update loop varies and can slow down on slow backend #53

ties commented Oct 31, 2021 •

edited

Loading

ties commented Oct 31, 2021

randomthingsandstuff commented Jan 31, 2022 •

edited

Loading

benjojo commented Jan 26, 2023

Interval of update loop varies and can slow down on slow backend #53

Interval of update loop varies and can slow down on slow backend #53

Comments

ties commented Oct 31, 2021 • edited Loading

Context

Root cause

Desired behaviour

ties commented Oct 31, 2021

randomthingsandstuff commented Jan 31, 2022 • edited Loading

benjojo commented Jan 26, 2023

ties commented Oct 31, 2021 •

edited

Loading

randomthingsandstuff commented Jan 31, 2022 •

edited

Loading