Use an exponential backoff on SU failed data item uploads #890

arielmelendez · 2024-07-17T22:07:23Z

The code in question:
https://github.com/permaweb/ao/blob/main/servers/su/src/domain/clients/uploader.rs#L75-L90

Retrying a failing request 100 times in 1 second intervals can be a harmful pattern for connecting with external services. At the scale that AO SUs operate, this behavior can cause substantial heartburn for downstream services. It would be great if an exponential backoff pattern could be used here. Thanks!

ppedziwiatr · 2024-07-22T11:10:39Z

I would also consider adding some jitter here :) Also I believe the ultimate solution is to move the dataItems delivery to a separate/background job (#877)

VinceJuliano · 2024-07-22T18:14:27Z

Yes, we should have exponential backoff, but then also a robust way to ensure the item makes it through the upload eventually if it fails.

VinceJuliano · 2024-07-22T18:36:46Z

In uploader.rs, add exponential backoff to the loop that retries the upload, if it reaches some specified number of retries, say 10, add it to a persistent queue of items that need to be tried again later. And have a background process pulling from this queue and uploading.

uploader.rs will need access to store.rs, so store.rs will become a dependency of uploader.rs, new methods will need to be added to store.rs to persist the list of items that need to be retried. Perhaps they can be stored in rocksdb with a different key

TillaTheHun0 added enhancement New feature or request su ao Scheduler Unit labels Jul 18, 2024

Jeremiahstockdale assigned Jeremiahstockdale and jfrain99 and unassigned Jeremiahstockdale and jfrain99 Jul 23, 2024

VinceJuliano closed this as completed Aug 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use an exponential backoff on SU failed data item uploads #890

Use an exponential backoff on SU failed data item uploads #890

arielmelendez commented Jul 17, 2024 •

edited

Loading

ppedziwiatr commented Jul 22, 2024

VinceJuliano commented Jul 22, 2024

VinceJuliano commented Jul 22, 2024 •

edited

Loading

Use an exponential backoff on SU failed data item uploads #890

Use an exponential backoff on SU failed data item uploads #890

Comments

arielmelendez commented Jul 17, 2024 • edited Loading

ppedziwiatr commented Jul 22, 2024

VinceJuliano commented Jul 22, 2024

VinceJuliano commented Jul 22, 2024 • edited Loading

arielmelendez commented Jul 17, 2024 •

edited

Loading

VinceJuliano commented Jul 22, 2024 •

edited

Loading