-
Notifications
You must be signed in to change notification settings - Fork 317
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: inconsistent and leaky retry delay logic in router #3002
Conversation
47ee86f
to
98ebf81
Compare
Codecov ReportBase: 53.01% // Head: 53.00% // Decreases project coverage by
Additional details and impacted files@@ Coverage Diff @@
## master #3002 +/- ##
==========================================
- Coverage 53.01% 53.00% -0.01%
==========================================
Files 334 334
Lines 51941 51885 -56
==========================================
- Hits 27534 27504 -30
+ Misses 22796 22779 -17
+ Partials 1611 1602 -9
Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. ☔ View full report at Codecov. |
5b1656c
to
0af362b
Compare
0af362b
to
d373f87
Compare
another shameless lo reference could be useful here:
|
d373f87
to
f08985f
Compare
f08985f
to
7544df9
Compare
7544df9
to
8457cc5
Compare
Description
Addressing router job retry backoff inconsistencies which may also cause memory leak:
postStatusOnResponseQ
.To fix these issues we are no longer using an in-memory map per worker, but the job status'
retry_time
column is used for storing the calculated backoff time. JobsDB queries for retrieving jobs don't use any conditions against theretry_time
column, otherwise in case of a server restart this would cause jobs to be picked out-of-order as router's event-ordering algorithm doesn't persist its state, it only retains it in-memory.Notion Ticket
Link
Security