
Handle stale jobs more carefully before purging them. #4615

Merged (1 commit) on Feb 11, 2020

Conversation

jezdez
Member

@jezdez jezdez commented Feb 6, 2020

What type of PR is this? (check all applicable)

  • Refactor
  • Feature
  • Bug Fix
  • New Query Runner (Data Source)
  • New Alert Destination
  • Other

Description

In case of a dead worker, a failed job's ended_at value could be None, which
made the purge task fail and meant stale jobs were never purged.
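The guard described above can be sketched roughly as follows. This is a sketch, not the patch as merged: `FakeJob` and the module-level `JOB_DEFAULT_FAILURE_TTL` are hypothetical stand-ins for the real rq job objects and `settings.JOB_DEFAULT_FAILURE_TTL`, and `total_seconds()` is used here so deltas longer than a day compare correctly.

```python
from datetime import datetime, timedelta

# Hypothetical stand-in for settings.JOB_DEFAULT_FAILURE_TTL (in seconds).
JOB_DEFAULT_FAILURE_TTL = 7 * 24 * 3600

class FakeJob:
    """Hypothetical stand-in for an rq job; only ended_at matters here."""
    def __init__(self, ended_at):
        self.ended_at = ended_at

def find_stale_jobs(failed_jobs):
    stale_jobs = []
    for failed_job in failed_jobs:
        # the job may not actually exist anymore in Redis
        if not failed_job:
            continue
        # a worker that died before saving ended_at leaves it empty;
        # treat those jobs as stale too instead of crashing on None
        if not failed_job.ended_at:
            stale_jobs.append(failed_job)
        elif (datetime.utcnow() - failed_job.ended_at).total_seconds() > JOB_DEFAULT_FAILURE_TTL:
            stale_jobs.append(failed_job)
    return stale_jobs
```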

Related Tickets & Documents

Mobile & Desktop Screenshots/Recordings (if there are UI changes)

@jezdez jezdez requested review from arikfr and rauchy February 6, 2020 10:56
stale_jobs = []
for failed_job in failed_jobs:
# the job may not actually exist anymore in Redis
if not failed_job:
Contributor

How about we just compact these?

# the job could have an empty ended_at value in case
# of a worker dying before it can save the ended_at value,
# in which case we also consider them stale
if not failed_job.ended_at:
Contributor

This feels more like an or conditional and less like a multi-branch statement to me.
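Folding the two branches into a single condition, as suggested, might look like the sketch below. This is an illustration under assumptions, not the merged patch: `failure_ttl` is passed in rather than read from settings, and the job shape is assumed.

```python
from datetime import datetime

def is_stale(job, failure_ttl):
    # one boolean instead of two if-branches: the job is stale when the
    # worker died before saving ended_at, or when it ended longer ago
    # than the failure TTL allows
    return job.ended_at is None or (
        (datetime.utcnow() - job.ended_at).total_seconds() > failure_ttl
    )
```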

for job in stale_jobs:
job.delete()
stale_jobs = []
for failed_job in failed_jobs:
Contributor

If you share my thoughts on the other couple of comments, this whole block might be better represented by a filter on failed_jobs

Member Author

I'm not sure I follow, what do you mean with "a filter on failed_jobs"?

Contributor

I just mean that it feels like stale_jobs is just a sub-list of failed_jobs that satisfies a predicate. Something like:

is_stale = lambda job: (
    job.ended_at is None
    or (datetime.utcnow() - job.ended_at).seconds > settings.JOB_DEFAULT_FAILURE_TTL
)
stale_jobs = filter(is_stale, compact(failed_jobs))

Member Author

While you may be right that this is another way to write it, I don't find it more readable. But it's up to you; feel free to change the patch however you prefer.

Member

I'm fine with keeping @jezdez's implementation as is, as it leaves room for explaining the different steps.

Contributor

👍

Don't feel strongly either way, but generally I'm more in the camp of having descriptive variable/function/lambda names instead of comments (e.g. is_stale = worker_died or too_old). Names just expire more slowly than comments.
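The naming style suggested above could look like this sketch. It is not code from this PR: the `failure_ttl` and `now` parameters and the job shape are assumptions made for illustration.

```python
from datetime import datetime, timedelta

def is_stale(job, failure_ttl, now=None):
    now = now or datetime.utcnow()
    # name each branch: the names carry the intent, so the explanation
    # survives refactors better than a comment next to a bare condition
    worker_died = job.ended_at is None
    too_old = (not worker_died) and (now - job.ended_at).total_seconds() > failure_ttl
    return worker_died or too_old
```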
