Increase resilience of worker by breaking out the "fetch job" vs "do job" parts #19

cooperaj · 2018-10-18T15:47:05Z

If the worker cannot communicate with its job provider (i.e. Redis), it fails with an exception. The only way to get it to resolve a new Redis server (assuming some sort of HA setup) is to restart the worker. This is because PHP, rather helpfully, caches name lookups.

By separating the fetching of jobs from the doing of jobs, you can catch that case and cause the worker to exit (and be restarted by whatever scheduling tool you're using). In our situation this results in a fresh name resolution to the revived/hot spare Redis instance and a working queue.
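The PR's actual change is in PHP and is not shown in this thread. As a rough, language-neutral illustration of the idea, here is a minimal Python sketch. Every name in it (`QueueUnavailable`, `fetch_job`, `run_worker`, and the `queue.pop()` interface) is hypothetical and not taken from the library; it assumes the queue client raises `ConnectionError` when the job provider is unreachable.

```python
class QueueUnavailable(Exception):
    """Hypothetical exception: the job provider could not be reached."""


def fetch_job(queue):
    """Fetch step. A connection failure is translated into QueueUnavailable."""
    try:
        return queue.pop()  # assumed interface: returns a job, or None if idle
    except ConnectionError as exc:
        raise QueueUnavailable("cannot reach job provider") from exc


def run_worker(queue, handle, max_iterations=None):
    """Loop over jobs and return a process exit code.

    Returns 1 as soon as fetching fails, so that a supervisor can restart
    the process (and with it, redo the name lookup). Returns 0 if the
    optional iteration limit is reached.
    """
    done = 0
    while max_iterations is None or done < max_iterations:
        try:
            job = fetch_job(queue)
        except QueueUnavailable:
            return 1  # fetch failed: stop the worker instead of retrying
        if job is not None:
            try:
                handle(job)  # "do job" step, isolated from the fetch step
            except Exception:
                pass  # a failing job does not stop the worker; real code would log it
        done += 1
    return 0
```

An entry point would typically pass the result to `sys.exit(...)`, leaving the restart to the process supervisor. The point of the split is that a failure while fetching and a failure while running a job are handled differently: only the former ends the process.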

coveralls · 2018-10-19T14:07:23Z

Coverage increased (+11.3%) to 100.0% when pulling cc6db26 on UniversityOfNottingham:feature/0.2-make-worker-resiliant into ed2cbf9 on maxbrokman:0.2.


cooperaj added 4 commits October 18, 2018 16:38

Add new exception and move existing into exceptions folder.

29cd433

Move to PHP7.1 minimum version and implement changes to worker.

218c9de

Move to 7.1 for the unit tests since this uses 7.1+ syntax

86ec258

Fix unit tests to work with new exception throwing.

3961dca

cooperaj added 2 commits October 19, 2018 15:17

Add new test to check the new functionality around queue read failures.

60b7686

Remove redundant code and add in a null job to increase coverage of the loop test.

cc6db26

dpgover mentioned this pull request Mar 27, 2020

Feature/0.2 make worker resiliant digbang/safe-queue#3

Closed
