Retrieving new jobs from the queue with a limit #141

Closed
rubik opened this issue Aug 4, 2019 · 9 comments · Fixed by #142

Comments

@rubik
Contributor

rubik commented Aug 4, 2019

By checking the source code, it seems to me that a worker always takes as many jobs as possible from the queue, regardless of how many free actors there are:

https://github.com/samuelcolvin/arq/blob/b2d397ae3b07340674f6f0edad144beaf4d4ef54/arq/worker.py#L260-L264
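In short, the linked lines do a single unbounded read of the queue's sorted set, roughly:

job_ids = await self.pool.zrangebyscore(self.queue_name, max=now)

i.e. every job ID scored at or below the current timestamp, with no cap on how many are returned.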

I think there is real room for improvement here. If the queue is large, a worker ends up holding all the job IDs in memory without being able to execute them in a reasonable time. Conversely, if the semaphore indicates it's full, why fetch more jobs at all? (Or maybe we should fetch just a few.) I was also planning to run multiple instances of my worker, and in that case parallelism would benefit from each worker taking only the subset of jobs it can actually handle.

With that in mind, I propose that the above call to Redis be changed to:

job_ids = await self.pool.zrangebyscore(self.queue_name, limit=limit)

where limit is the number of free slots in the semaphore, or some multiple of it (ideally configurable; the optimal multiplier would mostly depend on the use case, but it's probably slightly above 1 in most cases).

I also don't quite understand the usefulness of the max=now filter. In the great majority of cases, all the enqueued jobs have a score below the current timestamp. But even if a job had a slightly higher timestamp, why would a free worker not run it?
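As far as I can tell, a job only gets a future score when it is enqueued with a delay, e.g. via the _defer_by or _defer_until options of enqueue_job (the task name and URL below are just placeholders, and redis is an arq pool from create_pool):

await redis.enqueue_job('download_content', 'https://example.com', _defer_by=600)  # run ~10 minutes from now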

@samuelcolvin
Member

I don't think that's the case. The job IDs are retrieved, but they are not "taken" - another worker could still execute the job.

However, I may be wrong. Please try this out and let me know if you do find a problem.

@rubik
Contributor Author

rubik commented Aug 4, 2019

@samuelcolvin Yes, that's what I meant. The reason I was checking is that in my case the queue usually has tens to hundreds of thousands of jobs. It doesn't seem optimal at all for a worker to take all the IDs each time, if there's no way to run the corresponding jobs. I will try it out though, yes.

@samuelcolvin
Member

Makes sense, and a limit would make sense.

To be honest I haven't run arq v0.16 with that many pending jobs, so it's slightly unknown territory.

Very happy to try and help make it work though.

@rubik
Contributor Author

rubik commented Aug 5, 2019

@samuelcolvin From my tests, once the queue gets big the problem is not so much in the workers as in Redis. Since Redis is single-threaded, operations start to slow down because reading all the job IDs takes a long time. I think adding a queue_read_limit_coef parameter is a good compromise; it doesn't even have to be the default behaviour. The code would then look like this:

import math

if self.queue_read_limit_coef is not None:
    # free job slots, read from the worker's concurrency semaphore
    free_actors = self._sem._value
    limit = math.ceil(free_actors * self.queue_read_limit_coef)
    job_ids = await self.pool.zrangebyscore(self.queue_name, limit=limit)
else:
    job_ids = await self.pool.zrangebyscore(self.queue_name, max=now)

What do you think?

@samuelcolvin
Member

It should be possible to use max and limit together.

We definitely don't want to be processing any jobs not yet scheduled to be executed.

I also think it's better to use a setting that's simpler to understand, e.g. queue_read_limit, and give it a reasonable default of, say, 1000.
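Roughly, keep the max filter and just cap how many IDs each poll reads, something like this (limit here follows the notation of the proposal above; the exact keyword depends on the Redis client):

# queue_read_limit would default to something like 1000
# only fetch job IDs scheduled to run by now, and at most queue_read_limit of them
job_ids = await self.pool.zrangebyscore(
    self.queue_name, max=now, limit=self.queue_read_limit
)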

@rubik
Contributor Author

rubik commented Aug 5, 2019

@samuelcolvin Fair enough, I guess I can simply use my max_jobs as the limit, or something slightly higher.

@samuelcolvin
Member

yes, makes sense.

@rubik
Contributor Author

rubik commented Aug 8, 2019

@samuelcolvin Are you planning to work on this soon or should I open a PR? I think I'll be able to do this tomorrow or during the weekend.

@samuelcolvin
Member

Would be great if you could submit a PR.
