Allow custom job ids to be specified in the job options #335

campriceaustin · 2016-08-19T01:22:18Z

No description provided.

xdc0 · 2016-08-19T01:25:32Z

test/test_job.js

@@ -56,6 +56,13 @@ describe('Job', function(){
        expect(storedJob.opts.testOpt).to.be('enabled');
      });
    });
+
+    it('should use the custom jobId if one is provided', function() {


Test case suggestion:

Create a job with a customId and test if it can be processed, in the process callback, verify that the custom jobId matches the one provided

bradvogel · 2016-08-19T04:36:17Z

Add a bit to the readme too

manast · 2016-08-19T07:18:16Z

I have mixed feelings about this functionality. From one side it seems like something that should be nice to give to certain pro-users, but on the other hand, it may also create a new flora of issues if the user is not able to generate unique ids for every job. I haven't understood yet which consequences it would have to add several jobs using the same jobId... Isn't there a different way to achieve what you want without custom ids?

campriceaustin · 2016-08-19T07:32:49Z

@manast The general use case is having a concurrency-safe way to add a job if it doesn't already exist, based on some unique value (in our case the user id).

In our case, we're dealing with hundreds of thousands of jobs per hour, and the load on Redis has been pretty intense. We needed a way to avoid adding a job to the queue if one already exists for the user.

I've deployed our branch running this code to production and the results have been great - we've gone from ~90% CPU utilisation to < 10%.

In terms of other possible ways to achieve the same thing: we could do a transaction script which loops through jobs looking for a match, and inserts it it doesn't find one, but that's slow.

We could also keep a secondary key as a kind of lock to indicate whether a job with that unique value already exists, but that introduces other issues: what happens if some failure occurs and the lock is left orphaned? We'd need some kind of TTL, and it all gets rather complicated.

Being able to override the key seemed the best solution to me. It lets us determine whether a job with that unique value already exists in constant-time, and doesn't have any of the issues around locking.

manast · 2016-08-19T07:58:32Z

I am not sure I understand your use case completely, but what about this: when you call the addJob method you get the unique ID for that given job. If you stored that ID in a redis database using the userId as key, wouldn't you then be able to test in O(1) if the users' job exists already? You will of course need some code to update the database when jobs are completed/failed, removed and so on.

campriceaustin · 2016-08-19T08:13:45Z

@manast Unfortunately, that wouldn't be concurrency safe.

We'd need to:

Lookup to see if the user's job exists already
If not, add it. If so, ignore it.

Another thread could create or remove the job in between 1 and 2.

We'd have the same problem when removing the lock at the end of the job too, plus the possibility that the job processes but some failure interferes with removing the lock, hence preventing all future jobs from processing for that user, requiring us to build a more complex system of TTLs etc.

manast · 2016-08-19T09:38:49Z

ok. But how do you avoid points 1 and 2 using custom ids in bull?

campriceaustin · 2016-08-19T10:44:54Z

@manast We wrap 1 and 2 in a transaction. Because we have a known key, we can perform step 1 using an EXISTS command (L#84 in scripts.js), then conditionally insert depending on the result.

(Note: I've just realised we'll need to execute these within a multi call to guarantee atomicity - I'll commit that soon).

manast · 2016-08-19T12:55:18Z

why do you need multi? If the jobId exists already it just returns false or -1 for example, and since it is run in a script it will be atomic.

campriceaustin · 2016-08-20T06:48:14Z

@manast Oh, are scripts atomic by default? I thought they needed to be wrapped in a multi call. If not, then great! No need to make that change then.

manast · 2016-08-20T21:06:14Z

lib/scripts.js

    })

    var script = [
-      'local jobId = redis.call("INCR", KEYS[5])',
+      'local jobId = ARGV[2] == "" and redis.call("INCR", KEYS[5]) or ARGV[2]',
      'redis.call("HMSET", ARGV[1] .. jobId' + argvs.join('') + ')',


Why not use if/else here for readability ?

campriceaustin · 2016-08-22T00:23:28Z

@manast Cool, good suggestions. I have implemented them.

TomKaltz · 2016-09-18T04:27:15Z

@pricj004 @manast I will be using this feature to make sure duplicate jobs don't get added to my queue. When submitting a new job with the same ID as a previously FAILED job, the new job is not added to the queue. Is this desired behavior for most? I would really like it to retry/rerun the failed job if I submit it again. Can someone explain why the current behavior was chosen?

manast · 2016-09-19T10:04:22Z

@TomKaltz couldn't you just use the retry functionality? or listen to the fail event and delete the job in that case?

TomKaltz · 2016-09-20T01:08:27Z

@manast your suggestion would work I guess. I just think it would be a good feature for duplicate submitted jobId to automatically retry if it had previously failed. Currently job is ignored if ID exists and has previously failed. I see more use-case for my feature request than current functionality of ignoring the job in that state. Is there any reason why bull can't adopt this functionality?

manast · 2016-09-20T07:48:21Z

@TomKaltz mostly that it requires some extra work and I guess nobody has time to do it right now...

Allow custom job ids to be specified in the job options

0ef22f8

xdc0 reviewed Aug 19, 2016
View reviewed changes

Treat jobIds as strings

7c5940c

campriceaustin mentioned this pull request Aug 19, 2016

Custom Job_id #132

Closed

Cameron Price-Austin added 2 commits August 19, 2016 12:25

Delayed jobs should still use a numeric id to offset the timestamp

2e9940b

Don't overwrite if the key already exists

02f4009

Updating readme

e744956

manast reviewed Aug 20, 2016
View reviewed changes

Cameron Price-Austin added 5 commits August 22, 2016 09:40

Use an 'if/else' for readability in the 'addJob' script

1b765e5

Rename 'numericJobId' to 'jobCounter'

77f6b77

Renaming 'fullJobId' to 'jobIdKey' in the 'addJob' script

6c8e29d

Changing the order of params to the 'addJob' script

ceb827f

Making the readme a bit clearer on the 'jobId' option

7369515

manast merged commit 333d007 into OptimalBits:master Aug 22, 2016

bradvogel mentioned this pull request Aug 22, 2016

rate limiting? #329

Closed

bradvogel deleted the cameron/allow-custom-job-ids branch October 3, 2016 22:51

spiritinlife mentioned this pull request Oct 15, 2016

Custom jobId #351

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow custom job ids to be specified in the job options #335

Allow custom job ids to be specified in the job options #335

campriceaustin commented Aug 19, 2016

xdc0 Aug 19, 2016

bradvogel commented Aug 19, 2016

manast commented Aug 19, 2016

campriceaustin commented Aug 19, 2016

manast commented Aug 19, 2016 •

edited

Loading

campriceaustin commented Aug 19, 2016

manast commented Aug 19, 2016

campriceaustin commented Aug 19, 2016

manast commented Aug 19, 2016

campriceaustin commented Aug 20, 2016

manast Aug 20, 2016

campriceaustin commented Aug 22, 2016

TomKaltz commented Sep 18, 2016 •

edited

Loading

manast commented Sep 19, 2016

TomKaltz commented Sep 20, 2016

manast commented Sep 20, 2016

Allow custom job ids to be specified in the job options #335

Allow custom job ids to be specified in the job options #335

Conversation

campriceaustin commented Aug 19, 2016

xdc0 Aug 19, 2016

Choose a reason for hiding this comment

bradvogel commented Aug 19, 2016

manast commented Aug 19, 2016

campriceaustin commented Aug 19, 2016

manast commented Aug 19, 2016 • edited Loading

campriceaustin commented Aug 19, 2016

manast commented Aug 19, 2016

campriceaustin commented Aug 19, 2016

manast commented Aug 19, 2016

campriceaustin commented Aug 20, 2016

manast Aug 20, 2016

Choose a reason for hiding this comment

campriceaustin commented Aug 22, 2016

TomKaltz commented Sep 18, 2016 • edited Loading

manast commented Sep 19, 2016

TomKaltz commented Sep 20, 2016

manast commented Sep 20, 2016

manast commented Aug 19, 2016 •

edited

Loading

TomKaltz commented Sep 18, 2016 •

edited

Loading