Problems if worker doesn't have an error listener #47

nathanbowser · 2014-08-26T16:12:39Z

If a worker doesn't have an error event listener attached and a worker has an error, things get into a weird state because doneWorking is never called. You can see why by looking into node's eventemitter: https://github.com/joyent/node/blob/master/lib/events.js#L74-89

The text was updated successfully, but these errors were encountered:

evantahler · 2014-08-26T17:46:53Z

Can you clarify the behavior you are seeing? In general, I would expect the process/project using node-resque to crash outright if the error event is not handled.

nathanbowser · 2014-08-26T17:53:34Z

Resque will keep that worker around as running and it will never be freed.

evantahler · 2014-08-27T19:07:51Z

Ahh! I think you are referring to the fact that the process will die, but the worker.untrack() (and doneWorking()) won't be called (because the process is dead). This means the worker will be listed in redis as still 'working'. In #48 (comment) we are discussing how I don't think that this package should handle process signal listeners, and I think that this is a similar kind of thing. Due to the way node assumes that uncaught error emits should bubble up as errors, I think we would keep the same pattern, and not try to do more logic on top of it.

There are some hacks that can be done, IE:

place your invocation of node resque within a domain (to catch any un-caught error events)
use process.on('uncaughtException') to catch that error event
... or just catch that error event :D

evantahler · 2014-08-27T19:17:34Z

Perhaps we should add a note about this to the readme.

nathanbowser · 2014-08-27T19:18:36Z

The process doesn't die because resque is already using a domain. Here's an example:

var NR = require(__dirname + '/../index.js')

var connectionDetails = {
  package:   'redis',
  host:      '127.0.0.1',
  password:  '',
  port:      6379,
  database:  1,
  namespace: 'resquetest'
}

var jobs = {
  add: {
    perform: function (a, b, next) {
      console.log('performing...')
      next(new Error('Blue smoke'))
    }
  }
}

var worker = new NR.worker({connection: connectionDetails, queues: ['math']}, jobs, function () {
  worker.workerCleanup()
  worker.start()
})

var queue = new NR.queue({connection: connectionDetails}, jobs, function () {
  queue.enqueue('math', 'add', [1,2])
})

worker.start()

setTimeout(function () {
  queue.enqueue('math', 'add', [1,2])
}, 10000)

I had to put in a temporary solution for catching the error event for now. If that's your ideal permanent solution (which I personally don't feel is very clean), then we should at least document that.

nathanbowser · 2014-08-27T19:19:08Z

Oh, you published the readme note before I hit send. :)

nathanbowser · 2014-08-27T19:22:10Z

I guess another thing you might want to think about... some perform errors might not be errors in the sense that the process will be all out of whack. It could just be an error from not being able to talk to some API. If we were always dealing with errors that put node in a horrible state, then yea I think the process should be killed, but that's not always the case.

evantahler · 2014-08-27T20:09:40Z

Ahh! That last example was helpful. Yes, I agree this is broken in some way. I'll look into it.

evantahler · 2014-08-27T20:18:57Z

The weirdness actually happens in worker.fail. we get here from the domain on('error') in worker.perform, so we are emitting an error within an uncaught error callback...

evantahler · 2014-08-27T20:31:34Z

Closing in favor of #49
Thanks for pointing this out!

nathanbowser added a commit to SpiderStrategies/node-resque that referenced this issue Aug 26, 2014

Adds a test case to show the problem for issue actionhero#47

08f5d2c

evantahler added bug labels Aug 27, 2014

evantahler closed this as completed Aug 27, 2014

evantahler reopened this Aug 27, 2014

evantahler closed this as completed Aug 27, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Problems if worker doesn't have an error listener #47

Problems if worker doesn't have an error listener #47

nathanbowser commented Aug 26, 2014

evantahler commented Aug 26, 2014

nathanbowser commented Aug 26, 2014

evantahler commented Aug 27, 2014

evantahler commented Aug 27, 2014

nathanbowser commented Aug 27, 2014

nathanbowser commented Aug 27, 2014

nathanbowser commented Aug 27, 2014

evantahler commented Aug 27, 2014

evantahler commented Aug 27, 2014

evantahler commented Aug 27, 2014

Problems if worker doesn't have an error listener #47

Problems if worker doesn't have an error listener #47

Comments

nathanbowser commented Aug 26, 2014

evantahler commented Aug 26, 2014

nathanbowser commented Aug 26, 2014

evantahler commented Aug 27, 2014

evantahler commented Aug 27, 2014

nathanbowser commented Aug 27, 2014

nathanbowser commented Aug 27, 2014

nathanbowser commented Aug 27, 2014

evantahler commented Aug 27, 2014

evantahler commented Aug 27, 2014

evantahler commented Aug 27, 2014