Exception handling when Heroku API call fails? #21

swrobel · 2013-08-28T23:23:31Z

I'm wondering what happens when autoscaler hits an exception while attempting to do its thing. Here's my understanding:

I enqueue something in my code
Sidekiq attempts to enqueue that job
Autoscaler jumps in at some point and attempts to tell the API to add a worker
Exception thrown
Exception bubbles back up to the top, stopping Sidekiq & my code in its tracks

Does the job end up get enqueued in the Sidekiq or does that process fail because Autoscaler did? Is it worth trying to catch exceptions in Autoscaler since it isn't a "critical" process?

What led me to these questions is that I've been noticing weird Excon errors in honeybadger today. Here are the two exception messages I've gotten, both of which have backtraces that lead me to believe that it's happening when autoscaler attempts to hit the heroku api (each links to a gist of the backtrace):

Sorry, this is more just a dump of my thoughts, but hopefully it sparks a good conversation..

JustinLove · 2013-08-29T13:43:09Z

Good point. I may have to review Exceptional Ruby for library behavior recommendations - if it just ate exceptions, you'd have unprocessed jobs backing up for no apparent reason.

At the moment, the scale happens before the yield, so the job won't be enqueued

swrobel · 2013-08-29T17:32:45Z

hm, yikes, that seems problematic. seems like some of my jobs got through and some didn't. can't really make sense of it.

anyway, i figure it's better to have the jobs in the queue and no workers to process them than no jobs and no workers, knowwhatimean?

JustinLove · 2013-08-31T20:25:21Z

How's this?

39d5bd8

swrobel · 2013-09-03T23:41:36Z

👍

swrobel · 2013-09-04T15:59:46Z

Great, Heroku API is down today! Deploying w/ the github version to see what happens... Who would've thought we'd get a chance to test so soon.

EDIT: Sadly, no dice, can't deploy either...

JustinLove · 2013-09-04T20:59:41Z

I thought that heroku's tools used the API as well, although I had some hope that deploy was significant separate service in it's own right :-(

swrobel · 2013-09-05T22:13:14Z

Any input on how to implement a custom handler correctly? I'm doing the following, but maybe there's a simpler/better way:

  exception_handler = ->(exception) do
    Honeybadger.notify(
      error_class: exception.class,
      error_message: exception.message
    )
  end

  Sidekiq.configure_server do |config|
    config.redis = sidekiq_redis if sidekiq_redis
    config.server_middleware do |chain|
      scaler = Autoscaler::HerokuScaler.new
      scaler.exception_handler = exception_handler
      chain.add(Autoscaler::Sidekiq::Server, scaler, 60)
    end
  end

JustinLove · 2013-09-05T22:30:52Z

That's about what I intended.

swrobel · 2013-09-11T18:29:09Z

I've never been so excited to see Heroku API errors! Exceptions the last 15 minutes have been getting logged but jobs are still queuing. Yay!

Thank you, again.

swrobel · 2013-09-16T22:07:36Z

How do you feel about rescuing Exception like sidekiq does rather than limiting it to Excon? I imagine other stuff could go wrong as well that we aren't thinking of ;)

https://github.com/mperham/sidekiq/blob/master/lib/sidekiq/processor.rb#L54

JustinLove · 2013-09-17T14:44:27Z

Mostly blanket exception catching makes me nervous about masking errors.

Sidekiq has a slightly different domain - it has to wrap arbitrary user code, and then respond appropriately (such as retry) One reason not to catch all exceptions is to let Sidekiq continue it's usual error handling.

swrobel · 2013-09-17T21:34:07Z

True, but we wouldn't catch those, right? I don't think they would bubble up to Autoscaler... I'm in the same camp as you, but in situations like this where queuing is more important than performing, it just seems to make sense.

swrobel · 2013-09-28T00:08:52Z

Another day, another Heroku API failure. Here's the latest exception that's now bubbling up to my app and causing sidekiq not to enqueue: Heroku::API::Errors::ErrorWithResponse: Expected(200) <=> Actual(503 Service Unavailable)

swrobel · 2013-09-28T01:03:19Z

New idea: is it possible to handle the scaling after sidekiq has successfully enqueued the job? That way no matter what happens with autoscaler, we know jobs won't be lost...

swrobel closed this as completed Sep 3, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Exception handling when Heroku API call fails? #21

Exception handling when Heroku API call fails? #21

swrobel commented Aug 28, 2013

JustinLove commented Aug 29, 2013

swrobel commented Aug 29, 2013

JustinLove commented Aug 31, 2013

swrobel commented Sep 3, 2013

swrobel commented Sep 4, 2013

JustinLove commented Sep 4, 2013

swrobel commented Sep 5, 2013

JustinLove commented Sep 5, 2013

swrobel commented Sep 11, 2013

swrobel commented Sep 16, 2013

JustinLove commented Sep 17, 2013

swrobel commented Sep 17, 2013

swrobel commented Sep 28, 2013

swrobel commented Sep 28, 2013

Exception handling when Heroku API call fails? #21

Exception handling when Heroku API call fails? #21

Comments

swrobel commented Aug 28, 2013

JustinLove commented Aug 29, 2013

swrobel commented Aug 29, 2013

JustinLove commented Aug 31, 2013

swrobel commented Sep 3, 2013

swrobel commented Sep 4, 2013

JustinLove commented Sep 4, 2013

swrobel commented Sep 5, 2013

JustinLove commented Sep 5, 2013

swrobel commented Sep 11, 2013

swrobel commented Sep 16, 2013

JustinLove commented Sep 17, 2013

swrobel commented Sep 17, 2013

swrobel commented Sep 28, 2013

swrobel commented Sep 28, 2013