In BackgroundJobQueue.run_pending_job, out-of-memory errors are not caught and handled (java.lang.OutOfMemoryError is not a StandardError, so Ruby's default rescue does not catch it), so the watchdog thread continues reporting that the job is running, but it will never complete because the actual job has been destroyed in memory. Restarting the backend will pick the job back up and likely hit the same error. If the job is cancelled, the service still needs to be restarted because jobs are no longer being processed.
In my testing, I was able to add handling for java.lang.OutOfMemoryError specifically (I actually rescued java.lang.VirtualMachineError instead, but that was as far as I was comfortable going at the time), which allowed us to properly fail the job and continue processing.
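To illustrate the rescue-hierarchy issue in plain Ruby: a bare `rescue StandardError` silently misses exceptions that descend directly from Exception. This sketch uses MRI's built-in NoMemoryError as a stand-in for JRuby's java.lang.OutOfMemoryError (which likewise is not a StandardError); `run_job_with_rescue` is a hypothetical helper, not ArchivesSpace code.

```ruby
# Sketch: StandardError does not catch NoMemoryError, so an explicit
# rescue clause is needed to mark the job failed and keep the queue alive.
# (In JRuby the analogous clause would rescue Java::JavaLang::VirtualMachineError.)
def run_job_with_rescue
  results = []
  begin
    raise NoMemoryError, "simulated OOM"   # stand-in for the real allocation failure
  rescue StandardError
    results << :standard   # never reached: NoMemoryError is not a StandardError
  rescue NoMemoryError
    results << :oom        # reached: the job can be failed here and processing continues
  end
  results
end
```

With only the StandardError clause, the exception would propagate and kill the worker thread, which is exactly the stuck-watchdog behavior described above.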
That said, there is a lot of discussion about whether out-of-memory errors should be "handled" at all, so I didn't want to submit a PR without further discussion.
@cposton I would like to close this issue. Our philosophy is that out-of-memory errors should not be handled by the ArchivesSpace application but should instead be managed by the institution installing ArchivesSpace. That being said, are you still having issues with this?
I understand and agree with the stated philosophy. We have since modified memory allocations and, as far as I am aware, have not had to deal with the issue since.