This repository has been archived by the owner on Feb 1, 2024. It is now read-only.
Overview
AWS Batch jobs can fail. We designed the processing to be idempotent, so it is safe to rerun failed jobs, but we don't have an automated system for doing so.
Is your feature request related to a problem? Please describe.
You cannot see that a job has failed within the application, only in the AWS Batch console.
Describe the solution you'd like
Evaluate whether or not the built-in AWS Batch job retry feature is appropriate for our workflow.
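As a starting point for that evaluation, here is a minimal sketch of how a retry strategy could be attached when a job is submitted. The helper `build_submit_job_params` and the job, queue, and job definition names are hypothetical and not taken from this repository; `retryStrategy` with an `attempts` count is the parameter AWS Batch accepts on job submission.

```python
def build_submit_job_params(job_name, job_queue, job_definition, attempts=3):
    """Build keyword arguments for a Batch job submission with retries.

    Hypothetical helper: the names passed in are placeholders, not values
    used by this project.
    """
    # AWS Batch accepts between 1 and 10 attempts per job.
    if not 1 <= attempts <= 10:
        raise ValueError("attempts must be between 1 and 10")
    return {
        "jobName": job_name,
        "jobQueue": job_queue,
        "jobDefinition": job_definition,
        # Batch reruns the job until it succeeds or attempts are exhausted.
        "retryStrategy": {"attempts": attempts},
    }
```

With boto3 installed and credentials configured, the result could be passed along as `boto3.client("batch").submit_job(**params)`. Because the processing is idempotent, a blind rerun like this should be safe, though it would not help with failures caused by bad input.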
jwalgran changed the title from "Improve the visibility resiliency of AWS Batch processing of CSVs" to "Improve the visibility and resiliency of AWS Batch processing of CSVs" on Apr 3, 2019
I would be interested in discussing this a bit with whoever pulls it. In RF we've made use of:
Job retries (which require some additional application logic to track the retry count)
Job timeouts (to prevent jobs that hang from hogging up resources)
Rollbar notifications when jobs fail (a Rube Goldberg device that uses part of the strategy in the SNS article above, but to wake up a Lambda that invokes Rollbar)
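The three items above could be sketched roughly as follows. This is an illustration under stated assumptions, not the RF implementation: the function names are hypothetical, and the event shape assumed is the "Batch Job State Change" event that AWS Batch publishes. `AWS_BATCH_JOB_ATTEMPT` is the environment variable Batch sets inside the container, and `attemptDurationSeconds` is the job timeout field.

```python
import os


def current_attempt(environ=os.environ):
    """Return the attempt number Batch exposes to the running container."""
    # Defaults to 1 when run outside of Batch (for example, locally).
    return int(environ.get("AWS_BATCH_JOB_ATTEMPT", "1"))


def timeout_config(seconds):
    """Build the `timeout` value for a job submission."""
    # AWS Batch requires a timeout of at least 60 seconds.
    if seconds < 60:
        raise ValueError("timeout must be at least 60 seconds")
    return {"attemptDurationSeconds": seconds}


def summarize_failed_job(event):
    """Return a small summary dict for a FAILED job event, else None.

    The summary is what a notifier (Rollbar, or a status field in the
    application) might consume; sending it is left out of this sketch.
    """
    if event.get("source") != "aws.batch":
        return None
    if event.get("detail-type") != "Batch Job State Change":
        return None
    detail = event.get("detail", {})
    if detail.get("status") != "FAILED":
        return None
    return {
        "job_id": detail.get("jobId"),
        "job_name": detail.get("jobName"),
        "reason": detail.get("statusReason", "unknown"),
        "attempts": len(detail.get("attempts", [])),
    }
```

A Lambda handler subscribed to these events could call `summarize_failed_job(event)` and forward any non-`None` result to the notifier of choice, or write it somewhere the application can display it.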