Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Job Failure Messaging #977

Closed
dfaller opened this issue Oct 4, 2017 · 0 comments
Closed

Job Failure Messaging #977

dfaller opened this issue Oct 4, 2017 · 0 comments

Comments

@dfaller
Copy link
Contributor

dfaller commented Oct 4, 2017

Transition job execution failure in the scheduler to the backend messaging system.

  1. Create command message for updating recipe dependent job statuses
  2. Create command message for failing jobs
  3. Change scheduler task update handling to use failing jobs message
  4. Change task handling thread to use failing jobs message for timed out jobs
  5. Change scheduler lost node callback to use failing jobs message
  6. Change scheduler thread to use failing jobs message for executions that are starved or have a lost node
  7. Remove all error model retrieval in the scheduler?
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant