Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

control-service: improve api message on oom job execution errors #861

Merged
merged 5 commits into from
Jun 17, 2022

Conversation

mrMoZ1
Copy link
Contributor

@mrMoZ1 mrMoZ1 commented Jun 13, 2022

why: Recently users of control-service have been complaining that jobs
which fail with User Error and have failed with an OOM error didn't
have a sufficiently clear enough message that classifies such types of errors.

what: Changed internal implementation so that when a data job fails due
to K8S OOM errors the message stored in the database contains a hint as
to the cause of failure - out of memory issues. We now explicitly check the
container termination message and in case of User Errors with an OOM error
we set an appropriate message.

testing: Refactored existing tests. Added new unit tests.

Signed-off-by: Momchil Zhivkov mzhivkov@vmware.com

Signed-off-by: mrMoZ1 <mzhivkov@vmware.com>
mrMoZ1 added 3 commits June 15, 2022 16:35
Signed-off-by: mrMoZ1 <mzhivkov@vmware.com>
Signed-off-by: mrMoZ1 <mzhivkov@vmware.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants