Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(health): Add custom health check for KServe Inference Service resources #14177

Merged
merged 7 commits into from
Jun 23, 2023

Conversation

rachitchauhan43
Copy link
Contributor

@rachitchauhan43 rachitchauhan43 commented Jun 22, 2023

Context

This PR adds the custom health check for KServe Inference Service Resource

Note on DCO:

If the DCO action in the integration test fails, one or more of your commits are not signed off. Please click on the Details link next to the DCO action for instructions on how to resolve this.

Checklist:

  • Either (a) I've created an enhancement proposal and discussed it with the community, (b) this is a bug fix, or (c) this does not need to be in the release notes.
  • The title of the PR states what changed and the related issues number (used for the release note).
  • The title of the PR conforms to the Toolchain Guide
  • I've included "Closes [ISSUE #]" or "Fixes [ISSUE #]" in the description to automatically close the associated issue.
  • I've updated both the CLI and UI to expose my feature, or I plan to submit a second PR with them.
  • Does this PR require documentation updates?
  • I've updated documentation as required by this PR.
  • Optional. My organization is added to USERS.md.
  • I have signed off all my commits as required by DCO
  • I have written unit and/or e2e tests for my change. PRs without these are unlikely to be merged.
  • My build is green (troubleshooting builds).
  • My new feature complies with the feature status guidelines.
  • I have added a brief description of why this PR is necessary and/or what this PR solves.

Please see Contribution FAQs if you have questions about your pull-request.

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>
end
end
end
if status_true == 3 and status_false == 0 and status_unknown == 0 then
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is status_true == 3 necessary and sufficient? I see there are four condition types which may increment status_true: IngressReady, PredictorConfigurationReady, PredictorReady, PredictorRouteReady. Do we need all 5, or is some subset sufficient?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We need all 5 to be true. Added that.

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>
Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>
rachitchauhan43 and others added 2 commits June 22, 2023 14:00
Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>
…lth.lua

Co-authored-by: Michael Crenshaw <350466+crenshaw-dev@users.noreply.github.com>
Signed-off-by: Rachit Chauhan <rachitchauhan43@gmail.com>
@crenshaw-dev crenshaw-dev changed the title feat: Add custom health check for KServe Inference Service resources feat(health): Add custom health check for KServe Inference Service resources Jun 22, 2023
@codecov
Copy link

codecov bot commented Jun 22, 2023

Codecov Report

Patch and project coverage have no change.

Comparison is base (8f2750c) 49.62% compared to head (4af3eaf) 49.62%.

❗ Current head 4af3eaf differs from pull request most recent head 44b3ff4. Consider uploading reports for the commit 44b3ff4 to get more accurate results

Additional details and impacted files
@@           Coverage Diff           @@
##           master   #14177   +/-   ##
=======================================
  Coverage   49.62%   49.62%           
=======================================
  Files         256      256           
  Lines       43800    43800           
=======================================
  Hits        21736    21736           
  Misses      19932    19932           
  Partials     2132     2132           

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

Comment on lines 2 to 9
#- healthStatus:
# status: Progressing
# message: "PredictorConfigurationReady is Unknown\nPredictorReady is Unknown, since RevisionMissing. Configuration \"hello-world-predictor-default\" is waiting for a Revision to become ready.\nPredictorRouteReady is Unknown, since RevisionMissing. Configuration \"hello-world-predictor-default\" is waiting for a Revision to become ready.\nReady is Unknown, since RevisionMissing. Configuration \"hello-world-predictor-default\" is waiting for a Revision to become ready.\n"
# inputPath: testdata/progressing.yaml
#- healthStatus:
# status: Degraded
# message: "IngressReady is False, since Predictor ingress not created.\nPredictorConfigurationReady is False, since RevisionFailed. Revision \"helloworld-00002\" failed with message: Container failed with: container exited with no error.\nPredictorReady is False, since RevisionFailed. Revision \"helloworld-00002\" failed with message: Container failed with: container exited with no error.\nReady is False, since Predictor ingress not created.\n"
# inputPath: testdata/degraded.yaml
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Wanna uncomment these and see if we get past tests?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes. Just testing one by one. Once this healthy one passes I will uncomment others too.

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>
Copy link
Collaborator

@crenshaw-dev crenshaw-dev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, @rachitchauhan43!

@crenshaw-dev crenshaw-dev merged commit 722f58d into argoproj:master Jun 23, 2023
23 checks passed
yyzxw pushed a commit to yyzxw/argo-cd that referenced this pull request Aug 9, 2023
…sources (argoproj#14177)

* Initial commit for ISVC health check

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>

* Adding test for health check and incorporating review comment.

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>

* Adding test for degraded state

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>

* Testing only healthy scenario

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>

* Update resource_customizations/serving.kserve.io/InferenceService/health.lua

Co-authored-by: Michael Crenshaw <350466+crenshaw-dev@users.noreply.github.com>
Signed-off-by: Rachit Chauhan <rachitchauhan43@gmail.com>

* Uncommenting rest fo the tests

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>

---------

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>
Signed-off-by: Rachit Chauhan <rachitchauhan43@gmail.com>
Co-authored-by: Michael Crenshaw <350466+crenshaw-dev@users.noreply.github.com>
tesla59 pushed a commit to tesla59/argo-cd that referenced this pull request Dec 16, 2023
…sources (argoproj#14177)

* Initial commit for ISVC health check

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>

* Adding test for health check and incorporating review comment.

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>

* Adding test for degraded state

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>

* Testing only healthy scenario

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>

* Update resource_customizations/serving.kserve.io/InferenceService/health.lua

Co-authored-by: Michael Crenshaw <350466+crenshaw-dev@users.noreply.github.com>
Signed-off-by: Rachit Chauhan <rachitchauhan43@gmail.com>

* Uncommenting rest fo the tests

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>

---------

Signed-off-by: rachitchauhan43 <rachitchauhan43@gmail.com>
Signed-off-by: Rachit Chauhan <rachitchauhan43@gmail.com>
Co-authored-by: Michael Crenshaw <350466+crenshaw-dev@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants