New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Event intervals for Startup Probe failures #27612
Add Event intervals for Startup Probe failures #27612
Conversation
ebe7a1b
to
cca43ae
Compare
e2echart/e2e-chart-template.html
Outdated
@@ -223,6 +230,9 @@ <h5 class="modal-title">Resource</h5> | |||
return [item.locator, ` (kubelet container readiness)`, "ContainerReadinessErrored"]; | |||
} | |||
} | |||
if (m && isKubeletStartupProbeFailure(item)){ | |||
return [item.locator, ` (kubelet container readiness)`, "StartupProbeFailed"]; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should the message be named startup instead of readiness since it is a startup probe?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I left it as kubelet container readiness
because the StartupProbeFailed is still part of container readiness and the event came from reading the kubelet log.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There are three different probes with kubelet: liveness, readiness and startup. I assume you want to use a separate startup instead of readiness to indicate the kind of probe this is about.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
you are right -- thanks!
d11e126
to
73e1933
Compare
/lgtm |
@DennisPeriquet: The following tests failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
/approve |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: DennisPeriquet, dgoodwin, xueqzhan The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
TRT-724
The kubelet logs contain the startup probe errors. We extract them (similar to how the readiness probe events were extracted) so they will be represented in the event Intervals and the Interval charts.
Sample output:
Ready
state. You can see them in this chart for pods (e.g.,redhat-operators-5nk4h
) that are in theopenshift-marketplace
namespace.output="
andoutput=<
) are represented:NOTE: the case where the log ends with
output=<
is for multi-line output. I purposely did not add the logic to capture the text in the multiline output. If we feel like this info is valuable in the future, we may pursue this.