
Pod status updates take longer to propagate to the API than necessary #116617

Open
smarterclayton opened this issue Mar 14, 2023 · 9 comments
Labels: kind/bug, needs-triage, sig/node, sig/scalability

Comments

@smarterclayton (Contributor) commented Mar 14, 2023

What happened?

#107897 originally identified a number of inefficiencies in how pod status is reported back to the API:

  1. Pods are queued in a channel and then processed one by one. This causes head-of-line blocking and writes every observed status change, even though pod status is level driven (split into #116615, "kubelet: Remove status manager channel", for inclusion into #115331, "Give terminal phase correctly to all pods that will not be restarted").
  2. Status is a GET followed by a PATCH, but we now have server-side apply (SSA); using it would reduce two round trips to one and reduce QPS (see the sketch just after this list).
  3. We treat all status updates the same, when in reality certain transitions should be prioritized (unready -> ready, Running -> Succeeded|Failed).
  4. Status writes are single threaded; we could easily run multiple workers.
  5. It's hard to know when our status cache has been brought up to date (so that we can bypass an SSA attempt); we could use ResourceVersion tracking for that (apimachinery has proposed allowing it to be parsed).
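
As a rough illustration of item 2, here is a minimal sketch of what a single-round-trip status write could look like using client-go's typed apply configurations and `ApplyStatus`. This is a hedged sketch under stated assumptions, not the kubelet's actual status manager code; the package name, function name, and the particular fields copied are illustrative.

```go
package statussketch

import (
	"context"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	corev1apply "k8s.io/client-go/applyconfigurations/core/v1"
	"k8s.io/client-go/kubernetes"
)

// applyPodStatus writes pod status in a single request via server-side
// apply, replacing the GET-then-PATCH pair described above. Illustrative
// sketch only, not the kubelet's real status manager.
func applyPodStatus(ctx context.Context, client kubernetes.Interface, pod *corev1.Pod) error {
	status := corev1apply.PodStatus().WithPhase(pod.Status.Phase)
	for _, c := range pod.Status.Conditions {
		status = status.WithConditions(corev1apply.PodCondition().
			WithType(c.Type).
			WithStatus(c.Status).
			WithLastTransitionTime(c.LastTransitionTime))
	}
	apply := corev1apply.Pod(pod.Name, pod.Namespace).WithStatus(status)

	// One round trip: the server merges the fields owned by this field
	// manager; Force resolves conflicts in the kubelet's favor.
	_, err := client.CoreV1().Pods(pod.Namespace).ApplyStatus(ctx, apply,
		metav1.ApplyOptions{FieldManager: "kubelet", Force: true})
	return err
}
```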

In general these contribute to #113606 and should be improved.

/sig node
/sig scalability

What did you expect to happen?

Faster propagation of pod status updates to the API server.

How can we reproduce it (as minimally and precisely as possible)?

See kubelet logs. Several e2e tests stress this flow, but we need a test that shows the improvements.

Anything else we need to know?

No response

Kubernetes version

No response

Cloud provider

No response

OS version

No response

Install tools

No response

Container runtime (CRI) and version (if applicable)

No response

Related plugins (CNI, CSI, ...) and versions (if applicable)

No response

@smarterclayton added the kind/bug label Mar 14, 2023
@k8s-ci-robot added the needs-sig and needs-triage labels Mar 14, 2023
@smarterclayton changed the title from "Pod status updates take longer to propagate back to the API than necessary" to "Pod status updates take longer to propagate to the API than necessary" Mar 14, 2023
@sftim (Contributor) commented Mar 14, 2023

> We treat all status updates the same, when in reality certain transitions should be prioritized (unready -> ready, Running -> Succeeded|Failed)

One key actual-state report for me is seeing that the kubelet has at least observed the pod and, if you like, acknowledged its responsibility for further status updates. I think that's the update that sets status.startTime.
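
As a hedged sketch of that acknowledgement write, reusing the apply-configuration imports and the `client`/`pod` variables from the sketch earlier in this thread (all illustrative): a first SSA that claims only status.startTime might look like this.

```go
// Minimal "the kubelet has seen this pod" acknowledgement: apply only
// status.startTime and nothing else. Illustrative, not actual kubelet code.
ack := corev1apply.Pod(pod.Name, pod.Namespace).
	WithStatus(corev1apply.PodStatus().WithStartTime(metav1.Now()))
_, err := client.CoreV1().Pods(pod.Namespace).ApplyStatus(ctx, ack,
	metav1.ApplyOptions{FieldManager: "kubelet"})
```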

@smarterclayton (Contributor, Author) commented Mar 14, 2023

Right, that's another good one. Please add others. Note, however, that a pod status round trip may take 50ms, and some containers might start and become ready within that time. It would be good to think through how we want to bias toward these transitions, or whether workload characteristics matter.

@pacoxu (Member) commented Mar 15, 2023

/sig node
/sig scalability

@k8s-ci-robot added the sig/node and sig/scalability labels and removed the needs-sig label Mar 15, 2023
@SergeyKanzhelev added this to Triage in SIG Node Bugs Mar 15, 2023
@sftim (Contributor) commented Mar 16, 2023

With SSA, the kubelet can fire off multiple PATCH requests that (the kubelet believes) can arrive at the API server in an arbitrary order and still produce the right outcome. I mean, that's the dream, right?

@smarterclayton (Contributor, Author) commented

Right. The kubelet would simply need to guarantee that no two SSAs for the same pod are in flight at once.

@sftim (Contributor) commented Mar 16, 2023

> The kubelet would simply need to guarantee that no two SSAs for the same pod are in flight at once.

Not even that: no two conflicting SSAs. For example, I think we can send a request to apply status.startTime and, regardless of its outcome, follow that up with a request that applies status.startTime (to the same value!) and also updates status.conditions. The order in which these apply shouldn't matter.
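
A hedged sketch of that pair of requests, in the same illustrative apply-configuration style as the earlier sketches (reusing their imports and the `pod` variable); the point is only that the two writes commute.

```go
// Request 1: apply only status.startTime.
started := metav1.Now()
first := corev1apply.Pod(pod.Name, pod.Namespace).
	WithStatus(corev1apply.PodStatus().WithStartTime(started))

// Request 2: apply the same startTime value again, plus a condition.
// Both requests set startTime to the identical value, and only request 2
// touches conditions, so applying them in either order converges on the
// same object.
second := corev1apply.Pod(pod.Name, pod.Namespace).
	WithStatus(corev1apply.PodStatus().
		WithStartTime(started).
		WithConditions(corev1apply.PodCondition().
			WithType(corev1.PodReady).
			WithStatus(corev1.ConditionTrue)))

// first and second can then each be sent with ApplyStatus as in the
// earlier sketch, in either order.
_, _ = first, second
```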

@SergeyKanzhelev (Member) commented

/triage accepted

@k8s-ci-robot added the triage/accepted label and removed the needs-triage label Mar 22, 2023
@SergeyKanzhelev moved this from Triage to Triaged in SIG Node Bugs Mar 22, 2023
@smarterclayton (Contributor, Author) commented

Re: two in-flight SSAs for non-conflicting operations to the same pod

For our own sanity, we probably don't want to have to reason about that while debugging :)

@k8s-triage-robot commented

This issue has not been updated in over 1 year, and should be re-triaged.

You can:

  • Confirm that this issue is still relevant with /triage accepted (org members only)
  • Close this issue with /close

For more details on the triage process, see https://www.kubernetes.dev/docs/guide/issue-triage/

/remove-triage accepted

@k8s-ci-robot added the needs-triage label and removed the triage/accepted label Apr 16, 2024