Support memory qos with cgroups v2 #2570

Open · 4 tasks done
xiaoxubeii opened this issue Mar 14, 2021 · 23 comments
Labels
  • sig/node: Categorizes an issue or PR as relevant to SIG Node.
  • stage/alpha: Denotes an issue tracking an enhancement targeted for Alpha status.
  • tracked/no: Denotes an enhancement issue is NOT actively being tracked by the Release Team.

Comments

@xiaoxubeii (Member) commented Mar 14, 2021

Enhancement Description

@k8s-ci-robot added the needs-sig label Mar 14, 2021
@xiaoxubeii (Member, Author) commented Mar 14, 2021

/sig node

@k8s-ci-robot added the sig/node label and removed the needs-sig label Mar 14, 2021
@xiaoxubeii (Member, Author) commented Mar 14, 2021

/assign @xiaoxubeii

@MadhavJivrajani (Contributor) commented Mar 25, 2021

Hi! This sounds really interesting and I'd love to help out. Please let me know how I can contribute to this!

@ehashman (Member) commented May 4, 2021

/stage alpha
/milestone v1.22

@k8s-ci-robot added the stage/alpha label May 4, 2021
@k8s-ci-robot added this to the v1.22 milestone May 4, 2021
@JamesLaverack added the tracked/yes label May 5, 2021
@xiaoxubeii changed the title from "Support memory qos using cgroups v2" to "Support memory qos with cgroups v2" May 7, 2021
@gracenng (Member) commented May 10, 2021

Hi @xiaoxubeii 👋 1.22 Enhancements Shadow here.

This enhancement is in good shape; here are some minor change requests in light of Enhancement Freeze on Thursday, May 13th:

  • Update kep.yaml file to the latest template
  • In kep.yaml, status is currently provisional instead of implementable
  • Alpha graduation criteria missing
  • KEP not merged to master

Thanks!

@gracenng (Member) commented May 11, 2021

Hi @xiaoxubeii 👋 1.22 Enhancements shadow here.

To help SIGs be aware of their workload, I just wanted to check whether SIG Node will need to do anything for this enhancement and, if so, whether they are OK with it.
Thanks!

@xiaoxubeii (Member, Author) commented May 12, 2021

@gracenng Hey Grace, I have updated the necessary content as follows:

  • update kep.yaml for PRR approval
  • add Alpha graduation criteria

SIG Node approvers @derekwaynecarr and @mrunalp are reviewing it. I am waiting for lgtm/approve so the KEP can be merged as implementable.

@xiaoxubeii (Member, Author) commented May 13, 2021

@gracenng The SIG Node approvers (Derek and Mrunal) have given lgtm/approve. There are a few PRR review requests; I have updated the KEP and am waiting for the next review round. I think we can make it before the freeze date :)

@gracenng (Member) commented May 13, 2021

Hi @xiaoxubeii, it looks like your PRR was approved and the requested changes are all in. I have updated the status of this enhancement to tracked.
Thank you for keeping me updated!

@xiaoxubeii (Member, Author) commented May 13, 2021

Thanks for your help. Also, many thanks for all the valuable review suggestions and help from @derekwaynecarr @mrunalp @bobbypage @giuseppe @odinuge @johnbelamaric @ehashman.
Really appreciate it :)

@ritpanjw commented May 19, 2021

Hello @xiaoxubeii 👋 , 1.22 Docs Shadow here.

This enhancement is marked as Needs Docs for the 1.22 release.
Please follow the steps detailed in the documentation to open a PR against the dev-1.22 branch in the k/website repo. This PR can be just a placeholder at this time and must be created before Fri, July 9, 11:59 PM PDT.
Also, take a look at Documenting for a release to familiarize yourself with the docs requirements for the release.

Thank you!

@xiaoxubeii (Member, Author) commented May 19, 2021

@ritpanjw OK, thanks for the reminder.

@gracenng (Member) commented Jun 23, 2021

Hi @xiaoxubeii 🌞 1.22 enhancements shadow here.

In light of Code Freeze on July 8th, this enhancement's current status is tracked, and we're currently tracking kubernetes/kubernetes#102578 and kubernetes/kubernetes#102970.

Please let me know if there are any other code PRs associated with this enhancement.

Thanks

@xiaoxubeii (Member, Author) commented Jun 24, 2021

@gracenng It is all here, thanks.

@salaxander added the tracked/no label and removed the tracked/yes label Aug 19, 2021
@k8s-triage-robot commented Nov 17, 2021

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label Nov 17, 2021
@xiaoxubeii (Member, Author) commented Nov 18, 2021

This KEP has been released.

@pacoxu (Member) commented Oct 12, 2022

This feature is alpha in v1.22. Any plan to make it beta?

BTW, I have some questions about the KEP.

  1. About the memoryThrottlingFactor: this is tricky. The kernel ability is not fully exposed to the user API (memoryThrottlingFactor is a node-level setting; the app owner cannot set it at the pod level). memory.high is like a soft limit and memory.max is a hard limit.
    Currently, there is only resource.limit. No resource.softlimit.
    See @derekwaynecarr's comment on this in KEP-2570 (#2571 (comment)).

Meanwhile, OOM is controlled by the kernel. If the kubelet handles a pod whose memory usage exceeds the limit, it can easily add an OOMKilled event. For a kernel kill, the kubelet cannot get that directly. If memory.high == resource.limit, the kubelet can kill the pod instead of the kernel OOM killer.

Is there a performance issue if the throttle factor is too small? For example, some pods (Java applications, say) will always use ~85% of their memory, so memory.high will take effect continuously. The processes of the cgroup are then throttled and put under heavy reclaim pressure. Is this a risk of this feature?

  2. memory.low vs memory.min: the kernel ability is not fully exposed to the user API here as well.
    memory.low is like a soft limit and memory.request is a hard request.
    See @mrunalp's comment in KEP-2570 (#2571 (review)).
    Currently, there is only resource.request. No resource.softrequest.

@ehashman (Member) commented Oct 13, 2022

This should not have been closed, as the feature is merely alpha. It either needs to continue graduating or should be deprecated. There is more work to do in either case.

@ehashman reopened this Oct 13, 2022
@k8s-triage-robot commented Nov 12, 2022

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

@k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label Nov 12, 2022
@sftim (Contributor) commented Nov 13, 2022

/remove-lifecycle rotten

What are the barriers to moving this forward to, for example, beta and off by default?

@k8s-ci-robot removed the lifecycle/rotten label Nov 13, 2022
@sftim (Contributor) commented Nov 13, 2022

Should we freeze this issue?

(I'd assume that all alpha and beta features that ship in Kubernetes should have their KEP issues frozen, so that we continue to track the associated work.)

@xiaoxubeii (Member, Author) commented Nov 21, 2022

> This feature is alpha in v1.22. Any plan to make it beta?
>
> BTW, I have some questions about the KEP.
>
> About the memoryThrottlingFactor: this is tricky. The kernel ability is not fully exposed to the user API (memoryThrottlingFactor is a node-level setting; the app owner cannot set it at the pod level). memory.high is like a soft limit and memory.max is a hard limit.
> Currently, there is only resource.limit. No resource.softlimit.
> See @derekwaynecarr's comment on this in KEP-2570 (#2571 (comment)).

Yes. memory.high is more like a soft limit on memory, so in the alpha version we simply set it to (limits.memory or node allocatable memory) * memoryThrottlingFactor, where memoryThrottlingFactor is 0.8 by default at this moment. In other words, we made (limits.memory or node allocatable memory) serve as the resource.softlimit.

> Meanwhile, OOM is controlled by the kernel. If the kubelet handles a pod whose memory usage exceeds the limit, it can easily add an OOMKilled event. For a kernel kill, the kubelet cannot get that directly. If memory.high == resource.limit, the kubelet can kill the pod instead of the kernel OOM killer.

The kubelet will never kill the pod in this case; the kernel does. The kernel kills the container whose memory usage exceeds limits.memory, which is set as memory.max in cgroups v2. There is no additional kubelet implementation in this KEP other than setting the correct cgroups v2 values.

> Is there a performance issue if the throttle factor is too small? For example, some pods (Java applications, say) will always use ~85% of their memory, so memory.high will take effect continuously. The processes of the cgroup are then throttled and put under heavy reclaim pressure. Is this a risk of this feature?

Yes, maybe. The default throttle factor of 0.8 is an experimental value. We will expose it as a kubelet startup parameter when appropriate, maybe at the beta stage.

> memory.low vs memory.min: the kernel ability is not fully exposed to the user API here as well.
> memory.low is like a soft limit and memory.request is a hard request.
> Currently, there is only resource.request. No resource.softrequest.

Actually, memory.low is not yet used in the KEP.
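
For concreteness, here is a minimal Go sketch of the alpha-stage mapping as described in this thread: memory.max carries the hard limit (limits.memory) and memory.high is derived from it via the node-level memoryThrottlingFactor. The helper name and constant are illustrative assumptions, not the actual kubelet implementation.

```go
package main

import "fmt"

// defaultMemoryThrottlingFactor mirrors the alpha default of 0.8 mentioned
// above. Hypothetical constant for illustration only.
const defaultMemoryThrottlingFactor = 0.8

// memoryHigh computes the cgroups v2 memory.high value from the container's
// memory limit in bytes (or node allocatable memory when no limit is set).
func memoryHigh(limitBytes int64, factor float64) int64 {
	return int64(float64(limitBytes) * factor)
}

func main() {
	limit := int64(1 << 30) // limits.memory = 1Gi, written to memory.max
	fmt.Printf("memory.max  = %d\n", limit)
	fmt.Printf("memory.high = %d\n", memoryHigh(limit, defaultMemoryThrottlingFactor))
}
```

With a 1Gi limit this prints memory.max = 1073741824 and memory.high = 858993459, i.e. 80% of the hard limit.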

@pacoxu (Member) commented Dec 2, 2022

Just an idea on the factor.

The current proposal is memory.high = memory.limit * memoryThrottlingFactor.

However, this is a problem when the requested memory is close to the memory limit, because the throttling threshold is too easy to reach. And when users want to set the factor smaller, it may result in no throttling at all, because memory.high < memory.request is not accepted.

  • For instance, if the requested memory is 500Mi, the limit is 1Gi, and the factor is 0.6:
    • Then memory.high would be 600Mi, just 100Mi higher than the requested memory.
  • In another instance, if the requested memory is 800Mi and the limit is 1Gi, then with factor 0.6 no memory.high will be set at all.
  • If the requested memory is not set, memory.high can be assigned 600Mi.

A new proposal would be to base the factor on the requested memory:
memory.high = memory.request + (memory.limit - memory.request) * memoryThrottlingFactor
With the first example above:

  • memory.high would be 500Mi + 300Mi = 800Mi, which leaves reasonable headroom while still throttling.
  • For the second instance, with the requested memory at 800Mi and the limit at 1Gi, factor 0.6 gives memory.high = 920Mi.
  • If the requested memory is not set, memory.high can be set to 600Mi. No change for this case.

Is this a better factor design? I'm not sure whether there are other scenarios for throttling memory.

Limit 1000Mi                current design: memory.high   my proposal: memory.high
request 0Mi,   factor 0.6   600Mi                         600Mi
request 500Mi, factor 0.6   600Mi                         800Mi
request 800Mi, factor 0.6   max                           920Mi
request 1Gi,   factor 0.6   max                           max
request 0Mi,   factor 0.8   800Mi                         800Mi
request 500Mi, factor 0.8   800Mi                         900Mi
request 800Mi, factor 0.8   max                           960Mi
request 500Mi, factor 0.4   max                           700Mi

calculation method: current design = memory.limit * memoryThrottlingFactor; my proposal = memory.request + (memory.limit - memory.request) * memoryThrottlingFactor
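
To sanity-check the comparison, here is a small, self-contained Go sketch that reproduces the table above under both designs. It is illustrative only; the function names and the rule for when memory.high is left as "max" (the computed value not exceeding the request, or reaching the limit) are my assumptions, not kubelet code.

```go
package main

import "fmt"

// currentHigh implements the current design: memory.high = limit * factor.
// It returns "max" (memory.high unset) when the computed value does not
// exceed the request, since memory.high < memory.request is not accepted.
func currentHigh(request, limit int64, factor float64) string {
	high := int64(float64(limit) * factor)
	if high <= request || high >= limit {
		return "max"
	}
	return fmt.Sprintf("%dMi", high)
}

// proposedHigh implements the proposal:
// memory.high = request + (limit - request) * factor.
func proposedHigh(request, limit int64, factor float64) string {
	high := request + int64(float64(limit-request)*factor)
	if high >= limit {
		return "max"
	}
	return fmt.Sprintf("%dMi", high)
}

func main() {
	limit := int64(1000) // Mi
	cases := []struct {
		request int64
		factor  float64
	}{
		{0, 0.6}, {500, 0.6}, {800, 0.6}, {1000, 0.6},
		{0, 0.8}, {500, 0.8}, {800, 0.8}, {500, 0.4},
	}
	for _, c := range cases {
		fmt.Printf("request %4dMi factor %.1f  current=%-6s proposal=%s\n",
			c.request, c.factor,
			currentHigh(c.request, limit, c.factor),
			proposedHigh(c.request, limit, c.factor))
	}
}
```

Its output matches the table row for row, including the two cases where the current design degenerates to "max" while the proposal still produces a throttling threshold between request and limit.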
