ROX-15980 observability resource requests and limits by ludydoo · Pull Request #993 · stackrox/acs-fleet-manager

ludydoo · 2023-04-27T11:57:16Z

Sets the resource requests and limits for observability components

The values were derived from the observed metrics on prometheus/grafana.

ludydoo · 2023-04-27T13:38:52Z

/retest

stehessel · 2023-04-27T14:10:41Z

dp-terraform/helm/rhacs-terraform/charts/observability/values.yaml

+  resources:
+    requests:
+      cpu: 1500m
+      memory: 18Gi


Do we need 18Gi on all clusters? Perhaps the request should be smaller than the limit to accommodate smaller systems?

I figured 18Gi would be a minimum, since for stage it currently hovers around ~10Gi with not that many centrals on it..

But it's ok if the actual usage is above the request, right? As long as it's smaller than the limit.

Yes, though if request < limits, it's not a guaranteed QOS and the pod might be descheduled. My initial thought on prometheus, alertmanager & so on, is that since it drives all of our alerts, I would consider this as a critical component that should have a guaranteed QOS. But it's just my personal opinion. Please feel free to suggest other values to put as requests / limits.

Fair enough. It's a pity it takes so many resources, but I guess it is what it is.

porridge

Would this fail on deployment if there's a typo somewhere here, or do we need to check by hand if the deployed values match our intention?

openshift-ci · 2023-04-28T10:07:45Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ludydoo, porridge, stehessel

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~OWNERS~~ [porridge]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

ROX-15980 observability resource requests and limits

4954d0b

ludydoo requested review from kylape and porridge April 27, 2023 11:57

ludydoo temporarily deployed to development April 27, 2023 11:57 — with GitHub Actions Inactive

ludydoo had a problem deploying to development April 27, 2023 11:57 — with GitHub Actions Failure

ludydoo temporarily deployed to development April 27, 2023 11:57 — with GitHub Actions Inactive

ROX-15980 whoops

ba55364

ludydoo temporarily deployed to development April 27, 2023 12:58 — with GitHub Actions Inactive

ludydoo requested a review from stehessel April 27, 2023 13:39

Update values.yaml

91c096e

ludydoo temporarily deployed to development April 27, 2023 13:49 — with GitHub Actions Inactive

stehessel reviewed Apr 27, 2023

View reviewed changes

ludydoo requested a review from stehessel April 27, 2023 15:48

stehessel approved these changes Apr 27, 2023

View reviewed changes

openshift-ci bot assigned stehessel Apr 27, 2023

openshift-ci bot added the lgtm label Apr 27, 2023

porridge approved these changes Apr 28, 2023

View reviewed changes

openshift-ci bot assigned porridge Apr 28, 2023

openshift-ci bot added the approved label Apr 28, 2023

ludydoo merged commit 8620c55 into main May 2, 2023

ludydoo deleted the ROX-15980-observability-resources-requests-and-limits branch May 2, 2023 08:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ROX-15980 observability resource requests and limits#993

ROX-15980 observability resource requests and limits#993
ludydoo merged 3 commits intomainfrom
ROX-15980-observability-resources-requests-and-limits

ludydoo commented Apr 27, 2023

Uh oh!

ludydoo commented Apr 27, 2023

Uh oh!

stehessel Apr 27, 2023

Uh oh!

ludydoo Apr 27, 2023

Uh oh!

stehessel Apr 27, 2023

Uh oh!

ludydoo Apr 27, 2023 •

edited

Loading

Uh oh!

stehessel Apr 27, 2023

Uh oh!

porridge left a comment

Uh oh!

openshift-ci bot commented Apr 28, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ludydoo commented Apr 27, 2023

Uh oh!

ludydoo commented Apr 27, 2023

Uh oh!

stehessel Apr 27, 2023

Choose a reason for hiding this comment

Uh oh!

ludydoo Apr 27, 2023

Choose a reason for hiding this comment

Uh oh!

stehessel Apr 27, 2023

Choose a reason for hiding this comment

Uh oh!

ludydoo Apr 27, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

stehessel Apr 27, 2023

Choose a reason for hiding this comment

Uh oh!

porridge left a comment

Choose a reason for hiding this comment

Uh oh!

openshift-ci bot commented Apr 28, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ludydoo Apr 27, 2023 •

edited

Loading