NETOBSERV-1322: ACM & netobserv metrics #61


Merged

jotak merged 15 commits into netobserv:main from jotak:acm

Dec 5, 2023

Member

jotak commented Nov 14, 2023

Preparing blog post, and storing some examples

Preview available here: https://github.com/jotak/netobserv-documents/blob/acm/blogs/acm/leverage-metrics-in-acm.md

jotak added 4 commits

November 14, 2023 16:42


          ACM & netobserv metrics

561c963


          Update ACM blog

0f5a04e


          Add an example of diy

986da51


          note on user workload metrics

ac48b12

jotak changed the title ~~ACM & netobserv metrics~~ NETOBSERV-1322: ACM & netobserv metrics


          acm 2.9, more on cardinality, ....

d03e47a

jotak marked this pull request as ready for review

November 22, 2023 09:35

jotak requested a review from skrthomas

November 22, 2023 09:35

jotak added 2 commits

November 22, 2023 10:49


          typo

cbb6c47


          mention other metrics

4f6f7b8

skrthomas reviewed

Contributor

skrthomas left a comment

Nice work, @jotak ! I added a few edits and questions :) Thanks!

acm.md Outdated

Comment on lines 9 to 11

+. Create 2 clusters (or more)
+. Choose one for being the main one / hub: install ACM operator on it; Create a default MultiClusterHub
+. In console top bar, select "all cluster" then start procedure to import an existing cluster

Contributor

skrthomas Nov 22, 2023

Suggested change

- 1. Create 2 clusters (or more)
- 2. Choose one for being the main one / hub: install ACM operator on it; Create a default MultiClusterHub
- 3. In console top bar, select "all cluster" then start procedure to import an existing cluster
+ 1. Create 2 clusters (or more).
+ 2. Choose one cluster as the main one or hub, and install the ACM operator on it.
+ 3. Create a default MultiClusterHub.
+ 4. In the console top bar, select "all cluster" then start the procedure to import an existing cluster.

Contributor

skrthomas Nov 22, 2023

A few comments about these steps:

- I think "create a default MultiClusterHub" should be its own step, or a nested step, rather than a continuation of step 3 with a semicolon.
- Is this a MultiClusterHub custom resource or Operator? I think specifying would be good.
- I'm wondering, should ACM be spelled out? Or is it an approved acronym that all readers would be familiar with? If it's new, I would suggest spelling it out and putting ACM in parentheses for this first mention, then elsewhere you can just use ACM.
- Periods are needed at the end of these steps.
- I also added some "the".

acm.md Outdated

+. In console top bar, select "all cluster" then start procedure to import an existing cluster
+              On each cluster:
+. Install netobserv downstream (user workload prometheus won't work)

Contributor

skrthomas Nov 22, 2023

Suggested change

- 1. Install netobserv downstream (user workload prometheus won't work)
+ 1. Install network observability operator downstream (user workload Prometheus won't work).

Member Author

jotak commented Nov 22, 2023

Oops, I'm sorry @skrthomas, I haven't been clear about that: the acm.md file is an internal draft recipe and I think you can ignore it. The actual blog is just what is in the blogs directory, so mainly the file named leverage-metrics-in-acm.md

jotak commented

acm.md

		@@ -0,0 +1,62 @@
		## Setup ACM with NetObserv metrics

Member Author

jotak Nov 22, 2023

for reviewers: this file is just a recipe for internal purpose, not the blog post; for the blog, look at blogs/acm/leverage-metrics-in-acm.md

Contributor

skrthomas commented Nov 22, 2023

@jotak no worries at all; thanks for the context. You can disregard my ACM acronym comment in this case :)

jotak added 3 commits

November 24, 2023 11:22


          Update blog & yamls to cover install piloted from acm

fdf5346


          do not use default namespace

53795f0


          update

8bd4bbc

jpinsonneau approved these changes

Contributor

jpinsonneau left a comment

Sounds good as is! Don't forget your TODOs 😉
Thanks!


          Apply suggestions from code review

5de9967

Co-authored-by: Julien Pinsonneau <91894519+jpinsonneau@users.noreply.github.com>

OlivierCazade approved these changes

skrthomas reviewed

Contributor

skrthomas left a comment

@jotak I added a few more comments and suggestions. I think this looks great, and feel free to leave the suggestions if you'd rather not implement them.

blogs/acm/leverage-metrics-in-acm.md


		### What is NetObserv?

		Network Observability (NetObserv) is a Red Hat operator providing observability over all the network traffic on a cluster by installing eBPF agents per-node which generate flow logs. These flows are collected, stored, converted into metrics, queried from dashboards and so on. More observability blog posts [here](https://cloud.redhat.com/blog/tag/observability), and NetObserv documentation [there](https://docs.openshift.com/container-platform/4.14/network_observability/network-observability-overview.html).

Contributor

skrthomas Dec 4, 2023

Suggested change

- Network Observability (NetObserv) is a Red Hat operator providing observability over all the network traffic on a cluster by installing eBPF agents per-node which generate flow logs. These flows are collected, stored, converted into metrics, queried from dashboards and so on. More observability blog posts [here](https://cloud.redhat.com/blog/tag/observability), and NetObserv documentation [there](https://docs.openshift.com/container-platform/4.14/network_observability/network-observability-overview.html).
+ Network Observability (NetObserv) is a Red Hat Operator providing observability over all the network traffic on a cluster by installing eBPF agents per-node which generate flow logs. These flows are collected, stored, converted into metrics, queried from dashboards and so on. More observability blog posts [here](https://cloud.redhat.com/blog/tag/observability), and NetObserv documentation [there](https://docs.openshift.com/container-platform/4.14/network_observability/network-observability-overview.html).

blogs/acm/leverage-metrics-in-acm.md Outdated

+              - By declaring metric names to pull
+              - Or by declaring such recording rules
+              The former is easier to configure but in many cases, this is probably not what you want. When pulling metrics from many sources, the key concept to have in mind is [metrics cardinality](https://www.robustperception.io/cardinality-is-key/). The more metrics you configure, the bigger is the impact on Prometheus and Thanos resource usage and performance. "Cardinality" here does not refer to the number of record rules or names that we declare in this configuration - these are called _metric families_ - after all, if you look closely, we only mention four distinct metric families in this config, which isn't a lot. No, what really matters with cardinality is the distinct count of all metric families _and all their combinations of label keys and values_.

Contributor

skrthomas Dec 4, 2023

Suggested change

- The former is easier to configure but in many cases, this is probably not what you want. When pulling metrics from many sources, the key concept to have in mind is [metrics cardinality](https://www.robustperception.io/cardinality-is-key/). The more metrics you configure, the bigger is the impact on Prometheus and Thanos resource usage and performance. "Cardinality" here does not refer to the number of record rules or names that we declare in this configuration - these are called _metric families_ - after all, if you look closely, we only mention four distinct metric families in this config, which isn't a lot. No, what really matters with cardinality is the distinct count of all metric families _and all their combinations of label keys and values_.
+ The former is easier to configure but in many cases, this is probably not what you want. When pulling metrics from many sources, the key concept to have in mind is [metrics cardinality](https://www.robustperception.io/cardinality-is-key/). The more metrics you configure, the bigger the impact on Prometheus and Thanos resource usage and performance. "Cardinality" here does not refer to the number of record rules or names that we declare in this configuration - these are called _metric families_ - after all, if you look closely, we only mention four distinct metric families in this config, which isn't a lot. What really matters with cardinality is the distinct count of all metric families _and all their combinations of label keys and values_.
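
For readers of this thread, the distinction between metric families and cardinality can be illustrated with a minimal shell sketch. It is not taken from the blog post: the metric names, label keys and label values below are hypothetical, and the sample is a hand-written list of series rather than real Prometheus output.

```shell
# Hypothetical sample of time series: name{labels}, one per line.
series='flows_total{cluster="a",namespace="x"}
flows_total{cluster="a",namespace="y"}
flows_total{cluster="b",namespace="x"}
flows_total{cluster="b",namespace="x"}
bytes_total{cluster="a",namespace="x"}'

# Count distinct metric names (metric families): strip the label set first.
count_families() {
  printf '%s\n' "$1" | sed 's/{.*//' | sort -u | wc -l | tr -d ' '
}

# Count distinct name + label combinations: this is what cardinality measures.
count_series() {
  printf '%s\n' "$1" | sort -u | wc -l | tr -d ' '
}

echo "metric families: $(count_families "$series")"
echo "distinct series: $(count_series "$series")"
```

Even in this tiny sample the number of distinct series is larger than the number of families, and each extra label value multiplies it further.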

stleerh reviewed

blogs/acm/leverage-metrics-in-acm.md Outdated


		Proceed until you have created a `MultiClusterObservability` resource.

		Before going further, makes sure the observability stack is up and running:

Contributor

stleerh Dec 4, 2023

Change to "make".

examples/ACM/thanos-s3.sh Outdated

		@@ -0,0 +1,24 @@
		#!/bin/bash

		if [[ "$#" -lt 1 || "$1" = "--help" ]]; then

Contributor

stleerh Dec 4, 2023

Since two arguments are required, it should be -lt 2.
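
A minimal sketch of the check this comment asks for, assuming only what is stated here (two required arguments, plus the `--help` flag from the quoted line). The helper names `needs_usage` and `check` are hypothetical and are not part of thanos-s3.sh. The test is written with POSIX `[ ... ]` so it runs under any `sh`; the script's `[[ ... ]]` form behaves the same way in bash.

```shell
# Hypothetical helper: succeeds when the usage text should be shown,
# i.e. fewer than two arguments were given or help was requested.
needs_usage() {
  [ "$#" -lt 2 ] || [ "$1" = "--help" ]
}

# Small driver that reports the decision instead of exiting,
# so the different cases can be compared side by side.
check() {
  if needs_usage "$@"; then
    echo "usage"
  else
    echo "ok"
  fi
}

check               # no arguments
check only-one      # one argument: rejected with -lt 2, accepted with -lt 1
check first second  # two arguments
check --help x      # help requested
```

With `-lt 1`, the single-argument call would pass validation and the script would only fail later when it reads the missing second argument.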

jotak and others added 4 commits

December 5, 2023 08:59


          Apply suggestions from code review

a72ebb1

Co-authored-by: Sara Thomas <sarthoma@redhat.com>


          typos

0cb4517


          remove TODOs, add credits

98a39b5


          fix thanos arg validation

bf9b066

jotak merged commit 85cdc6c into netobserv:main
