Update fleet grafana dashboards #4091

p-se · 2024-06-14T10:38:57Z

Issue:

Problem

For the existing Fleet metrics (and Prometheus configuration to scrape those), Grafana dashboards have recently been added. They were pretty raw and barely usable. This PR updates them (see screenshots at the bottom).

Solution

Update Grafana dashboards for visualizing Fleet metrics.

Testing

On a cluster with Rancher and a fleet version >= v0.10.0-rc13, install the rancher-monitoring chart that includes the changes of this PR.
Open Grafana through the Rancher UI (section "monitoring") and look for 5 dashboards whose name starts with "Fleet". You will not find those dashboards embedded in the Rancher UI.
You will need to create a GitRepo to start seeing GitRepo metrics.
Check if all dashboards are showing "data". Note that, depending on the age of the monitoring installation, the time range may need to be decreased to start seeing visualizations in those dashboards.

Engineering Testing

Manual Testing

See Testing.

Automated Testing

Testing up to fetching and testing metrics from fleet-controller in https://github.com/rancher/fleet/tree/main/e2e/metrics. It does not cover the use of the ServiceMonitor to configure the Prometheus instance to scrape those metrics nor anything with Grafana.

QA Testing Considerations

No additional testing required, tests as described in the Testing section are deemed sufficient.

Regressions Considerations

I cannot think of any cases in which updating (not yet shipped) Grafana dashboards would conflict with existing dashboards.

Backporting considerations

Does not need to be backported.

Screenshots

Part of rancher/fleet#2460

github-actions · 2024-06-14T10:39:11Z

Validation steps

Ensure all container images have repository and tag on the same level to ensure that all container images are included in rancher-images.txt which are used by airgap customers.

  Ex:-
    longhorn-controller:
      repository: rancher/hardened-sriov-cni
      tag: v2.6.3-build20230913

Add a 👍 (thumbs up) reaction to this comment once done. CI won't pass without this reaction to the github-action bot's latest validation comment.
Approve the PR to run the CI check.

thehejik

I checked all 5 modified Grafana dashboards for Fleet and they are showing reasonable data. Also Prometheus Targets and Prometheus Graph are working as expected for fleet resources.

I was playing a bit with creating Fleet ClusterGroups and assigning various clusters there together with using filters in Grafana, all good.

mallardduck

@p-se - looks pretty good to me overall. ~~The PR is out of date with the HEAD, so other than updating for that before merge~~ LGTM. (Edit - weird after I submitted the review it stopped showing the outdated branch thing)

p-se · 2024-06-18T15:24:38Z

@thehejik @mallardduck
Thank you very much for testing and the review!

github-actions · 2024-06-20T14:51:01Z

Validation steps

Ensure all container images have repository and tag on the same level to ensure that all container images are included in rancher-images.txt which are used by airgap customers.

  Ex:-
    longhorn-controller:
      repository: rancher/hardened-sriov-cni
      tag: v2.6.3-build20230913

Add a 👍 (thumbs up) reaction to this comment once done. CI won't pass without this reaction to the github-action bot's latest validation comment.
Approve the PR to run the CI check.

p-se added 2 commits June 14, 2024 12:24

Update Grafana dashboards for Fleet

e3e71ae

Part of rancher/fleet#2460

make charts

31c91b4

p-se requested a review from a team as a code owner June 14, 2024 10:38

thehejik approved these changes Jun 17, 2024

View reviewed changes

mallardduck approved these changes Jun 17, 2024

View reviewed changes

p-se mentioned this pull request Jun 19, 2024

Add Grafana dashboard for Fleet performance #4106

Merged

p-se requested a review from a team June 20, 2024 12:59

recena requested review from a team and removed request for a team June 20, 2024 13:01

Merge branch 'dev-v2.9' into update-fleet-grafana-dashboards

55f46dc

thehejik merged commit f82ffa9 into rancher:dev-v2.9 Jun 21, 2024
5 checks passed

krunalhinguu pushed a commit to krunalhinguu/charts that referenced this pull request Jul 15, 2024

Update fleet grafana dashboards (rancher#4091)

6f5b04f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update fleet grafana dashboards #4091

Update fleet grafana dashboards #4091

p-se commented Jun 14, 2024

github-actions bot commented Jun 14, 2024

thehejik left a comment

mallardduck left a comment •

edited

Loading

p-se commented Jun 18, 2024

github-actions bot commented Jun 20, 2024

Update fleet grafana dashboards #4091

Update fleet grafana dashboards #4091

Conversation

p-se commented Jun 14, 2024

Issue:

Problem

Solution

Testing

Engineering Testing

Manual Testing

Automated Testing

QA Testing Considerations

Regressions Considerations

Backporting considerations

Screenshots

github-actions bot commented Jun 14, 2024

Validation steps

thehejik left a comment

Choose a reason for hiding this comment

mallardduck left a comment • edited Loading

Choose a reason for hiding this comment

p-se commented Jun 18, 2024

github-actions bot commented Jun 20, 2024

Validation steps

mallardduck left a comment •

edited

Loading