feat(arc): add Prometheus monitoring and Grafana dashboard #40

Merged
xnoto merged 1 commit into main from fix/arc-metrics-monitoring
Apr 30, 2026

Conversation

xnoto (Contributor) commented Apr 30, 2026

Summary

  • expose ARC controller and listener metrics for Prometheus scraping
  • add ServiceMonitor resources for the ARC system and the arc-tf runner scale set
  • vendor and adapt the upstream ARC Grafana dashboard for the repo's managed Grafana instance
  • update CODEOWNERS to assign the repo to @makeitworkcloud/admins

Details

This adds ARC monitoring that fits the cluster's existing kube-prometheus-stack and Grafana setup.

ARC controller metrics are enabled on the controller chart, and listener metrics are enabled on the arc-tf scale set. The new monitoring resources expose and scrape:

  • the ARC controller in arc-systems
  • the arc-tf listener in arc-runners
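As a rough illustration of the expose-and-scrape pattern described above, the controller side might look something like the following. This is a sketch, not the contents of workloads/arc/metrics-monitoring.yaml: the resource names, the pod selector label, and port 8080 are all assumptions. A kube-prometheus-stack installation may also require an extra label on the ServiceMonitor (often a `release` label) before Prometheus selects it.

```yaml
# Sketch only: names, labels, and the port number are assumptions,
# not values taken from this PR.
apiVersion: v1
kind: Service
metadata:
  name: arc-controller-metrics          # hypothetical name
  namespace: arc-systems
  labels:
    app.kubernetes.io/name: arc-controller-metrics   # hypothetical label
spec:
  selector:
    app.kubernetes.io/name: gha-rs-controller        # assumed controller pod label
  ports:
    - name: metrics
      port: 8080                        # assumed metrics port
      targetPort: 8080
---
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: arc-controller                  # hypothetical name
  namespace: arc-systems
spec:
  selector:
    matchLabels:
      app.kubernetes.io/name: arc-controller-metrics  # matches the Service above
  endpoints:
    - port: metrics                     # refers to the named Service port
      path: /metrics
      interval: 30s
```

The ServiceMonitor selects a Service rather than pods directly, which is why a Service with a named port is needed even though nothing else in the cluster consumes it. The listener would follow the same shape with its own selector and namespace.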

The Grafana side now includes an adapted version of ARC's upstream autoscaling runner set dashboard, wired to the cluster's default Prometheus datasource. The dashboard was patched to remove the scale set names and namespaces hardcoded in the upstream sample, so it works with this repo's arc-tf configuration.

This PR also keeps minRunners: 1 for arc-tf, as intended.
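For context, in the runner scale set chart's Helm values this setting would look roughly like the excerpt below. How the values are embedded in workloads/arc/arc-tf-application.yaml is not shown in this PR description, so treat the placement as an assumption.

```yaml
# Hypothetical excerpt of the Helm values for the arc-tf runner scale set.
# Only minRunners: 1 is stated in the PR; the surrounding layout is assumed.
minRunners: 1   # lower bound on the number of runners kept for the scale set
```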

Validation

  • ran pre-commit run --files .github/CODEOWNERS operators/arc/arcsystem.yaml workloads/arc/arc-tf-application.yaml workloads/arc/kustomization.yaml workloads/arc/metrics-monitoring.yaml workloads/grafana/datasource.yaml workloads/grafana/arc-dashboard.yaml workloads/grafana/kustomization.yaml
  • all checks passed

Notes

  • runner pods themselves do not expose separate ARC-specific metrics; ARC's meaningful metrics surfaces are the controller-manager and scale-set listeners
  • full local kustomize build for workloads/arc was not verified in this environment because ksops generator plugins are disabled here

xnoto self-assigned this Apr 30, 2026
xnoto merged commit 4f70ddc into main Apr 30, 2026
1 check passed
xnoto deleted the fix/arc-metrics-monitoring branch April 30, 2026 17:15