Skip to content

Commit

Permalink
fix(sli): allow Unknown status code in SLI error rate (#149) (#150) (
Browse files Browse the repository at this point in the history
…#151)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Stephan Hesselmann <shesselm@redhat.com>
  • Loading branch information
github-actions[bot] and stehessel committed Oct 3, 2023
1 parent ebe6d1f commit 4a2c6ff
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion resources/prometheus/prometheus-rules.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -351,9 +351,10 @@ spec:
# The error rate over the last 10 minutes must be smaller than 35% to count as available.

# GRPC
# TODO(ROX-19917): Re-add `grpc_code="Unknown"` to the list of server errors.
- expr: |
sum by (namespace, rhacs_instance_id, rhacs_org_id, rhacs_org_name, rhacs_cluster_name, rhacs_environment)
(rate(grpc_server_handled_total{namespace=~"rhacs-.*", job="central", grpc_type="unary", grpc_service!="v1.PingService", grpc_code!~"DeadlineExceeded|Internal|Unavailable|Unknown"}[10m]))
(rate(grpc_server_handled_total{namespace=~"rhacs-.*", job="central", grpc_type="unary", grpc_service!="v1.PingService", grpc_code!~"DeadlineExceeded|Internal|Unavailable"}[10m]))
record: central:grpc_server_handled:server_available_code:rate10m
- expr: |
Expand Down

0 comments on commit 4a2c6ff

Please sign in to comment.