Move pg_stat_replication queries to collector package #966

SuperQ · 2023-11-23T16:09:03Z

Proposal

There are existing queries for pg_stat_replication in cmd/postgres_exporter/queries.go. These metrics should be migrated to the collector package.


ARPABoy · 2023-11-23T21:40:18Z

This affects replication monitoring: if only pg_up and pg_replication_lag_seconds are monitored on the Secondary servers and there is a network outage between the Primary and Secondary servers, the Secondary servers fall behind without any alarm being triggered.

It seems more reasonable to monitor replication using data from the Primary server:
SELECT COUNT(*) FROM pg_stat_replication WHERE client_addr = 'SLAVE_IP' AND state = 'streaming';
If it returns 0, the Secondary server is unreachable.

SELECT COALESCE(EXTRACT(EPOCH FROM replay_lag)::bigint, 0) AS replay_lag FROM pg_stat_replication WHERE client_addr = 'SLAVE_IP';
If it returns more than X, the Secondary server is lagging.
