Proposal
Discussed in prometheus/proposals#76 (comment)
We propose with @dashpole to add a built-in (like up) scrape metric with limited cardinality that gives a clue where the problem is (e.g. scrapes_failed_total{area=[client / network / timeout / scrape_limit / memory_limit / storage]).
Main motivation is prometheus/proposals#76 where we propose adding one more, non trivial reason
Please do not create PRs for it yet, we need to first agree if we want this.
Proposal
Discussed in prometheus/proposals#76 (comment)
We propose with @dashpole to add a built-in (like
up) scrape metric with limited cardinality that gives a clue where the problem is (e.g. scrapes_failed_total{area=[client / network / timeout / scrape_limit / memory_limit / storage]).Main motivation is prometheus/proposals#76 where we propose adding one more, non trivial reason
Please do not create PRs for it yet, we need to first agree if we want this.