Join GitHub today
GitHub is home to over 31 million developers working together to host and review code, manage projects, and build software together.
Sign upProm2: 'Out of bounds' tsdb error drops target entirely #2894
Comments
This comment has been minimized.
This comment has been minimized.
|
This looks like a trivial fix where we need to add a case to this switch statement. Will send a PR soon. |
gouthamve
pushed a commit
to gouthamve/prometheus
that referenced
this issue
Jul 4, 2017
gouthamve
referenced this issue
Jul 4, 2017
Merged
Handle scrapes with OutOfBounds metrics better #2897
fabxc
closed this
Jul 5, 2017
This comment has been minimized.
This comment has been minimized.
|
I've built and deployed at commit 24e9dea but the issue persists. I'll try to track down what seems to have been missed, but this issue is about items in the far past not the future just to be clear. These are the only relevant log lines I see. I never see these and when I turned on debug these never came either. Can this be reopened? |
gouthamve
reopened this
Jul 6, 2017
gouthamve
self-assigned this
Jul 6, 2017
gouthamve
referenced this issue
Jul 6, 2017
Merged
Convert tsdb.ErrOutOfBounds to a storage error #2906
This comment has been minimized.
This comment has been minimized.
|
Fixed for good in #2906 Can you give it another try @johrstrom? |
gouthamve
closed this
Jul 7, 2017
This comment has been minimized.
This comment has been minimized.
|
Confirmed, thank your the quick support. |
jhooyberghs
referenced this issue
Mar 8, 2018
Closed
Metrics exporter dropped when some of the metrics are too old to ingest / idle (msg="append failed" err="out of bounds") #3930
anandsinghkunwar
referenced this issue
Jun 11, 2018
Open
Prometheus not scraping metrics `adding stale sample failed` #4249
This comment has been minimized.
This comment has been minimized.
lock
bot
commented
Mar 23, 2019
|
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
johrstrom commentedJul 3, 2017
What did you do?
I'm scraping NetData targets that give the timestamp with the metric in the response as seen below. Sometimes some metrics are not often updated (or ever after they're created) leading to out of bounds errors.
Here's an example, I've added the comment to the out of bounds metric for the purpose of this ticket.
My settings for block durations seem to be very reasonable.
storage.tsdb.max-block-duration 17h48m0s
storage.tsdb.min-block-duration 2h0m0s
What did you expect to see?
I would expect that only the time series' that are out of bounds be dropped, not the target entirely. Logs only indicate this is a warning. I assume the expected behaviour is to drop only the erroneous time series' during ingestion and append the rest that are correctly within the bounds.
So for example if a target has 3 metrics which are out of bounds, I should ingest all except those three.
What did you see instead? Under which circumstances?
All time series' from a target with any metrics that are 'out of bounds' are being dropped. So even if 1 out of 1,000 is out of bounds, all 1,000 get dropped and the target is considered 'down' due to the out of bounds errors.
I'm a little bit ahead of the 2.0.0-alpha.2 release but behind the current branch.
Logs
Jul 03 21:15:21 ip prometheus[1829]: time="2017-07-03T21:15:21Z" level=warning msg="append failed" err="out of bounds" source="scrape.go:606" target=....