Join GitHub today
GitHub is home to over 31 million developers working together to host and review code, manage projects, and build software together.
Sign up'Error on ingesting out-of-order samples' in 0.18.0 #1585
Comments
This comment has been minimized.
This comment has been minimized.
|
@jrv suggested trying https://gist.github.com/juliusv/dab35d4b17caf1937ff080238f1e7f20 which I will do now. |
RichiH
changed the title
`Out of Order Samples` in 0.18.0
`Error on ingesting out-of-order samples` in 0.18.0
Apr 25, 2016
RichiH
changed the title
`Error on ingesting out-of-order samples` in 0.18.0
'Error on ingesting out-of-order samples' in 0.18.0
Apr 25, 2016
This comment has been minimized.
This comment has been minimized.
|
As a somewhat unexpected update: All those messages came from three machines, all of them running CatOS (properly firewalled, for a specific historical reason, and I would have killed them ages ago). Gut feeling is that the messages are valid and 0.18.0 simply exposes things that 0.17.0 gracefully ignored. |
This comment has been minimized.
This comment has been minimized.
|
As another update: I removed the patch and the CatOS machines and am running with vanilla 0.18.0 from source again. Will continue to monitor the situation. |
This comment has been minimized.
This comment has been minimized.
|
I re-enabled the machines to play with the situation a bit and I think that it would be an acceptable trade-off to print the instance and/or the job name along with numDropped. That way, users have something to dog down with while STDOUT/STDERR is not flooded with crap. |
This comment has been minimized.
This comment has been minimized.
|
Could you share some of the log excerpts generated by https://gist.github.com/juliusv/dab35d4b17caf1937ff080238f1e7f20? I'm interested in the other fields also, not just what the metrics were... |
This comment has been minimized.
This comment has been minimized.
|
Or we re-introduce the detailed logging, but on DEBUG level. |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
|
Sure, that's what I meant. |
This comment has been minimized.
This comment has been minimized.
|
Got a private look at @RichiH's log excerpts. The problem is that samples are coming in with the same timestamp, but different value. So in that case, Prometheus is doing the right thing, but the "out of order" error message is a bit misleading. Should we have a separate error code for "hey, you can't change the value of an existing sample"? |
This comment has been minimized.
This comment has been minimized.
|
To clarify, this actually is from two targets being relabeled into the same On Mon, Apr 25, 2016 at 1:01 PM Julius Volz notifications@github.com
|
This comment has been minimized.
This comment has been minimized.
|
No, it's old machines answering to one query twice and apparently doing it live, so they produce different values every time. I can't say anything about the others seeing this effect, though. |
beorn7
self-assigned this
Apr 25, 2016
This comment has been minimized.
This comment has been minimized.
|
I would bet that it's the same issue for the other people. In 0.17.0, we just didn't check the sample value equality yet, only if the timestamps are the same: 0.17.0: https://github.com/prometheus/prometheus/blob/0.17.0/storage/local/storage.go#L595-L606 Compare to 0.18.0: https://github.com/prometheus/prometheus/blob/0.18.0/storage/local/storage.go#L607-L619 |
brian-brazil
added
the
question
label
Apr 25, 2016
This comment has been minimized.
This comment has been minimized.
|
OK, improved logging merged. The duplicate metrics from node exporter apparently only happen with a pretty old version. I think this can be closed. |
beorn7
closed this
Apr 25, 2016
This comment has been minimized.
This comment has been minimized.
spinus
commented
Jun 29, 2016
|
Is it possible to know which samples are meant to be overridden (or which set of labels cause the problem)? |
This comment has been minimized.
This comment has been minimized.
|
Switch to debug level.
Richard
Sent by mobile; excuse my brevity.
|
This comment has been minimized.
This comment has been minimized.
spinus
commented
Jun 29, 2016
|
ah, sure, I should tried that before asking. Thank you @RichiH. |
This comment has been minimized.
This comment has been minimized.
lock
bot
commented
Mar 24, 2019
|
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |

RichiH commentedApr 25, 2016
As per IRC/mailing list:
After upgrading to 0.18.0, I am seeing tons of
From the ML, several others are seeing the same.