crash during startup #4598
Comments
Can you share the flags used to start Prometheus? It looks like you've set
ping @haraldschilly
We downgraded back to version 2.3.2. However, yes, the block durations aren't the defaults.
and well, previously, the max block duration was
Yeah, without steps to reproduce it would be time-consuming to troubleshoot this, so I don't have the time to go down this road. The logs show that all overlapping blocks have the same time ranges, so it probably went into some sort of dead loop, maybe caused by some edge case with your settings and the upgrade path. If you don't think we can figure out the steps to reproduce now, my suggestion is to close the issue and revisit it if it happens again.
Yes, I fully understand :-) Just as an additional datapoint, before I close this, I experimented around with the older version These So, all I want to add is that whatever is going on is not specific to 2.4.0
haraldschilly closed this Oct 1, 2018
There was one specific fix that prevents crash loops after an OOM, so chances are this should be fine in 2.4.
Hashfyre commented Mar 2, 2019
This is happening even in 2.5.0
Many changes have been made since 2.5. Can you please try with 2.7, and if it happens again, please open a new issue with steps to reproduce.
haraldschilly commented Sep 12, 2018
What did you do?
Starting Prometheus with some data from back when it ran version 2.3.1. I then tried 2.4.0 rc0, which crashed, and today I saw this release and it also crashed.
I fear my problem report isn't helping at all, though, and I can't really share the data files. I've deleted them and so far it looks good. In case this happens again I'll amend this with more information.
What did you expect to see?
log continues to run until it is ready to receive requests ...
What did you see instead? Under which circumstances?
sudden stop and crash
Environment
System information:
4.15.0-1009-gcp x86_64
Prometheus version:
/prometheus $ prometheus --version
prometheus, version 2.4.0 (branch: HEAD, revision: 068eaa5)
build user: root@d84c15ea5e93
build date: 20180911-10:46:37
go version: go1.10.3
Logs: