Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rule evaluation warnings prevent clean shutdown #1343

Closed
spyrexd opened this Issue Jan 25, 2016 · 9 comments

Comments

Projects
None yet
3 participants
@spyrexd
Copy link

spyrexd commented Jan 25, 2016

A sigterm was sent to the prometheus container to cleanly shutdown the process:

docker kill --signal="SIGTERM" prom

After the shutting down message the following log message is logged for each alert rule that is configured:

level=warning msg="Error while evaluating rule "ALERT CritDiskUsage\n\tIF collectd_df_percent_bytes{type=\"free\"} < 5\n\tWITH {severity=\"critical\"}\n\tSUMMARY \"Warning Disk Volume {{$labels.df}} on {{$labels.exported_instance}} is low on space\"\n\tDESCRIPTION \"Waarning Disk Volume {{$lables.df}} on {{$lables.exported_instance}} has less than 5% disk available. Current Available {{$value}}%\"\n\tRUNBOOK \"\"": query was canceled in query queue" source="manager.go:257"

The logs messages keep repeating and prometheus does not shutdown.

@fabxc

This comment has been minimized.

Copy link
Member

fabxc commented Jan 25, 2016

Which revision are you running? (Shown on the /status page in the web UI.)

@spyrexd

This comment has been minimized.

Copy link
Author

spyrexd commented Jan 25, 2016

I am running 0.16.2

@beorn7

This comment has been minimized.

Copy link
Member

beorn7 commented Jan 25, 2016

Further up in the logfile, can you see any messages about suspended ingestion? (There are usually a lot of them.) Then it is #1319 .

@beorn7 beorn7 added the bug label Jan 25, 2016

@spyrexd

This comment has been minimized.

Copy link
Author

spyrexd commented Jan 25, 2016

I do not see any logs mentioning suspended ingestion, but when I looked at the status this morning I did notice it had been over 60 hours since the last scrape of my targets. The long time since last scrape is what prompted me to stop Prometheus.

@fabxc

This comment has been minimized.

Copy link
Member

fabxc commented Jan 26, 2016

@spyrexd We need to know the revision rather than the version since the version is only bumped on releases. Please check the status page again for the revision row.

I do believe that this logging was already fixed and similarly the blocked shutdown, which is unrelated.

@spyrexd

This comment has been minimized.

Copy link
Author

spyrexd commented Jan 26, 2016

The revision is 287d9b2

@fabxc

This comment has been minimized.

Copy link
Member

fabxc commented Feb 2, 2016

We made some changes that might affect this. Can you please build from HEAD and verify whether the problem is still there.
If it is, please share the full log this time and indicate at which point you shut it down? The fact that the above message is printed, has most likely nothing to do with the actual issue.

@beorn7

This comment has been minimized.

Copy link
Member

beorn7 commented Feb 13, 2016

I assume this is fixed. Please re-open if that's not the case.

@beorn7 beorn7 closed this Feb 13, 2016

@lock

This comment has been minimized.

Copy link

lock bot commented Mar 24, 2019

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

@lock lock bot locked and limited conversation to collaborators Mar 24, 2019

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
You can’t perform that action at this time.