Join GitHub today
GitHub is home to over 31 million developers working together to host and review code, manage projects, and build software together.
Sign up2.6 deadlocked loading targets page #5082
Comments
This comment has been minimized.
This comment has been minimized.
|
This comment has been minimized.
This comment has been minimized.
|
Lock: https://github.com/prometheus/prometheus/blob/v2.6.0/scrape/manager.go#L189 Looks like its ApplyConfig thats holding the lock, and it itself is blocked in scrapePool.reload on a wg.Wait, which I suspect is blocked on a old scrape look stopping.
|
This comment has been minimized.
This comment has been minimized.
|
There are 162 scrapeloops, anecdotally a bunch are blocks trying to append to the remote write queues. I don't see whos holding that lock yet. |
This comment has been minimized.
This comment has been minimized.
|
Have you been able to troubleshoot the problem? |
simonpasquier
added
the
component/ui
label
Jan 17, 2019
This comment has been minimized.
This comment has been minimized.
mmerrill3
commented
Jan 18, 2019
|
I'm being hit by this issue as well. +1 |
This comment has been minimized.
This comment has been minimized.
|
Yeah I’ve had reports from others too. We’re investigating.
…On Fri, 18 Jan 2019 at 19:48, Michael Merrill ***@***.***> wrote:
I'm being hit by this issue as well. +1
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#5082 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAbGhUcuQPKv-j9sMYE0aDPz5aU10q3nks5vEiUUgaJpZM4Z1jrC>
.
|
This comment has been minimized.
This comment has been minimized.
mmerrill3
commented
Jan 18, 2019
|
I've also seen it when the remote write queues are full. When I turn of remote writes, its all good, no issues. This bit looks suspicious to me, especially if enqueue fails. prometheus/storage/remote/queue_manager.go Lines 222 to 224 in 24f19f0 |
This comment has been minimized.
This comment has been minimized.
michael-doubez
commented
Jan 22, 2019
|
I have also had this issue; in particular when changing a target from one job to another. |
This comment has been minimized.
This comment has been minimized.
|
@tomwilkie is this issue still relevant considering that the remote write code has changed a lot in 2.8? |
This comment has been minimized.
This comment has been minimized.
michael-doubez
commented
Apr 10, 2019
|
Since I switched to 2.8.1, I now longer have slowness and freeze when reloading. |
This comment has been minimized.
This comment has been minimized.
|
Closing, feel free to re-open if it still occurs with 2.9. |
tomwilkie commentedJan 8, 2019
See https://gist.github.com/tomwilkie/43b99a28cebe39d22e7c3b6e12c545bd for stack