-
Notifications
You must be signed in to change notification settings - Fork 1.7k
Submit Queue / munger: keep state over restarts #1042
Comments
@eparis @mikedanese @fejta maybe we can come up with a list of things we want to save. @mhrgoog It'd be great if we could collect very explicitly in one place the set of things that are getting persisted. |
Right now I am brainstorming and throwing ideas against the wall. I am not sure if this justifies a design doc or not. Here are three potential possibilities:
Pros: Concurrent use should be easier.
Pros: fun with queries and ability to mine information flexibly I think even if we knew what we wanted to save now the list would change. But knowing the size of data we want would help a lot. @fejta you have any ideas of what you are looking for? |
What problem are we trying to fix here? |
@mhrgoog The data in http://submit-queue.k8s.io/#/e2e is great... except it disappears whenever we restart the merge queue. I want it to serialized to a GCS object so we can avoid that. Specifically I need to be able to measure the following each week:
Right now I cannot because whenever we restart the mergebot everything is lost. So I wind up tracking these values since 4 hours ago instead. At this point I am not concerned about fun with queries or concurrency. I want to be able to answer those three questions. |
I recommend a json object over a protobuf.
|
I'm hearing about saving and analyzing lists of time series data. Neither are things I want the mungegithub tools to get much better at. I'd rather mungegithub remained focused on github and automating stuff around github and we build 'something else' to handle the data/visualization aspects we have been adding. Should we be dumping (or having something poll) the I'm the one who started the process of showing stats in the submit-queue, but as we want more we're probably best to ask what the right solution is, not what continues to be 'easy' to bolt onto the side... Especially since in my mind the thing that would be GREAT to save across reboot is the proxied cache of github object state. So we don't have such a slow re-start and we don't run out of API tokens on restart... |
I think @eparis is probably right, the best thing would be to publish metrics (I think @apelisse or @rmmh already started publishing via promethieus?) and scrape them regularly so we can get time-series data for this stuff. ...however I'm super interested in getting something working yesterday. I'm OK with different short- and long- term solutions. |
@lavalamp no fights from me. |
Remembering the queue order across restarts would be super handy, too. Right now it's chugging away on an unimportant PR instead of 1.3 PRs. |
I don't think we have an immediate need here now. We keep the github cache and the stat-scraping script handles queue restarts. Closing. |
Could be as simple as storing a static file in GCS or as complex as adding a database to the things we run in the utility cluster.
P0 Requirements:
The text was updated successfully, but these errors were encountered: