overlord: allow max 500 changes in "ready" state to avoid growing changes for 24h #2545

mvo5 · 2017-01-02T15:10:32Z

Changes are kept until they are pruned after 24h, so every change from the last 24h stays in memory. This is not ideal, as LP: #1642068 shows. This PR limits the number of changes kept in the "ready" state to 100.

LP: #1642068


pedronis

lgtm

niemeyer · 2017-01-17T18:18:02Z

overlord/state/state.go

@@ -363,11 +364,16 @@ func (s *State) tasksIn(tids []string) []*Task {
 // Prune removes changes that became ready for more than pruneWait
 // and aborts tasks spawned for more than abortWait.
 // It also removes tasks unlinked to changes after pruneWait.
-func (s *State) Prune(pruneWait, abortWait time.Duration) {
+func (s *State) Prune(pruneWait, abortWait time.Duration, maxReadyChanges int) {


"maxReadyChanges" seems misleading since the comparison below looks at the total count instead of just the ready changes. This should probably be "pruneReadyOver" or similar, and the documentation above should be updated to mention it.

mvo5 · 2017-01-18

Thanks! I updated the name and the documentation now.

niemeyer · 2017-01-17T18:25:18Z

overlord/state/state.go

@@ -379,14 +385,16 @@ func (s *State) Prune(pruneWait, abortWait time.Duration) {
 			}
 			continue
 		}
-		if readyTime.Before(pruneLimit) {
+		// change old or we have too many changes
+		if readyTime.Before(pruneLimit) || (chg.Status().Ready() && len(s.changes) > maxReadyChanges) {


Change statuses aren't super cheap to compute. Best to avoid it in a frequent global iteration like this.

There's also no need in this case I believe. It has to be ready, otherwise readyTime would be zero and we wouldn't be here.

The logic here doesn't seem quite healthy though. This may end up killing changes as soon as they're done, to the point of the client being unable to even see their result.

Let's please talk about this.

mvo5 · 2017-01-18

Nice catch, I removed the chg.Status().Ready() check now.

Good catch on the logic: if 100 changes are in flight, this will always kill the one change that just became ready, which is not what we want! As a strawman I implemented a real maxReadyChanges that counts only the changes that are ready, not the total number of changes. Do you think that is reasonable? Or should we do something else instead, such as a minimum duration (e.g. 10 min) for which a change is allowed to stay even if the limit is reached?

pedronis · 2017-01-17T18:58:03Z

On Jan 17, 2017 19:25, "Gustavo Niemeyer" <notifications@github.com> wrote:

> @niemeyer requested changes on this pull request.
>
> In overlord/state/state.go:
>
> @@ -363,11 +364,16 @@ func (s *State) tasksIn(tids []string) []*Task {
>  // Prune removes changes that became ready for more than pruneWait
>  // and aborts tasks spawned for more than abortWait.
>  // It also removes tasks unlinked to changes after pruneWait.
> -func (s *State) Prune(pruneWait, abortWait time.Duration) {
> +func (s *State) Prune(pruneWait, abortWait time.Duration, maxReadyChanges int) {
>
> "maxReadyChanges" seems misleading since the comparison below looks at the total count instead of just the ready changes. This should probably be "pruneReadyAfter" or similar, and the documentation above should be updated to mention it.

I missed noticing this... maybe we really want to implement maxReadyChanges instead, hmm

pedronis · 2017-01-18T08:03:52Z

overlord/state/state.go

+	sort.Sort(byReadyTime(changes))
+
+	// used just for counting
+	readyChanges := map[string]bool{}


Wondering, isn't a counter enough? Also, given the ordering, can't we just count backward in the main loop (maybe a bit obscure)?

mvo5 changed the title ~~overlord: allow max 100 changes in "ready" state to avoid boundless grow~~ overlord: allow max 100 changes in "ready" state to avoid growing changes for 24h Jan 2, 2017

mvo5 force-pushed the bugfix/unbound-changes branch from 6f747de to 37d86f5 Compare January 2, 2017 16:13

zyga and others added 5 commits January 3, 2017 08:33

tests: move lp-1630479 regression test

503c76f

Signed-off-by: Zygmunt Krynicki <zygmunt.krynicki@canonical.com>

tests: port lp-1630479 test to snapd standards

ef9493f

Signed-off-by: Zygmunt Krynicki <zygmunt.krynicki@canonical.com>

tests: move lp-1618683 regression test

5cc28ae

Signed-off-by: Zygmunt Krynicki <zygmunt.krynicki@canonical.com>

tests: port lp-1618683 test to snapd standards

363d940

Signed-off-by: Zygmunt Krynicki <zygmunt.krynicki@canonical.com>

Allow max 100 changes in "ready" state to avoid boundless grow

3d9ce1d

LP: #1642068

mvo5 force-pushed the bugfix/unbound-changes branch from 37d86f5 to 3d9ce1d Compare January 3, 2017 07:33

mvo5 closed this Jan 3, 2017

mvo5 added 2 commits January 3, 2017 08:34

Merge remote-tracking branch 'upstream/master' into bugfix/unbound-ch…

2cc6dde

…anges

Merge remote-tracking branch 'upstream/master' into bugfix/unbound-ch…

9908a00

…anges

mvo5 reopened this Jan 3, 2017

pedronis approved these changes Jan 17, 2017


niemeyer requested changes Jan 17, 2017


pedronis added this to the 2.22 milestone Jan 17, 2017

mvo5 added 3 commits January 18, 2017 07:45

address review feedback (thanks Gustavo)

99722d0

implement real maxReadyChanges

7a7974e

increase pruneMaxChanges to 500

66e12c5

pedronis reviewed Jan 18, 2017


simplify as suggested by Samuele (thanks!)

75696df

niemeyer approved these changes Jan 18, 2017


mvo5 changed the title ~~overlord: allow max 100 changes in "ready" state to avoid growing changes for 24h~~ overlord: allow max 500 changes in "ready" state to avoid growing changes for 24h Jan 18, 2017

niemeyer merged commit bd1ffb3 into canonical:master Jan 18, 2017
