
feat(gc): improve periodic GC logic #73

Merged
merged 1 commit into master on Oct 14, 2019
Conversation

Stebalien
Member

1. Don't timeout a _full_ user-requested GC.
2. Always make sure that closing the datastore can interrupt a GC.
3. Instead of timing out periodic GC, keep going with a delay between iterations.
@Stebalien
Member Author

cc @aarshkshah1992. Could you review this?

-  MaxGcDuration: 1 * time.Minute,
-  GcInterval:    45 * time.Minute,
+  GcInterval:    15 * time.Minute,
+  GcSleep:       10 * time.Second,
Member Author

I reduced these as probabilistically sampling a log every 10 seconds shouldn't be that expensive.

Rationale behind this algorithm:

  1. If we assume that deletes are randomly distributed, one value log being ready for garbage collection should correlate with other value logs being ready.
  2. After we do a full pass through all value logs, we shouldn't need to GC for a while.

That's why I have the short sleep/long sleep system.

Contributor
@Stebalien In your first point, do you mean that if a sample of a value log file "hits" the discard ratio, the probability that a sample in the next log file will do so too goes up?

Could you explain this in a bit more detail?

Member Author

Yes, but actually I'm not so sure about my assumption.

  • Assumption: Deletes are randomly distributed between all the value logs.
  • Conclusion: At any given point in time, all value logs should have approximately the same number of discarded items. Therefore, if any one value log is ready to be garbage collected, others are also likely to be ready for garbage collection.

That's not quite correct. Values in a given value log are temporally correlated so deletes aren't likely to be completely random. However, the fact that enough time has passed for one value log to collect garbage is still a good indication that another value log may also have collected enough garbage for compaction.

@aarshkshah1992
Contributor

@Stebalien LGTM. Just one question to improve my own understanding of how badger GC works.

@Stebalien Stebalien merged commit 7d3125d into master Oct 14, 2019
@Stebalien Stebalien deleted the feat/periodic-gc-redux branch October 14, 2019 11:57