
Tombstone memory improvements #7084

Merged
jwilder merged 13 commits into master from jw-tombstones on Jul 29, 2016

Conversation

jwilder
Contributor

@jwilder jwilder commented Jul 27, 2016

Required for all non-trivial PRs
  • Rebased/mergable
  • Tests pass
  • CHANGELOG.md updated

This PR has a number of improvements to reduce memory usage, improve performance, and fix bugs related to tombstone files.

  • Tombstones were read fully into memory and then applied to TSM files. Since we read TSM files concurrently, this could cause large memory spikes at startup. They are now loaded iteratively, using a more consistent amount of memory.
  • V1 tombstones also read all values into memory. This has been changed to be iterative as well.
  • Deletes to TSM files are now applied concurrently, which should improve delete performance in the engine.
  • When tombstones are loaded, they are applied in batches instead of being read into memory all at once (a sketch of this pattern follows the list).
  • When tombstones were written, a bug caused each series to be added N times, once for each TSM file. This caused tombstone files to be much larger than necessary and much slower to reload.
  • Some TSM cursors were not closed, which could cause TSM files that were both in use by a query and being compacted to leak.
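
A minimal sketch of the batch-apply pattern described in the list; Tombstone and applyBatch are hypothetical stand-ins rather than the actual influxdb types, and in the real code entries are streamed from the tombstone file instead of being held in a slice:

package main

import "fmt"

// Tombstone and applyBatch are hypothetical stand-ins for the real types.
type Tombstone struct {
	Key string
}

// applyBatch stands in for applying one batch of deletes to a TSM file.
func applyBatch(keys []string) error {
	fmt.Printf("applying %d keys\n", len(keys))
	return nil
}

// applyTombstones flushes keys in fixed-size batches instead of
// accumulating every tombstone in memory first.
func applyTombstones(entries []Tombstone) error {
	batch := make([]string, 0, 4096)
	for _, ts := range entries {
		batch = append(batch, ts.Key)
		if len(batch) >= cap(batch) {
			if err := applyBatch(batch); err != nil {
				return err
			}
			batch = batch[:0] // reuse the backing array
		}
	}
	if len(batch) > 0 {
		return applyBatch(batch) // final partial batch
	}
	return nil
}

func main() {
	entries := make([]Tombstone, 10000)
	for i := range entries {
		entries[i] = Tombstone{Key: fmt.Sprintf("series-%d", i)}
	}
	if err := applyTombstones(entries); err != nil {
		fmt.Println(err)
	}
}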

@jwilder jwilder added this to the 1.0.0 milestone Jul 27, 2016
@jsternberg
Contributor

jsternberg commented Jul 27, 2016

I only took a look at the query engine portion of this, but that part's LGTM.

}

resC := make(chan res)
var n int

nit: this is just a style thing, but since you know you're sending a result for each file, couldn't you get rid of n completely and just create a buffered channel of length len(f.files)?

In your gathering loop at the bottom of walkFiles, you would then loop len(resC) times.
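
A minimal sketch of the suggested pattern, with illustrative names (walkFiles, openFile) standing in for the real code; because the channel is sized to the number of files, every send completes without blocking, so no goroutine leaks even if the gathering loop returns early on an error:

package main

import (
	"errors"
	"fmt"
)

// openFile stands in for opening one TSM reader.
func openFile(name string) error {
	if name == "" {
		return errors.New("empty file name")
	}
	return nil
}

func walkFiles(files []string) error {
	// One buffer slot per file: sends never block.
	resC := make(chan error, len(files))
	for _, f := range files {
		go func(f string) {
			resC <- openFile(f)
		}(f)
	}

	// Receive exactly one result per file; no separate counter n needed.
	for range files {
		if err := <-resC; err != nil {
			return err
		}
	}
	return nil
}

func main() {
	fmt.Println(walkFiles([]string{"a.tsm", "b.tsm", "c.tsm"}))
}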

@e-dard
Contributor

e-dard commented Jul 28, 2016

Just a few nits, otherwise LGTM 👍

// struct to hold the result of opening each reader in a goroutine
type res struct {
	err error
}

Seems like you could just do a chan error since there's only one field in the struct.
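
A small self-contained illustration of that simplification (a hypothetical example, not the PR's code):

package main

import (
	"errors"
	"fmt"
)

func main() {
	// With a one-field wrapper struct, the caller has to unwrap .err.
	type res struct{ err error }
	structC := make(chan res, 1)
	structC <- res{err: errors.New("open failed")}
	fmt.Println((<-structC).err)

	// A plain chan error carries the same information with less code.
	errC := make(chan error, 1)
	errC <- errors.New("open failed")
	fmt.Println(<-errC)
}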

@benbjohnson
Contributor

I agree with @e-dard's nits but otherwise LGTM.

}
batch = append(batch, ts.Key)

if len(batch) > 4096 {

len(batch) > 4096 means the original capacity of batch was exceeded and the slice had to grow. Nit, but should this be >= 4096?
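
A small runnable demonstration of the difference, using a toy capacity of 4 instead of 4096:

package main

import "fmt"

func main() {
	// Flushing with ">" only fires after append has already grown the
	// slice past its original capacity.
	b := make([]string, 0, 4)
	for i := 0; i < 5; i++ {
		b = append(b, "k")
		if len(b) > 4 {
			fmt.Printf("> flush: len=%d cap=%d (reallocated)\n", len(b), cap(b))
			b = b[:0]
		}
	}

	// Flushing with ">=" stays within the original backing array.
	b = make([]string, 0, 4)
	for i := 0; i < 5; i++ {
		b = append(b, "k")
		if len(b) >= 4 {
			fmt.Printf(">= flush: len=%d cap=%d (no reallocation)\n", len(b), cap(b))
			b = b[:0]
		}
	}
}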

Tombstones were read fully into memory at startup, which could consume
a lot of RAM and OOM the process if there were a lot of deleted
series and many TSM files.

This now walks the tombstone file and iteratively applies the tombstones,
which uses significantly less RAM.  This may be slightly slower in the
general case, but should scale better.

Use a bufio.Scanner to read v1 tombstones instead of reading the
whole file into memory and parsing it there.
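
A minimal sketch of this approach, assuming a line-oriented v1 file with one entry per line; readV1 and its callback are illustrative, not the exact influxdb parser:

package main

import (
	"bufio"
	"fmt"
	"os"
	"strings"
)

// readV1 streams the file line by line instead of slurping it into
// memory and splitting the whole buffer.
func readV1(path string, fn func(key string) error) error {
	f, err := os.Open(path)
	if err != nil {
		return err
	}
	defer f.Close()

	scanner := bufio.NewScanner(f)
	for scanner.Scan() {
		key := strings.TrimSpace(scanner.Text())
		if key == "" {
			continue
		}
		if err := fn(key); err != nil {
			return err
		}
	}
	return scanner.Err()
}

func main() {
	err := readV1("example.tombstone", func(key string) error {
		fmt.Println("delete:", key)
		return nil
	})
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
	}
}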

This bounds the memory used when reloading a TSM file's tombstones
so that the heap does not grow exceedingly fast and stay there
after the deletes are applied.

If there were multiple TSM files and a delete/drop was run,
we would write the deleted series to the tombstone file N
times, once for each file.  This occurred because FileStore.WalkKeys walks
every key in every TSM file, which can return duplicate keys.

This issue caused tombstone files to be much larger than they should be
and also caused large memory usage during the delete.

Aux and condition iterators were not closed, which could
cause TSM files to leak if they were queried against while
a compaction was running.

If they were left around, re-enabling them again could cause
future compactions to continuously fail.  A restart of the
server would clean them up correctly, though.

A break caused the first one to be tracked, and all others would
leak as temp files that would not be removed until the server
restarted.

The path info only contained the file name, which caused tombstone
files to not be removed if there were queries running against
a file that was being compacted.

This is now consistent with TSMReader.Path, which returns the
full path info.
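
For illustration of why a bare file name breaks removal (hypothetical helper and paths, not the PR's code): a name without its directory resolves against the process's working directory, so a later remove silently misses the real file.

package main

import (
	"fmt"
	"path/filepath"
)

// tombstonePath is a hypothetical helper: returning the full path,
// consistent with what a TSMReader.Path-style method returns for the
// data file, keeps later removal and comparisons correct.
func tombstonePath(dir, name string) string {
	return filepath.Join(dir, name)
}

func main() {
	dir := "/var/lib/influxdb/data"
	name := "000001.tombstone"

	// os.Remove(name) would resolve against the working directory,
	// not the shard directory that actually holds the file.
	fmt.Println(name)
	fmt.Println(tombstonePath(dir, name))
}
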
@jwilder jwilder merged commit 37674d2 into master Jul 29, 2016
@jwilder jwilder deleted the jw-tombstones branch July 29, 2016 02:45
@oiooj
Contributor

oiooj commented Aug 15, 2016

👍
