Free ephemeral resources used for scanning #70

jcharum · 2020-06-11T15:36:27Z

When we scan results, we get a slice of *openerAtReaders, one for each result task. We read from each reader sequentially. When we are done with a given reader, we retain some of the resources used when reading from it, most notably gob decode buffers. As we scan, we accumulate these defunct buffers, and our memory footprint grows.

This happens for two reasons:

We pop off the slice of readers to iterate, i.e. q = q[1:]. However, we do not clear the backing array reference to the reader.
When we close the reader, we don't clear the sliceioReader, which in turn holds the gob decoder.

Fixing either would eliminate the specific scan leak. Fix both, as I think it's the correct behavior.

jschellenberger

LGTM. Did you try running? Should I try running?

jcharum · 2020-06-11T16:30:41Z

LGTM. Did you try running? Should I try running?

I diagnosed and tested this using a test program.

You can patch this if you have urgent needs and don't want to wait for this to land internally.

prasadgopal · 2020-06-11T17:01:54Z

is it possible to write a test to make sure we don't undo this fix at some point? (if it is not too difficult)

jcharum · 2020-06-11T17:08:54Z

I did not come up with a good way of testing, as it is really an internal resource usage detail. Two possibilities:

Write a test that accesses internals. This does not meet my personal judgment for utility to fragility ratio.
Write a test that creates, uses, and closes many readers, forces a GC, and checks memory usage. This seems possibly fragile and I again question the utility to fragility ratio.

This goes for both the multiReader and openerAtReader changes. I'm open to debate and suggestion though.

josh-newman

I'm ok with landing this and improving testing as we think of better alternatives (as measured by utility vs. fragility).

Just to toss in another idea, we could write a test that scans some notably large objects, then uses a recursive, reflective object size estimator [1] to walk the scanner's references and make sure total size drops after close.

[1] For GRAILers: @jschellenberger made a prototype of something like this: D35649.

jcharum added 2 commits June 11, 2020 15:09

Set sliceioReader to nil so it can be GCed

722ecc8

Clear reader when we are done with it

53b367e

jcharum requested a review from josh-newman June 11, 2020 15:36

jschellenberger reviewed Jun 11, 2020

View reviewed changes

josh-newman approved these changes Jun 11, 2020

View reviewed changes

jcharum merged commit 5a96188 into grailbio:master Jun 11, 2020

jcharum deleted the scan-leak branch June 11, 2020 23:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Free ephemeral resources used for scanning #70

Free ephemeral resources used for scanning #70

jcharum commented Jun 11, 2020

jschellenberger left a comment

jcharum commented Jun 11, 2020 •

edited

Loading

prasadgopal commented Jun 11, 2020

jcharum commented Jun 11, 2020

josh-newman left a comment

Free ephemeral resources used for scanning #70

Free ephemeral resources used for scanning #70

Conversation

jcharum commented Jun 11, 2020

jschellenberger left a comment

Choose a reason for hiding this comment

jcharum commented Jun 11, 2020 • edited Loading

prasadgopal commented Jun 11, 2020

jcharum commented Jun 11, 2020

josh-newman left a comment

Choose a reason for hiding this comment

jcharum commented Jun 11, 2020 •

edited

Loading