Fix memory leak in pdfjs.js. #42

ltratt · 2016-10-02T13:17:58Z

A large amount of data is pushed into the global variable canvas_logs which isn't cleared in runPdfJS. On each iteration the list grows, eventually significantly so.

On a Linux machine with a recent-ish V8, it manages 2777 iterations before an allocation fails (at which point it's allocated over 2GiB of virtual memory, and used about 1.4Gib) and V8 crashes (Fatal error in CALL_AND_RETRY_LAST). From iteration ~800 onwards, things start slowing down gradually; at iteration ~1700, they start slowing down considerably; and at iteration ~2750, performance falls off a cliff; presumably as the GC comes under more and more strain. Here's a graph showing this (x axis=iteration number; y axis=time):

It can be a little hard to see what's going on because the last iteration or two are so slow and, in a sense, distort the rest of the graph. If I chop off the last 80 iterations, one can still see that odd things are happening:

By emptying canvas_logs at the end of each benchmark run, we make the
benchmark's performance more consistent. With this patch I can happily run this
benchmark in constant memory for 10000 iterations, with no discernible change in
iteration time over that.

A large amount of data is pushed into the global variable 'canvas_logs' which isn't cleared in runPdfJS. On each iteration the list grows, eventually significantly so. On a Linux machine with a recent-ish V8, it manages 2777 iterations before an allocation fails (at which point it's allocated over 2GiB of virtual memory, and used about 1.4Gib) and V8 crashes ("Fatal error in CALL_AND_RETRY_LAST"). From iteration ~800 onwards, things start slowing down gradually; at iteration ~1700, they start slowing down considerably; and at iteration ~2750, performance falls off a cliff; presumably as the GC comes under more and more strain. By emptying canvas_logs at the end of each benchmark run, we make the benchmark's performance more consistent. With this patch I can happily run this benchmark in constant memory for 10000 iterations, with no discernible change in iteration time over that.

natorion · 2016-10-14T09:12:13Z

@s3ththompson

woess · 2017-01-09T19:08:09Z

@ltratt I'm afraid emptying canvas_logs in runPdfJS skips the correctness checks in tearDownPdfJS.

ltratt · 2017-01-09T19:27:28Z

@woess Hello Andreas, yes, I think you're right. That said, IMHO, the current placement of the correctness checks means that one is forced to choose between a) predictable performance on each iteration b) correctness checks. Both options suck, though I think for a benchmark the former sucks marginally less. It would be nice to do both though!

I've had a quick look, and the code is doing something that I don't understand. If I try and move the correctness check into runPdfJS then on the first iteration canvas_logs[0].length == 6, which fails the check, but on all subsequent iterations canvas_logs[0].length == 36788... I do not pretend to understand why. Any thoughts?

woess · 2017-02-22T03:00:38Z

@ltratt Agreed. You could clear the canvas_logs array at the beginning of runPdfJS so that you'll at least have the last iteration checked (or limit the length of the array to a reasonable size). I guess the benchmark is just not designed to be run for so many iterations...

I don't see the canvas_logs[0].length == 6 behavior you described.

This means that the checks in tearDownPdfJs are still executed.

ltratt · 2017-05-22T09:44:45Z

OK, so I meant to look at this and then... forgot.

I've made the change that Andreas (@woess) suggested. I believe that it does the right thing: the benchmark no longer leaks memory; but the checks are still run.

I'd suggest I squash this before any possible merge, assuming people agree that this is the right fix.

ltratt · 2018-02-04T20:37:34Z

I think it's officially time to "ping" this one. Or we can let it die -- but, if I'm honest, I'd prefer someone to state explicitly that it won't be merged.

natorion · 2018-02-07T23:18:51Z

It won't be merged. Octane is retired and no longer maintained. Sorry for the long communication cycle.

ltratt force-pushed the master branch from 6999eec to 1eafbe9 Compare October 2, 2016 13:24

Move the canvas logs flushing to the beginning of unPdfJs.

e84337e

This means that the checks in tearDownPdfJs are still executed.

natorion closed this Feb 7, 2018

This was referenced Mar 12, 2019

Why Arent More Users More Happy With Our VMs? Part 2 guevara/read-it-later#2919

Open

Why Arent More Users More Happy With Our VMs? Part 2 guevara/read-it-later#2920

Open

Why Arent More Users More Happy With Our VMs? Part 2 guevara/read-it-later#2921

Open

guevara mentioned this pull request Sep 16, 2020

Why Arent More Users More Happy With Our VMs? Part 2 guevara/read-it-later#7023

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix memory leak in pdfjs.js. #42

Fix memory leak in pdfjs.js. #42

ltratt commented Oct 2, 2016 •

edited

Loading

natorion commented Oct 14, 2016

woess commented Jan 9, 2017

ltratt commented Jan 9, 2017

woess commented Feb 22, 2017

ltratt commented May 22, 2017

ltratt commented Feb 4, 2018

natorion commented Feb 7, 2018 •

edited

Loading

Fix memory leak in pdfjs.js. #42

Fix memory leak in pdfjs.js. #42

Conversation

ltratt commented Oct 2, 2016 • edited Loading

natorion commented Oct 14, 2016

woess commented Jan 9, 2017

ltratt commented Jan 9, 2017

woess commented Feb 22, 2017

ltratt commented May 22, 2017

ltratt commented Feb 4, 2018

natorion commented Feb 7, 2018 • edited Loading

ltratt commented Oct 2, 2016 •

edited

Loading

natorion commented Feb 7, 2018 •

edited

Loading