
fs: do bulk file reads to optimize cache extraction #3539

Merged (2 commits) on May 31, 2017

Conversation

@zkat (Contributor) commented May 30, 2017

Summary

[screenshot, 2017-05-30 13:54]

This patch speeds up cache extraction by roughly 2x or more by letting node do more of the parallelization work. It makes nearly all of the file-copy work happen in the C++ code with minimal boundary-crossing (at least compared to node streams).

Streams in node.js are ~3x slower than just doing fs.writeFile/readFile, especially for small files, because of this boundary. This is something Yarn might want to take into account in other places.

The reason this is OK is that pretty much any file this would handle fits neatly into memory (any npm package MUST fit into memory by definition, because of the way npm@<5 does extraction).

If you really want to make doubleplus sure to minimize memory usage, you could do an fs.stat to find the file size and use a heuristic that only falls back to streams for files bigger than <X> MB.
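
Roughly, the idea looks like this (a minimal standalone sketch, not the actual patch; `copyFileBulk` and `STREAM_THRESHOLD` are made-up names, and the 1 MB cutoff is just an arbitrary placeholder for the `<X>` above):

```js
const fs = require('fs');
const {promisify} = require('util');

const readFile = promisify(fs.readFile);
const writeFile = promisify(fs.writeFile);
const stat = promisify(fs.stat);

// Hypothetical cutoff; the description above leaves the exact threshold open.
const STREAM_THRESHOLD = 1024 * 1024;

async function copyFileBulk(src, dest) {
  const {size} = await stat(src);
  if (size > STREAM_THRESHOLD) {
    // Large file: stream it so the whole thing is never held in memory.
    return new Promise((resolve, reject) => {
      const rd = fs.createReadStream(src);
      const wr = fs.createWriteStream(dest);
      rd.on('error', reject);
      wr.on('error', reject);
      wr.on('close', resolve);
      rd.pipe(wr);
    });
  }
  // Small file: one read + one write, minimal JS<->C++ boundary crossing.
  const data = await readFile(src); // no encoding argument => raw Buffer
  return writeFile(dest, data);
}
```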

Test plan

I guess I should probably spend more time on making sure this passes the test suite, but it's a pretty benign patch, imo 👍

@zkat (Contributor, Author) commented May 30, 2017

It looks like the test suite is just generally not passing right now? If anyone can point me to the file(s) I should look at in case there's new breakage, I'll look into fixing it ❤️

events.onProgress(data.dest);
cleanup();
},
err => {

This version seems to swallow the error while the original version would re-throw it after the cleanup(). Am I mis-reading this?

@zkat (Contributor, Author):

No you read that right. That's my mistake. Fixed 😁

Also dropped the forwarding of the resolved value in the resolve handler. Maybe it's not actually used anyway?
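
For clarity, the shape being discussed would need to look roughly like this (a hypothetical reconstruction, not the exact diff; `copyPromise` stands in for whatever promise the handlers are attached to):

```js
copyPromise.then(
  result => {
    events.onProgress(data.dest);
    cleanup();
    return result; // forward the resolved value instead of dropping it
  },
  err => {
    cleanup();
    throw err; // re-throw so the failure still propagates to the caller
  },
);
```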

@Daniel15 (Member):

Would be interesting to benchmark this compared to #3290 (cc @sciolist). The benefit of this approach is that it doesn't need any native code.

@zkat (Contributor, Author) commented May 30, 2017

I can only assume my version is slower (another reason why I wasn't sure whether to PR it). It probably comes down to how much complexity you want to add (to maintenance, deployment, etc) vs raw perf.

It might also be worth benchmarking it on the latest node 8, since this patch will go faster in step with libuv perf boosts.

@Daniel15 (Member):

> I can only assume my version is slower

Yeah, I'm just curious about how large the difference is. This patch is still a good idea, even if it's not as fast as using native code 😃 We'll probably always have some "pure JS" fallback in case the native code doesn't work for whatever reason.

@zkat (Contributor, Author) commented May 31, 2017

This is what me shaking my fist at u looks like btw. ✊ 🔥

[screenshot, 2017-05-30 23:16]

@sciolist (Contributor):

I ran it through my test project and got:

yarn master: 25s
zkat: 14s
fcopy: 9s

I think this is well worth it, considering it requires no native code.

Keeping 16 full files in memory rather than the streams should be okay. I'm not quite sure what kind of file sizes there are in the larger node modules; I have a list of some pretty big ones somewhere that I've used as test cases in a few projects, but I can't check them right now. :)
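
For context, the "16 full files" refers to the copy concurrency cap; a rough illustration of the idea (names are made up, and `copyFileBulk` is the buffered copy sketched earlier, not Yarn's actual helper):

```js
// Illustrative only: with a worker pool of 16, at most 16 whole-file
// buffers are in flight (and therefore held in memory) at any one time.
const CONCURRENCY = 16;

async function copyAll(jobs) {
  const queue = jobs.slice(); // jobs: Array of {src, dest}
  const worker = async () => {
    for (let job = queue.shift(); job; job = queue.shift()) {
      await copyFileBulk(job.src, job.dest);
    }
  };
  await Promise.all(Array.from({length: CONCURRENCY}, worker));
}
```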

src/util/fs.js Outdated
resolve();
}
reporter.verbose(reporter.lang('verboseFileCopy', data.src, data.dest));
return (currentlyWriting[data.dest] = readFile(data.src)

You need to use readFileBuffer, otherwise the output will be a utf8 string instead of a raw buffer, which will corrupt the data if it happens to contain invalid utf8 sequences (hence the corrupted-tarball warnings in the tests). On the plus side, using buffers should help make your PR even faster! 😃
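
For anyone following along, this is the underlying fs behaviour the comment is pointing at (readFileBuffer itself is a Yarn helper; the encoding distinction below is plain Node):

```js
const fs = require('fs');

// With an encoding, Node decodes the bytes into a string; invalid UTF-8
// sequences get replaced, which silently corrupts binary data like tarballs.
fs.readFile('package.tgz', 'utf8', (err, text) => {
  // `text` is a string; writing it back out may not reproduce the original bytes.
});

// Without an encoding, Node returns the raw Buffer, byte for byte.
fs.readFile('package.tgz', (err, buf) => {
  // `buf` is a Buffer; writing it back preserves the original bytes exactly.
});
```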

@arcanis (Member) commented May 31, 2017

Thanks a lot! Really nice optimization you found there 👍

@arcanis merged commit 7a63e0d into yarnpkg:master on May 31, 2017
@cpojer (Contributor) commented May 31, 2017

This is awesome. Thank you so much @zkat.

@Daniel15 (Member) commented Jun 6, 2017

Yeah, thanks for this!

One thing I just thought of (that @cpojer kinda reminded me of) is that using native file copying would provide a pretty large speed boost on any system that uses a copy-on-write filesystem such as ZFS or Btrfs (or Apple's new APFS, apparently). This is something we can't do with a pure JS implementation, as the filesystem has no idea that we're actually copying a file; it just sees a read followed by a write. We'd need native code like in #3290 to take advantage of CoW. On the other hand, maybe simply generating a shell script with all the required cp --reflink=auto commands and exec'ing it would be more maintainable than native code, since Node.js' handling of native modules is notoriously bad. 😛
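
Roughly, that shell-out idea would look something like this (purely a sketch; `reflinkCopy` is a made-up name, `cp --reflink=auto` is GNU coreutils so this exact invocation is Linux-specific, and macOS/APFS would need a different mechanism such as clonefile(2)):

```js
const {execFile} = require('child_process');

// Illustrative only: delegate the copy to the OS so that copy-on-write
// filesystems can clone blocks instead of duplicating the data.
function reflinkCopy(src, dest) {
  return new Promise((resolve, reject) => {
    execFile('cp', ['--reflink=auto', '--', src, dest], err => {
      err ? reject(err) : resolve();
    });
  });
}
```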

@zkat (Contributor, Author) commented Jun 6, 2017

See nodejs/node#12902 and libuv/libuv#925 for the node-side issues on CoW support in node.

@ronkorving:

Help on making libuv/libuv#925 happen would be deeply appreciated :)
