perf(runtime/fs): optimize readFile by using a single large buffer #12057
Conversation
Accidentally committed; denoland#12056 is the PR that adds `read_128k_sync`.
Not in favor of this. File reads are not atomic. We can't start optimizing at the cost of correctness.
My point is more that there's no sane use case for writing to a file and reading it all at the same time when you don't control the rate of reading; it's uncommon and IMO would be mostly fine as is. But as I mentioned, we can make this more robust:
essentially the current implementation becomes a fast path for an unchanged file, falling back to the slow path if the file was extended, so this wouldn't hurt or change correctness.
With fallback, SGTM
FWIW, Node.js does exactly this: https://github.com/nodejs/node/blob/bbd4c6eee90ef600022c72316ba82d37a2ae16bc/lib/fs.js#L326-L349
Allocate an extra byte in our read buffer to detect "overflow", then fall back to an unsized readAll for the remainder of the extended file. This is a slow path that should rarely happen in practice.
I still think it would be sane/fair not to read beyond the stat'd size, since I can't imagine a sane use case for it, but I implemented the slow-path fallback anyway. @bartlomieju @lucacasonato That should address your concerns.
```ts
if (cursor > size) {
  // Read remaining and concat
  return concatBuffers([buf, readAllSync(r)]);
} else { // cursor == size
```
Suggested change:
```diff
- } else { // cursor == size
+ } else { // cursor <= size
```
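To make the approach being reviewed concrete, here is a minimal, self-contained TypeScript sketch of the technique: read into a buffer sized from `stat` plus one extra byte, and fall back to an unsized read-all only if that extra byte gets filled. This is not the PR's actual code; `concatBuffers` and `readAllSync` are hypothetical stand-ins for the runtime's internal helpers, and `Deno.openSync`/`statSync`/`readSync` are the assumed I/O APIs.

```ts
// Hypothetical helper: concatenate byte buffers into one.
function concatBuffers(buffers: Uint8Array[]): Uint8Array {
  const total = buffers.reduce((n, b) => n + b.length, 0);
  const out = new Uint8Array(total);
  let offset = 0;
  for (const b of buffers) {
    out.set(b, offset);
    offset += b.length;
  }
  return out;
}

// Hypothetical helper: the unsized slow path, growing chunk by chunk.
function readAllSync(r: { readSync(p: Uint8Array): number | null }): Uint8Array {
  const chunks: Uint8Array[] = [];
  const chunk = new Uint8Array(16 * 1024);
  let n: number | null;
  while ((n = r.readSync(chunk)) !== null && n > 0) {
    chunks.push(chunk.slice(0, n)); // each chunk is a fresh allocation + copy
  }
  return concatBuffers(chunks);
}

function readFileSync(path: string): Uint8Array {
  const file = Deno.openSync(path);
  try {
    const size = file.statSync().size;
    // One extra byte lets us detect that the file grew between stat()
    // and the reads ("overflow"); files that stat as size 0 also land
    // on the slow path this way.
    const buf = new Uint8Array(size + 1);
    let cursor = 0;
    while (cursor < buf.length) {
      const n = file.readSync(buf.subarray(cursor));
      if (n === null || n === 0) break; // EOF
      cursor += n;
    }
    if (cursor > size) {
      // The overflow byte was filled: the file was extended, so read
      // the rest via the slow path and concatenate (rare in practice).
      return concatBuffers([buf.subarray(0, cursor), readAllSync(file)]);
    }
    // cursor <= size: the common single-allocation fast path.
    return buf.subarray(0, cursor);
  } finally {
    file.close();
  }
}
```

The sentinel byte detects growth without a second `fstat` syscall, which keeps the fallback check off the hot path.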
LGTM
This avoids allocating N buffers when reading entire files, and avoids the extra copy needed to concatenate them.
Benchmarks
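As a rough way to compare the two strategies, one could use `Deno.bench`; the file path below is a placeholder, and the `readFileSync`/`readAllSync` sketches from the review thread above are assumed to be in scope.

```ts
// Hypothetical micro-benchmark: sized single-buffer read vs. growing read-all.
const path = "./testdata/128k.bin"; // placeholder ~128 KiB test file

Deno.bench("read 128k, single sized buffer", () => {
  readFileSync(path);
});

Deno.bench("read 128k, growing readAll", () => {
  const file = Deno.openSync(path);
  try {
    readAllSync(file);
  } finally {
    file.close();
  }
});
```

Run with `deno bench --allow-read`.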
Notes
- Gains in `read_128k_sync` are somewhat reduced here by the extra cwd lookups caused by the `stat`, so perf(runtime): cache cwd lookups #12056 will work hand in hand with this change to improve file reads.
- `readTextFile`/`readTextFileSync` benefit as well, since they go through the same read path.