Improve throughput of read streams by transferring multiple records at once #70

Closed · vweevers opened this issue May 27, 2019 · 12 comments
Labels: benchmark (Requires or pertains to benchmarking), discussion (Discussion)


vweevers commented May 27, 2019

Working on the level-bench benchmarks got me thinking. Currently level-iterator-stream ignores the size argument of stream._read(size). Per tick it transfers only 1 db record from the underlying iterator to the stream's buffer. I think we can be smarter about this, by passing the knowledge that size records are requested all the way down to the db (and in the case of leveldown, down to the C++, potentially replacing its current read-ahead cache mechanism).

In pseudo-code it would look something like (ignore error handling for a moment):

// level-iterator-stream
// level-iterator-stream
ReadStream.prototype._read = function (size) {
  var self = this

  // Fetch <size> records from the db, then call "visit" repeatedly within a tick
  this._iterator.visit(size, function visit (record) {
    // Record is either null, an object { key, value }, or just a key or value
    self.push(record)
  })
}

This also avoids allocating 3 callback functions per record. Alternatively:

this._iterator.nextv(size, function (records) { // aka "chunks" in streams
  for (let record of records) self.push(record)
})

Or if streams were to get a .pushv method similar to .writev:

this._iterator.nextv(size, function (records) {
  self.pushv(records)
})

/cc @mcollina: could such an API be faster? I'm also wondering how _read() behaves in an asyncIterator. Is size always 1 in that case, or does the stream read ahead?

@peakji @ralphtheninja /cc @kesla

@vweevers vweevers added enhancement New feature or request discussion Discussion labels May 27, 2019
@vweevers vweevers added this to Backlog in Level (old board) via automation May 27, 2019
@vweevers vweevers added benchmark Requires or pertains to benchmarking and removed enhancement New feature or request labels May 29, 2019

peakji commented Jun 14, 2019

Respecting size is good for streams, but I guess moving from an eager read-ahead cache to an on-demand read-ahead cache won't make much of a difference, because it cannot reduce the number of times we cross the C++ / JS boundary?

@mcollina

Getting more than one item at the same time will significantly increase throughput.

Is size always 1 in that case, or does the stream read ahead?

Streams read ahead according to highWaterMark. Essentially it would try to read 16 entries by default.


On-demand read-ahead will guarantee some performance speedup because it reduces the time an object lives on the heap: as a result, the GC ends up doing less work. I would recommend implementing this anyway.

@vweevers

Error-handling included, I propose the following API:

iterator.nextv(size, function (err, records) {
  if (err) // errored
  if (records.length === 0) // reached end

  for (let record of records) // ..
})
  • If size < 0 then nextv yields all records (or as many as the limit option allows) (there is no default safeguard)
  • If size === 0 then nextv yields 0 records (it's illegal to call nextv(0) more than once)
  • nextv respects the boolean keys and values options passed to the iterator ctor. If both are true, record will be a { key, value } object. Else record will be either a key or value.

@vweevers

We can play with this idea in leveldown: introduce a nextv method on the iterator, combine it with a temporary fork of level-iterator-stream that utilizes nextv, then benchmark it.

It might increase the number of times we cross the C++ / JS boundary, especially for small records, because the highWaterMark of streams in objectMode is measured in number of records, while the highWaterMark of leveldown's iterator is measured in bytes. IMO this does not matter because both these parameters can be tweaked by the user as necessary. Would it warrant semver-major though?

@MeirionHughes

What changes to leveldown would this require? Changing the C++ iterator cache size (per iterator) to match the nextv(size) argument?


vweevers commented Jun 5, 2020

Depends on what we want to do with the highWaterMark (in bytes) logic. I'm now wondering, in current code, whether db.createReadStream({ highWaterMark }) passes highWaterMark to both the iterator and the stream. That would be a problem.


Raynos commented Jun 5, 2020

👍

We implement iterator.batchNext() instead of iterator.nextv() in a userland library. We use the undocumented iterator.cache field.

When creating a leveldown iterator we set the high water mark to 1024 * 1024 aka 1MB to make sure the cache is populated.

Implementing a native nextv() in leveldown so that we don't access the private .cache field would be nice.
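For illustration, the batchNext() trick might look roughly like this. The cache layout used here (an in-order array of { key, value } entries) is an assumption; leveldown's real private cache is undocumented and may store entries differently:

```javascript
'use strict'

// Hypothetical batchNext(): read one record the normal way, then drain any
// records the native side has already prefetched into `iterator.cache`.
function batchNext (iterator, callback) {
  iterator.next(function (err, key, value) {
    if (err) return callback(err)
    if (key === undefined && value === undefined) return callback(null, []) // end

    const records = [{ key, value }]

    // Each entry taken from the cache saves one C++ / JS boundary crossing.
    while (iterator.cache && iterator.cache.length > 0) {
      records.push(iterator.cache.shift())
    }

    callback(null, records)
  })
}
```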


MeirionHughes commented Jun 5, 2020

I didn't even know you could specify it in the iterator options (it's not in the docs). Is highWaterMark meant to be undocumented? Also, is this something that is (or can be) made uniform across AbstractLevelDown implementations?


vweevers commented Jun 5, 2020

@MeirionHughes It's missing in docs (Level/leveldown#468).

Also is this something that is (or can be) made uniform across AbstractLevelDown implementations?

Only leveldown and rocksdb can support the byte-based hwm.


vweevers commented Jun 7, 2020

How would y'all feel about altogether removing the hwm measured in bytes, in favor of a nextv() measured in number of records? Every abstract-leveldown implementation can support that.


Raynos commented Jun 7, 2020

👍 for nextv()

I think removing highWaterMark in leveldown itself is a breaking change in terms of default performance. You will want to benchmark the throughput and/or the number of C++ <-> JS boundary crossings in leveldown with and without highWaterMark, to make sure there's no performance regression in removing it.

Alternatively phrased, if there's no highWaterMark in leveldown we might want to implement the next method in terms of calling nextv and returning the first key/value pair and assigning the remainder into the existing iterator.cache field.
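The fallback described here could be sketched as follows, with next() layered on top of nextv() and surplus records parked in a cache. The class and field names are illustrative, not leveldown's actual internals:

```javascript
'use strict'

// Illustrative iterator: next() is implemented in terms of a batched
// nextv(), so per-record reads keep their performance by consuming the
// remainder of each batch from a local cache.
class CachedIterator {
  constructor (nextv) {
    this._nextv = nextv // underlying batched read, e.g. a native nextv()
    this._cache = []    // surplus records from the last batch
    this._ended = false
  }

  next (callback) {
    if (this._cache.length > 0) {
      const { key, value } = this._cache.shift()
      return process.nextTick(callback, null, key, value)
    }

    if (this._ended) return process.nextTick(callback)

    this._nextv(16, (err, records) => {
      if (err) return callback(err)
      if (records.length === 0) {
        this._ended = true
        return callback() // end of iterator
      }
      const { key, value } = records[0]
      this._cache = records.slice(1) // keep the rest for later next() calls
      callback(null, key, value)
    })
  }
}
```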

@vweevers vweevers removed this from Backlog in Level (old board) Dec 4, 2021
vweevers added a commit to Level/read-stream that referenced this issue Jan 29, 2022
TODO: benchmarks, decide on default highWaterMark.

Ref Level/community#70
Ref #1
vweevers added a commit to Level/read-stream that referenced this issue Jan 30, 2022
vweevers added a commit to Level/classic-level that referenced this issue Feb 19, 2022
On the C++ side:

- Replace asBuffer options with encoding options
- Refactor iterator_next to work for nextv(). We already had a
  iterator.ReadMany(size) method in C++, with a hardcoded size.
  Now size is taken from the JS argument to _nextv(size). The
  cache logic for next() is the same as before.
  Ref Level/community#70
  Ref Level/abstract-level#12
- Use std::vector<Entry> in iterator.cache_ instead of
  std::vector<std::string> so that the structure of the cache
  matches the desired result of nextv() in JS.

On the JS side:

- Use classes for ChainedBatch, Iterator, ClassicLevel
- Defer approximateSize() and compactRange()
- Encode arguments of approximateSize() and compactRange(). Ref
  Level/community#85
- Add promise support to additional methods
- Remove tests that were copied to abstract-level.

This is most of it; a few more changes are needed in follow-up
commits.
vweevers added a commit to Level/classic-level that referenced this issue Feb 19, 2022
To remove a conflict with streams. Also adds documentation.

Ref Level/leveldown#468
Ref Level/community#70
@vweevers

Done in abstract-level, memory-level and classic-level (not on npm yet), when combined with level-read-stream.

When not using streams, you can still benefit from the new machinery by using nextv(). And on classic-level, next() has the same performance characteristics as before (on leveldown). As for hwm, there are 2 options now: Level/classic-level#1.

vweevers added a commit to Level/classic-level that referenced this issue Feb 26, 2022
To remove a conflict with streams. Also adds documentation.

Ref Level/leveldown#468
Ref Level/community#70