Currently `rethinkdb export` runs `r.table(...).count()` and then `r.table(...)`. As it reads documents from the stream returned by `r.table(...)`, it counts the number of documents; then it uses this value as the numerator of the progress fraction, with the result of `r.table(...).count()` as the denominator.

This means that the server must traverse the entire data set twice: once to count the number of documents, and again to stream them to the client. For a data set that doesn't fit into RAM, this could make `rethinkdb export` take twice as long as it needs to. Until the first traversal is finished, `rethinkdb export` will report 0% progress.

We should consider using a distribution query to estimate the denominator for the progress fraction instead. We can do this by running `r.table(...).info()['doc_count_estimates'].sum()`. This won't give exact results, but it will run in constant time instead of reading the entire table.
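One consequence of an estimated denominator is that the count of streamed documents can exceed the estimate near the end of the export, so the progress fraction should be clamped. A minimal sketch of that logic (the `progress_fraction` helper is hypothetical, not part of `rethinkdb export`; the commented ReQL line assumes a driver connection `conn` and a table name, purely for illustration):

```python
def progress_fraction(docs_read, estimated_total):
    """Progress fraction against an inexact denominator.

    Because doc_count_estimates is only an estimate, docs_read may
    overshoot it as the export finishes; clamp at 1.0 so the tool
    never reports more than 100% progress.
    """
    if estimated_total <= 0:
        # No usable estimate (e.g. empty or brand-new table).
        return 0.0
    return min(docs_read / float(estimated_total), 1.0)

# With the Python driver, the estimate itself would be fetched
# roughly like this (hypothetical connection `conn`):
#   estimated_total = (
#       r.table('users').info()['doc_count_estimates'].sum().run(conn)
#   )
```

Unlike `r.table(...).count()`, the estimate is available immediately, so the export can report a nonzero fraction from the first batch of documents onward.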