Implement flattened doc storage #2539

eiri · 2020-02-11T16:34:40Z

Overview

Implementation of "flattened" doc storage format to Document Storage RFC

Testing recommendations

$ make eunit apps=couch suites=couch_util tests=json_explode_test_,json_implode_test_

$ make eunit apps=couch_jobs,couch_views,fabric

Most of these tests are for quorum and clustered response handling which will no longer exist with FoundationDB. Eventually we'll want to go through these and pick out anything that is still applicable and ensure that we re-add them to the new test suite.

This provides a base implementation of a fabric API backed by FoundationDB. While a lot of functionality is provided there are a number of places that still require work. An incomplete list includes: 1. Document bodies are currently a single key/value 2. Attachments are stored as a range of key/value pairs 3. There is no support for indexing 4. Request size limits are not enforced directly 5. Auth is still backed by a legacy CouchDB database 6. No support for before_doc_update/after_doc_read 7. Various implementation shortcuts need to be expanded for full API support.

This provides a good bit of code coverage for the new implementation. We'll want to expand this to include relevant tests from the previous fabric test suite along with reading through the various other tests and ensuring that we cover the API as deeply as is appropriate for this layer.

This is not an exhaustive port of the entire chttpd API. However, this is enough to support basic CRUD operations far enough that replication works.

This still holds all attachment data in RAM which we'll have to revisit at some point.

When uploading an attachment we hadn't yet flushed data to FoundationDB which caused the md5 to be empty. The `new_revid` algorithm then declared that was because it was an old style attachment and thus our new revision would be a random number. This fix just flushes our attachments earlier in the process of updating a document.

I was accidentally skipping this step around properly serializing/deserializing attachments. Note to self: If someon specifies attachment headers this will likely break when we attempt to pack the value tuple here.

The older chttpd/fabric split configured filters as one step in the coordinator instead of within each RPC worker.

This fixes the behavior when validating a document update that is recreating a previously deleted document. Before this fix we were sending a document body with `"_deleted":true` as the existing document. However, CouchDB behavior expects the previous document passed to VDU's to be `null` in this case.

This was a remnant before we used a version per database.

This changes `chttpd_auth_cache` to use FoundationDB to back the `_users` database including the `before_doc_update` and `after_doc_read` features.

RFC: apache/couchdb-documentation#409 Main API is in the `couch_jobs` module. Additional description of internals is in the README.md file.

Neither partitioned databases or shard splitting will exist in a FoundationDB layer.

This adds the mapping of CouchDB start/end keys and so on to the similar yet slightly different concepts in FoundationDB. The handlers for `_all_dbs` and `_all_docs` have been udpated to use this new logic.

The existing logic around return codes and term formats is labyrinthine. This is the result of much trial and error to get the new logic to behave exactly the same as the previous implementation.

Simple function change to `fabric2_db:name/1`

Previously I was forgetting to keep the previous history around which ended up limiting the revision depth to two.

The old test got around this by using couch_httpd_auth cache in its tests which is fairly odd given that we run chttpd_auth_cache in production. This fixes that mistake and upgrades chttpd_auth_cache so that it works in the test scenario of changing the authentication_db configuration.

This API allows for listing all database info blobs in a single request. It accepts the same parameters as `_all_dbs` for controlling pagination of results and so on.

Previously only `POST` with a list of keys was supported. The new `GET` support just dumps all database info blobs in a single ordered response.

Previously changes feeds would fail if they streamed data for more than five seconds. This was because of the FoundationDB's transaction time limit. After the timeout fired, an 1007 (transaction_too_long) error was raised, and transaction was retried. The emitted changes feed would often crash or simple hang because the HTTP state would be garbled as response data was re-sent over the same socket stream again. To fix the issue introduce a new `{restart_tx, true}` option for `fold_range/4`. This option sets up a new transaction to continue iterating over the range from where the last one left off. To avoid data being resent in the response stream, user callback functions must first read all the data they plan on sending during that callback, send it out, and then after that it must not do any more db reads so as not to trigger a `transaction_too_old` error.

Index builder performs writes in the same transaction as the changes feed so we can't use iterators as they disable writes.

I accidentally ported part of the old couch_att test suite into an actual "feature" that's not actually accessible through any API.

This tracks the number of bytes that would be required to store the contents of a database as flat files on disk. Currently the following items are tracked: * Doc ids * Revisions * Doc body as JSON * Attachment names * Attachment type * Attachment length * Attachment md5s * Attachment headers * Local doc id * Local doc revision * Local doc bodies

Versionstamp sequences should always be binaries when retrieved from a rev info map.

Previously each doc was read in a separate transaction. It turns out that size limits do not apply to read-only transactions so we don't have to worry about that here. Also transaction restart are already implemented so we don't have to worry about timeout either.

We already handle them in couch_jobs_type_monitor so let's do it in `couch_jobs:wait_pending` as well. Recent fixes in FDB 6.2 didn't completely fix the issue and ther are still spurious 1009 errors dumped in the logs. They seem to be benign as far as couch_jobs operation goes as type monitor code already showed, so let's not pollute the logs with them.

Previously, if the metadata key is bumped in a transaction, the same transaction could not be used to add jobs with `couch_jobs`. That's because metadata is a versionstamped value, and when set, it cannot be read back until that transaction has committed. In `fabric2_fdb` there is a process dict key that is set which declares that metadata was already read, which happens before any db update, however `couch_jobs` uses it's own caching mechanism and doesn't know about that pdict key. Ideally we'd implement a single `couch_fdb` module to be shared between `couch_jobs` and `fabric2_db` but until then it maybe simpler to just let `couch_jobs` use its own metadata key. This way, it doesn't get invalidated or bumped every time dbs get recreated or design docs are updated. The only time it would be bumped is if the FDB layer prefix changed at runtime.

It's possible for other couch_epi plugins to interfere with this test, so mock `couch_epi:decide/5` to always return `no_decision`.

We started to emit that in CouchDB 4.x for temporary views and possibly other endpoints.

eiri · 2020-04-14T15:20:49Z

Closing as obsolete.

davisp and others added 30 commits July 31, 2019 11:55

Update build system for FoundationDB

609a45d

Update ddoc_cache to use fabric2

9083da6

Start switching chttpd HTTP endpoints to fabric2

0cf5f46

This is not an exhaustive port of the entire chttpd API. However, this is enough to support basic CRUD operations far enough that replication works.

Remove debug logging

716d5b3

Implement attachment compression

c4f1182

This still holds all attachment data in RAM which we'll have to revisit at some point.

Fix fabric2_txids:terminate/2

ad31f51

Convert attachment info to disk terms correctly

bc8007b

I was accidentally skipping this step around properly serializing/deserializing attachments. Note to self: If someon specifies attachment headers this will likely break when we attempt to pack the value tuple here.

Allow for previously configured filters

f7a790e

The older chttpd/fabric split configured filters as one step in the coordinator instead of within each RPC worker.

Database config changes should bump the db version

5e12e06

This was a remnant before we used a version per database.

Implement _users db authentication

3931685

This changes `chttpd_auth_cache` to use FoundationDB to back the `_users` database including the `before_doc_update` and `after_doc_read` features.

Update get security to use fabric2

d16cb14

Fix arity in changes timeout callback

920e1ff

Fix exception in cache auth doc update

b9ee168

CouchDB background jobs

0c2d674

RFC: apache/couchdb-documentation#409 Main API is in the `couch_jobs` module. Additional description of internals is in the README.md file.

Remove tests for deprecated features.

40561bc

Neither partitioned databases or shard splitting will exist in a FoundationDB layer.

Implement _all_dbs/_all_docs API parameters

a8e306d

This adds the mapping of CouchDB start/end keys and so on to the similar yet slightly different concepts in FoundationDB. The handlers for `_all_dbs` and `_all_docs` have been udpated to use this new logic.

Fix bulk docs error reporting

633d894

The existing logic around return codes and term formats is labyrinthine. This is the result of much trial and error to get the new logic to behave exactly the same as the previous implementation.

Fix COPY method

bf9fa0a

Simple function change to `fabric2_db:name/1`

Fix revision tree extensions

e5fefbe

Previously I was forgetting to keep the previous history around which ended up limiting the revision depth to two.

Implement POST /_dbs_info

7696999

Fix formatting of all_docs_test.exs

79ea59e

Disable broken couch_att tests

8e574e9

Expose ICU ucol_getSortKey

d42d9b7

Fix more elixir tests

7a3bfe6

davisp and others added 18 commits February 13, 2020 16:25

Implement fabric2_db:list_dbs_info/1,2,3

416de8d

This API allows for listing all database info blobs in a single request. It accepts the same parameters as `_all_dbs` for controlling pagination of results and so on.

Support GET /_dbs_info endpoint

31878f8

Previously only `POST` with a list of keys was supported. The new `GET` support just dumps all database info blobs in a single ordered response.

Use {restart_tx, false} option in view index builder changes feed

81fa3ee

Index builder performs writes in the same transaction as the changes feed so we can't use iterators as they disable writes.

Remove attachment headers field

8bb8e70

I accidentally ported part of the old couch_att test suite into an actual "feature" that's not actually accessible through any API.

Add tests for database size tracking

c4bbae4

Convert versionstamps to binaries

0292f0e

Versionstamp sequences should always be binaries when retrieved from a rev info map.

Test coverage: list_dbs and list_dbs_info

e60ff85

Test coverage: get_full_doc_info

dae118b

Test coverage: validate_dbname, validate_docid

2b1a5d7

Test coverage: apply_open_doc_opts

1fd40ce

Sync Makefile with master (#2566)

951cfd1

Improve validate_dbname test

8773386

It's possible for other couch_epi plugins to interfere with this test, so mock `couch_epi:decide/5` to always return `no_decision`.

Add 410 status code to stats_descriptions

d0ab91e

We started to emit that in CouchDB 4.x for temporary views and possibly other endpoints.

eiri force-pushed the prototype/flattened-doc-storage branch from 1c97a07 to ae3d86c Compare February 21, 2020 17:53

eiri added 6 commits February 21, 2020 14:45

Add JSON explode/implode function

6f705cc

Fix invalid json object in fabric2_doc_crud test

8fec352

Use flattened JSON for store doc in fdb

04a88e3

Add migration from the old doc storage format

168f105

Bit of refactoring in json_explode

92580fe

Flip Deleted in a doc key

26097d6

eiri force-pushed the prototype/flattened-doc-storage branch from ae3d86c to 26097d6 Compare February 21, 2020 18:45

davisp force-pushed the prototype/fdb-layer branch from b3bd36b to bdd0578 Compare March 2, 2020 22:53

eiri closed this Apr 14, 2020

eiri deleted the prototype/flattened-doc-storage branch April 14, 2020 15:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement flattened doc storage #2539

Implement flattened doc storage #2539

eiri commented Feb 11, 2020

eiri commented Apr 14, 2020

Implement flattened doc storage #2539

Implement flattened doc storage #2539

Conversation

eiri commented Feb 11, 2020

Overview

Testing recommendations

eiri commented Apr 14, 2020