Add operational guide on tuning availability, consistency, and durability + make ignoring corrupt commitlogs on bootstrap default in all sample YAMLs #1491

Merged

richardartoul merged 10 commits into master from ra/availability-consistency-durability

Mar 26, 2019

Contributor

richardartoul commented Mar 23, 2019

No description provided.

richardartoul requested review from robskillington, schallert, arnikola, benraskin92, mway and prateek

March 23, 2019 18:56

codecov bot commented Mar 23, 2019

Codecov Report

Merging #1491 into master will decrease coverage by 27.3%.
The diff coverage is n/a.

@@            Coverage Diff            @@
##           master   #1491      +/-   ##
=========================================
- Coverage    70.9%   43.5%   -27.4%     
=========================================
  Files         842     829      -13     
  Lines       71918   70192    -1726     
=========================================
- Hits        51021   30570   -20451     
- Misses      17561   36741   +19180     
+ Partials     3336    2881     -455

| Flag | Coverage Δ | |
|---|---|---|
| #aggregator | `58.3% <0%> (-24.1%)` | ⬇️ |
| #cluster | `30.2% <0%> (-55.7%)` | ⬇️ |
| #collector | `39.1% <0%> (-24.6%)` | ⬇️ |
| #dbnode | `68.6% <0%> (-12.2%)` | ⬇️ |
| #m3em | `44.1% <0%> (-29.1%)` | ⬇️ |
| #m3ninx | `48.9% <0%> (-25.4%)` | ⬇️ |
| #m3nsch | `100% <0%> (+48.8%)` | ⬆️ |
| #metrics | `17.5% <0%> (ø)` | ⬆️ |
| #msg | `74.9% <0%> (ø)` | ⬆️ |
| #query | `1.5% <0%> (-64.6%)` | ⬇️ |
| #x | `42.3% <0%> (-34.5%)` | ⬇️ |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update caff9d4...6b5836f.

mway reviewed

Collaborator

mway left a comment

this is really great - thanks for writing this up! a few requests and a bunch of nits (some super pedantic) - feel free to take or leave most of them.

mway reviewed

Collaborator

mway left a comment

this is looking great! thanks for the updates - a couple of super minor nits and then i think this is good to go.

docs/operational_guide/availability_consistency_durability.md Outdated

    
              ### Client Write and Read Consistency

              The possible configuration values for write and read consistency are discussed in more detail [in this section](../m3db/architecture/consistencylevels.md) of the documentation, but in short M3DB behaves similarly to other H.A systems with configurable consistency such as Cassandra that allow the caller to control the consistency level of writes and reads from the client.

              The possible configuration values for write and read consistency are discussed in more detail in [the Consistency Levels section](../m3db/architecture/consistencylevels.md). In short, M3DB behaves similarly to other HA systems with configurable consistency such as Cassandra that allow the caller to control the consistency level of writes and reads from the client.

Collaborator

mway Mar 24, 2019

nit:

with configurable consistency, such as Cassandra, which allow [...]

docs/operational_guide/availability_consistency_durability.md Outdated

    
              ### Commitlog Configuration

              By default M3DB runs with an asynchronous commitlog such that writes will be acknowledged as successful by the client even though the data may not have been physically flushed to the commitlog on disk yet. M3DB supports changing this default behavior to run the commitlog synchronously, but this is not currently exposed to users in the YAML configuration and generally leads to a massive performance degradation.

              By default M3DB runs with an asynchronous commitlog such that writes will be reported as successful by the client, though the data may not have been flushed to disk yet.

Collaborator

mway Mar 24, 2019

nit:

By default, M3DB [...]

docs/operational_guide/availability_consistency_durability.md

    
              ### Commitlog Configuration

              By default M3DB runs with an asynchronous commitlog such that writes will be reported as successful by the client, though the data may not have been flushed to disk yet.

              M3DB supports changing this default behavior to run the commitlog synchronously, but this is not currently exposed to users in the YAML configuration and generally leads to a massive performance degradation.

              We recommend running M3DB with an asynchronous commitlog.

Collaborator

mway Mar 24, 2019

it might be worth explaining this a little more concretely. there is a config snippet below, but it's not immediately clear how exactly that controls the (a)synchronicity of the commitlog.

Collaborator

benraskin92 Mar 25, 2019

+1

docs/operational_guide/availability_consistency_durability.md Outdated


		This instructs M3DB to handle writes for new timeseries (for a given time block) asynchronously. Creating a new timeseries in memory is much more expensive than simply appending a new write to an existing series, so the default configuration of creating them asynchronously improves M3DB's write throughput significantly when many new series are being created all at once.

		However, since new time series are created asynchronously, it's possible that there may be a brief delay between when a write is acknowledged by the client and when that series becomes available for subsequent reads.

Collaborator

mway Mar 24, 2019

i suspect that that's the case on average simply because

mean(client->server unidirectional read latency) ≤ mean(flush interval + write() + fsync() latency).

correct? (this is getting super pedantic, sorry - i don't think it's necessarily critical to call the distinction out here, this is more for my own edification.)

docs/operational_guide/availability_consistency_durability.md

    
              1. 524288 or more bytes have been written since the last time M3DB flushed the commitlog.

              2. One or more seconds has elapsed since the last time M3DB flushed the commitlog.

              In addition, the configuration also states that M3DB should allow up to `2097152` writes to be buffered in the commitlog queue before the database node will begin rejecting incoming writes so it can attempt to drain the queue and catch up. Increasing the size of this queue can often increase the write throughput of an M3DB node at the cost of potentially losing more data if the node experiences a sudden failure like a hard crash or power loss.

Collaborator

mway Mar 24, 2019

drain the queue and catch up

i don't think (but am potentially just not seeing) that we've documented/diagrammed the architectural details to explain how the queue works (maybe that'd be overkill), but if we ever add that, this should probably link to it.

Contributor Author

richardartoul Mar 25, 2019

Yeah unfortunately we don't have that right now

Collaborator

mway Mar 25, 2019

i didn't think so - just a note for the future, then. :)
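
Since the queue internals are not documented, the behaviour described in the quoted paragraph can only be illustrated with a toy model. The sketch below is not M3DB's implementation: `should_flush` and `BoundedWriteQueue` are hypothetical names, and only the three numeric values come from the guide text quoted in this thread.

```python
from collections import deque

# Values quoted in the guide; everything else here is an assumption.
FLUSH_MAX_BYTES = 524288    # 2**19 bytes written since the last flush
FLUSH_EVERY_SECONDS = 1.0   # seconds elapsed since the last flush
QUEUE_SIZE = 2097152        # 2**21 buffered writes before rejecting


def should_flush(bytes_since_flush: int, seconds_since_flush: float) -> bool:
    """Flush when either threshold from the guide has been reached."""
    return (bytes_since_flush >= FLUSH_MAX_BYTES
            or seconds_since_flush >= FLUSH_EVERY_SECONDS)


class BoundedWriteQueue:
    """Toy queue that rejects new writes once it is full."""

    def __init__(self, capacity: int = QUEUE_SIZE):
        self.capacity = capacity
        self.pending = deque()

    def enqueue(self, write) -> bool:
        # A full queue rejects the write so the node can catch up.
        if len(self.pending) >= self.capacity:
            return False
        self.pending.append(write)
        return True

    def drain(self) -> int:
        # Stand-in for flushing buffered writes to disk; returns how many.
        drained = len(self.pending)
        self.pending.clear()
        return drained


if __name__ == "__main__":
    queue = BoundedWriteQueue(capacity=2)
    print([queue.enqueue(w) for w in ("a", "b", "c")])
    print(queue.drain())
```

In this model, a larger `capacity` accepts more writes before rejecting any, and everything still in `pending` at the moment of a crash is lost, which is the trade-off the guide describes.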

docs/operational_guide/availability_consistency_durability.md Outdated

    
              writeNewSeriesLimitPerSecond: 1048576

              ```

              This value can be set much lower than the default value for workloads in which a significant increase in cardinality usually indicates an abusive caller.

Collaborator

mway Mar 24, 2019

abusive caller

i'd recommend changing this to "misbehaving", as abuse implies malicious intent. more likely, it's a caller that doesn't understand the goals or limitations of the system.

don't want to get too tied up in semantics, but if we use words like "abusive", "misbehaving", etc, we might want to very clearly call out or link to expectations (e.g. wrt dimensionality, cardinality, etc).

Contributor Author

richardartoul Mar 25, 2019

Misbehaving in this context would depend on your setup and workload. I.e., at Uber it might be someone emitting UUIDs in a Spark job, but for someone else's setup that might be the intended use case.

Collaborator

mway Mar 25, 2019

understood - still think we should switch to "misbehaving" instead of "abusive" unless you disagree with the semantics there.

Contributor Author

richardartoul Mar 25, 2019

yep i did
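
To make the effect of `writeNewSeriesLimitPerSecond` concrete, here is a minimal sketch of a per-second cap on new series. The thread does not describe how M3DB enforces the limit, so the fixed-window counter and the `NewSeriesLimiter` name are assumptions chosen for illustration only.

```python
class NewSeriesLimiter:
    """Fixed-window counter: at most `limit_per_second` new series per second.

    Hypothetical illustration; not M3DB's actual limiting algorithm.
    """

    def __init__(self, limit_per_second: int):
        self.limit = limit_per_second
        self.window = None  # the whole second currently being counted
        self.count = 0      # new series admitted in that second

    def allow(self, now_seconds: float) -> bool:
        window = int(now_seconds)
        if window != self.window:
            # A new second has started, so reset the counter.
            self.window = window
            self.count = 0
        if self.count >= self.limit:
            return False
        self.count += 1
        return True


if __name__ == "__main__":
    limiter = NewSeriesLimiter(limit_per_second=2)
    for t in (10.0, 10.5, 10.9, 11.0):
        print(t, limiter.allow(t))
```

With a low limit, a caller that suddenly emits many unique series (for example, UUIDs as tag values) is throttled within the same second, while writes to existing series are unaffected by this counter.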

mway added the area:documentation label

benraskin92 reviewed

docs/operational_guide/availability_consistency_durability.md Outdated

    
              ### Client Write and Read consistency

              We recommend running the client with `writeConsistencyLevel` set to `majority` and `readConsistencyLevel` set to `unstrict_majority`.

              This means that all writes must be acknowledged by a quorum of nodes in order to be considered successful, and that reads will attempt to achieve quorum, but will return the data from a single node if they are unable to achieve quorum.

Collaborator

benraskin92 Mar 25, 2019

Maybe give a brief example?

Contributor Author

richardartoul Mar 25, 2019

added an extra sentence
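
As a brief example of the recommended levels, the sketch below works through the arithmetic for an assumed replication factor of 3. The function names are hypothetical and are not part of the M3DB client. The logic follows only the description quoted above: writes need a quorum, while reads attempt quorum and fall back to a single response.

```python
def majority(replication_factor: int) -> int:
    """Smallest number of replicas that forms a quorum."""
    return replication_factor // 2 + 1


def write_succeeds(acks: int, replication_factor: int) -> bool:
    # writeConsistencyLevel: majority
    # A quorum of replicas must acknowledge the write.
    return acks >= majority(replication_factor)


def read_succeeds(responses: int, replication_factor: int) -> bool:
    # readConsistencyLevel: unstrict_majority
    # Quorum is attempted, but one response is enough to return data.
    # replication_factor is unused here and kept only for symmetry.
    return responses >= 1


if __name__ == "__main__":
    rf = 3  # assumed replication factor
    for available in range(rf, -1, -1):
        print(available,
              write_succeeds(available, rf),
              read_succeeds(available, rf))
```

With three replicas, losing one node leaves both writes and reads working; losing two nodes fails writes but still allows reads from the remaining node.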

docs/operational_guide/availability_consistency_durability.md

    
              ### Commitlog Configuration

              By default M3DB runs with an asynchronous commitlog such that writes will be reported as successful by the client, though the data may not have been flushed to disk yet.

              M3DB supports changing this default behavior to run the commitlog synchronously, but this is not currently exposed to users in the YAML configuration and generally leads to a massive performance degradation.

              We recommend running M3DB with an asynchronous commitlog.

Collaborator

benraskin92 Mar 25, 2019

+1

docs/operational_guide/availability_consistency_durability.md

    
              This configuration states that the commitlog should be flushed whenever either of the following is true:

              1. 524288 or more bytes have been written since the last time M3DB flushed the commitlog.

Collaborator

benraskin92 Mar 25, 2019

Why 524288 and 2097152?

Contributor Author

richardartoul Mar 25, 2019

It's just the default we've always used. Presumably @robskillington did some benchmarking?

Collaborator

mway Mar 25, 2019

they're 2^19 and 2^21, respectively, if that helps.

docs/operational_guide/availability_consistency_durability.md

    
              ### Ignoring Corrupt Commitlogs on Bootstrap

              As described in the "Tuning for Performance and Availability" section, we recommend configuring M3DB to ignore corrupt commitlog files on bootstrap. However, if you want to avoid any amount of inconsistency or data loss, no matter how minor, then you should configure M3DB to return unfulfilled when the commitlog bootstrapper encounters corrupt commitlog files. You can do so by modifying your configuration to look like this:

Collaborator

benraskin92 Mar 25, 2019

Hmm do we need both this section and ### Ignoring Corrupt Commitlogs on Bootstrap?

Contributor Author

richardartoul Mar 25, 2019

Not totally following. I have the subheading twice, once under availability section and once under consistency

schallert reviewed

Collaborator

schallert left a comment

This looks awesome so far, LGTM once the other discussions are resolved.

Richard Artoul added 8 commits

March 25, 2019 15:03


          Add operational guide on tuning availability, consistency, and durability

d200d1c


          Fix guide name

ca673b9


          improve language

a138c3e


          fix typo

ac3cac2


          refactor paragraph

195897b


          address feedback

ac1e028


          refactor into two sections;

0c960a9


          address more feedback

351879b

richardartoul force-pushed the ra/availability-consistency-durability branch from 3e7121b to 351879b Compare

March 25, 2019 19:03


          Merge branch 'master' into ra/availability-consistency-durability

d2ed4a6

mway approved these changes

Collaborator

mway left a comment

🥇


          Merge branch 'master' into ra/availability-consistency-durability

6b5836f

richardartoul merged commit d7d3559 into master

richardartoul deleted the ra/availability-consistency-durability branch

March 26, 2019 15:24

Reviewers

benraskin92 left review comments

schallert left review comments

mway approved these changes

robskillington: awaiting requested review

arnikola: awaiting requested review

prateek: awaiting requested review

Labels

area:documentation