Clarify hardware recs for throughput #5472
Conversation
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @bdarnell and @rkruze)
Force-pushed 4edce5d to e546174.
- The ideal configuration is 4-16 vCPUs, 8-64 GB memory nodes (2-4 GB of memory per vCPU).
- To optimize for throughput, use larger nodes, up to 16 vCPUs and 64 GB of RAM. Based on internal testing results, 16 vCPUs is the sweet spot for OLTP workloads.
- To increase throughput further, add more nodes to the cluster instead of increasing node size; higher vCPU counts have [NUMA](https://en.wikipedia.org/wiki/Non-uniform_memory_access) (non-uniform memory access) implications.
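The sizing guidance above amounts to a simple range check. A minimal sketch (the thresholds mirror the recommended ranges quoted in the diff; the function name is hypothetical):

```python
def within_recommended_shape(vcpus: int, ram_gb: float) -> bool:
    """Check a node shape against the recommended ranges:
    4-16 vCPUs, 8-64 GB of RAM, and 2-4 GB of RAM per vCPU."""
    if not 4 <= vcpus <= 16:
        return False
    if not 8 <= ram_gb <= 64:
        return False
    return 2 <= ram_gb / vcpus <= 4

# 16 vCPUs with 64 GB (4 GB/vCPU) is the throughput sweet spot
print(within_recommended_shape(16, 64))   # True
# A 32-vCPU node falls outside the range; add nodes instead
print(within_recommended_shape(32, 128))  # False
```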
I'd remove this note about NUMA. We haven't explored this but we have no reason to believe it's a major concern at this point. The immediate reason why going to 32 vCPUs and beyond loses efficiency is simple mutex contention.
Also for the record, on AWS m5/c5 nodes, you can go up to 48 vCPUs in a single NUMA group, so NUMA isn't a concern on this platform until you get to 72 vCPUs.
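The arithmetic behind that comment can be sketched as follows, assuming a fixed NUMA group size (48 vCPUs per group on AWS m5/c5, per the comment; the helper name is hypothetical):

```python
import math

def numa_groups(vcpus: int, vcpus_per_group: int = 48) -> int:
    """Number of NUMA groups an instance spans, assuming each
    group holds up to `vcpus_per_group` vCPUs."""
    return math.ceil(vcpus / vcpus_per_group)

print(numa_groups(32))  # 1 -- fits in a single NUMA group
print(numa_groups(72))  # 2 -- cross-NUMA memory access becomes possible
```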
Force-pushed e546174 to 289fbb2.
Force-pushed 289fbb2 to 25c1407.
Reviewed 10 of 10 files at r2.
Reviewable status: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @jseldess)
v19.1/deploy-cockroachdb-on-aws.md, line 72 at r2 (raw file):
- Run at least 3 nodes to ensure survivability.
- Use `m` (general purpose), `c` (compute-optimized), or `i` (storage-optimized) [instances](https://aws.amazon.com/ec2/instance-types/), with SSD-backed [EBS volumes](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/EBSVolumeTypes.html) or [Instance Store volumes](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ssd-instance-store.html). For example, Cockroach Labs has used `c5d.8xlarge` (36 vCPUs and 72 GiB of RAM per instance, NVMe SSD) for internal testing.
Our most-tested configuration is still the 4xlarge, I think.
v19.1/deploy-cockroachdb-on-google-cloud-platform.md, line 61 at r2 (raw file):
- Run at least 3 nodes to [ensure survivability](recommended-production-settings.html#topology).
- Use `n1-standard` or `n1-highcpu` [predefined VMs](https://cloud.google.com/compute/pricing#predefined_machine_types), or [custom VMs](https://cloud.google.com/compute/pricing#custommachinetypepricing), with [Local SSDs](https://cloud.google.com/compute/docs/disks/#localssds) or [SSD persistent disks](https://cloud.google.com/compute/docs/disks/#pdspecs). For example, Cockroach Labs has used `n1-standard-32` (32 vCPUs and 60 GB of RAM per VM, local SSD) for internal testing.
Ditto, for n1-standard-16.
Increase optimal vCPUs to 32. Update GCP and AWS recs as well. Holding off on Azure until the upcoming cloud report, at which point we should revisit our hardware recs more broadly. Fixes #4711.
Force-pushed 25c1407 to 595b04c.