See RFD 53 ("Control plane data storage requirements") for background.
See RFD 110 ("CockroachDB for the control plane database") for the results of this testing.
The code in this repo uses Terraform to provision a multi-node CockroachDB cluster on AWS, using OmniOS and Joshua’s CRDB binaries. This covers:
- terraform configuration to deploy the whole thing
- working illumos builds (and configuration and SMF manifests) for:
  - cockroachdb
  - Prometheus
  - Grafana
  - sysbench
  - chrony
  - Prometheus's node_exporter
- monitoring:
  - generic VM metrics: boot time, CPU utilization, etc.
  - illumos-specific metrics: I/O latency and utilization, network throughput
  - CockroachDB metrics
  - Grafana and Prometheus metrics
  - many of these use AWS-based service discovery
To deploy a cluster, you need to have:
- terraform configured using your AWS account
- an ssh key pair configured in AWS called "dap-terraform", OR change locals.ssh_key_name in terraform/nodes.tf to refer to your key's name
- IAM permissions to configure the Terraform cluster (TBD: which perms, exactly?)
- the aws CLI configured locally
- read access to the oxide-cockroachdb-exploration S3 bucket
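If you want to sanity-check these prerequisites before going further, something like the following should work (just a suggestion; it assumes your default AWS CLI profile is the one Terraform will use):
$ aws sts get-caller-identity
$ aws ec2 describe-key-pairs --key-names dap-terraform
$ terraform version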
Before deploying the cluster, you must download and unpack the "proto" tarball, which lives in that S3 bucket. From the root of your clone of this repository:
$ aws s3 cp s3://oxide-cockroachdb-exploration/proto-raw.tgz ./proto-raw.tgz
$ tar xzf proto-raw.tgz
This will unpack a bunch of files into the ./vminit directory. These are built binaries, configuration, and other files for the various programs we use: CockroachDB, Prometheus, Grafana, Chrony, haproxy, etc. They're organized by role ("common", "db", and "mon"), and each role's directory contains files to be installed into the target VM starting at "/". Note that there are also files in this repo mixed in with the files extracted from proto-raw.tgz. This is a little janky, but it allows us to keep config files and the like under source control without having to check in gigantic binaries.
Note: Most of these files are not strictly necessary for a stock deployment, but you do need ./vminit/fetcher.gz from that tarball. This is an illumos build of the "fetcher" tool contained in this repo.
You will need to create two S3 buckets:
- $s3_terraform_bucket: will be used to store Terraform state
- $s3_asset_bucket: will be used to store assets used at deployment time for the VMs
These can be the same bucket, if you want. You can create these using the S3 web console or the S3 CLI.
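For example, with the AWS CLI (bucket names and region here are placeholders; pick your own):
$ aws s3 mb s3://my-terraform-state-bucket --region us-west-2
$ aws s3 mb s3://my-vminit-assets-bucket --region us-west-2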
To use these:
- Modify ./terraform/aws.tf. In the block starting with backend "s3", set "bucket" to the $s3_terraform_bucket that you created above. Make sure the "region" is set correctly, too.
- Modify ./terraform/nodes.tf. In the locals block at the top, set s3_asset_bucket to the name of your $s3_asset_bucket.
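For reference, the relevant pieces end up looking roughly like this (a sketch only; the real files contain more settings, and the exact values shown are placeholders):

# terraform/aws.tf
terraform {
  backend "s3" {
    bucket = "my-terraform-state-bucket"   # your $s3_terraform_bucket
    region = "us-west-2"
    # ... other backend settings (e.g., the state key) stay as they are
  }
}

# terraform/nodes.tf
locals {
  s3_asset_bucket = "my-vminit-assets-bucket"   # your $s3_asset_bucket
  ssh_key_name    = "dap-terraform"             # or your own key pair's name
  # ... other locals
}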
Now build the tarballs and upload them to S3:
$ cd vminit
$ make all
$ aws s3 cp vminit-common.tgz s3://$s3_asset_bucket/vminit-common.tgz
$ aws s3 cp vminit-cockroachdb.tgz s3://$s3_asset_bucket/vminit-cockroachdb.tgz
$ aws s3 cp vminit-mon.tgz s3://$s3_asset_bucket/vminit-mon.tgz
$ cd ..
Now, you can configure Terraform and deploy the cluster:
$ cd terraform
$ terraform init
$ terraform apply
On success, this will output the public and private IPs of all the nodes in the cluster. You will need to ssh to the public IP address of each database node and enable CockroachDB with:
$ svcadm enable -s cockroachdb
(These tools deliberately don’t do this by default because in some cases we want to provision more nodes than we initially enable. This gives you control over which nodes form the initial cluster.)
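One way to do this for several nodes at once (a sketch; substitute the public IPs that terraform apply printed, and include only the nodes you want in the initial cluster):
$ for ip in DB0_PUBLIC_IP DB1_PUBLIC_IP DB2_PUBLIC_IP; do
>     ssh root@$ip svcadm enable -s cockroachdb
> done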
Now, note the public IP address of the load generator node and the private IP address of any of the database nodes, then log into the load generator and run:
$ ssh root@LOADGEN_PUBLIC_IP
$ configure_cluster --host DB_PRIVATE_IP
You should now have a cluster running and being monitored!
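If you want to double-check from the load generator, something like this should show all of the nodes as live (this assumes an insecure cluster, which is what these tools set up, and that the cockroach binary is on the PATH there):
$ cockroach node status --insecure --host DB_PRIVATE_IP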
The easiest way to see the Grafana graphs of the cluster is to set up an SSH tunnel, connect to Grafana, and look at the "Testing Dashboard". The env.sh file in this repo contains aliases to help with this:
$ source env.sh
$ start_project_ssh
mon internal IP: 192.168.1.146
db0 internal IP: 192.168.1.24
db0 external IP: 54.202.147.117
ssh -o "StrictHostKeyChecking accept-new" -L9090:192.168.1.146:9090 -L3000:192.168.1.146:3000 -L8080:192.168.1.24:8080 root@54.202.147.117
$
Take the command it outputs and run it yourself to set up SSH tunnels for Grafana, Prometheus, and the CockroachDB Admin UI. You can access Grafana at http://127.0.0.1:3000/ (load up the "Testing Dashboard") and you can access the CockroachDB Admin UI at http://127.0.0.1:8080/. (If you get a 404 for the CockroachDB Admin UI, make sure you ran configure_cluster above.)
To log into Grafana, use the default username and password, both of which are "admin".
You can modify the files under vminit/{common,mon,db} to change the corresponding files in deployed VMs. To apply these changes, use the steps above to build new VM tarballs, upload them to your S3 asset bucket, and redeploy.
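The full loop looks roughly like this (a sketch; depending on what changed, you may need to recreate the affected instances, e.g. with terraform taint, for the new tarballs to be picked up):
$ cd vminit
$ make all
$ aws s3 cp vminit-common.tgz s3://$s3_asset_bucket/vminit-common.tgz
$ aws s3 cp vminit-cockroachdb.tgz s3://$s3_asset_bucket/vminit-cockroachdb.tgz
$ aws s3 cp vminit-mon.tgz s3://$s3_asset_bucket/vminit-mon.tgz
$ cd ../terraform
$ terraform apply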
For files under version control, remember to upstream your changes to this repository. Note that many of these files are not under version control, but came from the proto-raw.tgz tarball instead. To update the canonical copy of these, construct a new tarball and upload it to the "oxide-cockroachdb-exploration" S3 bucket.
"proto-raw.tgz" is currently constructed and maintained by hand. It consists of:
- vminit/common: a directory tree of files to be installed into all VMs. This includes the binaries, libraries, and supporting files for Chrony, illumos-exporter, and node_exporter.
- vminit/cockroachdb: a directory tree of files to be installed into CockroachDB database and load generator VMs. This includes the binaries, libraries, and supporting files for CockroachDB itself, sysbench, haproxy, etc.
- vminit/fetcher.gz: a gzipped copy of a release build of tools/fetcher (in this repo) for illumos
- vminit/mon: a directory tree of files to be installed into the monitoring VM. This includes the binaries, libraries, and supporting files for Prometheus and Grafana.
The builds generally come from garbage-compactor (for components that are present there) or else are fairly stock builds. Note that there are configuration files and the like stored in this repo (not proto-raw.tgz) that will go into some of the same directories under vminit. See the note above about this.
Once you’ve constructed the directory layout by hand, upload this to the S3 bucket so that other users can use this tarball.
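Roughly (a sketch; be careful to include only the files that actually came from the tarball, not the ones kept under version control in this repo):
$ tar czf proto-raw.tgz vminit/common vminit/cockroachdb vminit/mon vminit/fetcher.gz
$ aws s3 cp proto-raw.tgz s3://oxide-cockroachdb-exploration/proto-raw.tgz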
Known issues and loose ends:
- cockroachdb: We're currently working on a build from master from the summer. We should switch to a release build and make sure we're exercising Pebble. (We are exercising Pebble now, but if we switch to the latest release as of this writing, we will be back on RocksDB.)
- cockroachdb: Readline functionality (e.g., up arrow to see the previous command) doesn't work in the cockroach sql shell.
- chrony setup: Sometimes a cold start of the VMs leaves CockroachDB in maintenance, having crashed because its clock was too far out of sync. This should not be possible because we're starting chrony and configuring it to wait until it has successfully sync'd the clock (with step, not slew) before starting CockroachDB on all nodes. Still, it happens sometimes.
- cockroachdb: Before you've initialized the CRDB cluster, if you go to the Admin UI, you get a very blank 404 page.
- terraform: we sometimes hit hashicorp/terraform-provider-aws#12533. Retrying terraform apply has worked around the issue.
- cockroachdb: I tried activating statement diagnostics for an UPSERT that one of the workloads runs to see what that does. This produced a bundle that was 23 bytes (0 bytes downloaded, for some reason). This may have been a known bug (see the raw notes file), but I'm not sure. There's a good, short video showing the data in these bundles.
- cockroachdb: flags for the cockroach workload command do not match the online docs.
- my tools: when running configure_cluster, for some reason we only see one node in the haproxy config file even though all three seemed to be up when we configured the cluster. This hasn't been a problem because I abandoned haproxy early on.
- cockroachdb: missing illumos implementations for a lot of the system metrics (which I believe come from gosigar).
- my tools: should try setting up a "secure" cluster.
- my tools: the "env.sh" listing of VMs should exclude terminated instances, particularly for --stop-instances and --start-instances.
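For that last item, one way to exclude terminated instances is an AWS CLI query along these lines (a suggestion only for how env.sh could do it):
$ aws ec2 describe-instances \
      --filters Name=instance-state-name,Values=running,stopped \
      --query 'Reservations[].Instances[].[InstanceId,State.Name,PublicIpAddress]' \
      --output table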
CockroachDB recently changed the default storage engine from RocksDB to Pebble, despite the documentation (even for the build that I'm using) not having been updated to reflect that.
See also the open questions in the report.
Is it worth seeing what happens when CockroachDB runs out of disk space, e.g., by putting its text log and data on separate filesystems and seeing what it logs? See the 9/30 raw notes for a description of apparent corruption when this happens, including data lost after the corruption was supposedly repaired.
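If we try this, one way to set it up on a database VM would be separate ZFS datasets with small quotas (a rough sketch; the pool name, dataset names, quotas, and cockroach flags are illustrative and may need adjusting for the version in use):
$ zfs create -o quota=2G rpool/crdb-logs
$ zfs create -o quota=20G rpool/crdb-data
$ cockroach start --insecure --store=path=/rpool/crdb-data --log-dir=/rpool/crdb-logs ...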