Cassandra clusters specify initial token for large startup speedup #844

rukai · 2022-10-06T01:09:44Z

The shotover-int-tests/cassandra:4.0.6 image no longer sets a CMD, it now falls back to CMD ["cassandra", "-f"], which is the safest default.
None of our docker-compose.ymls actually use that fallback though and instead overwrite it in various ways as detailed below:

Cluster tests

We make use of shotover-int-tests/cassandra:4.0.6 to set some config we previously relied on bitnami for and then use the custom command to set initial_token, which even bitnami didn't offer.

In order to keep the token list more readable we do some weird things with environment variables.
The token list is set to the docker-compose env var CASSANDRA_INITIAL_TOKENS.
Then the container uses that environment variable at runtime.
We have to use $$ to escape the $ otherwise docker-compose will substitute the variable with an empty string, rather than letting the container do the subsitution.

Manually specifying the tokens allows cassandra cluster init to skip a lot of waiting around and takes the time for initialization to complete on my machine from 70s to 15s

It will also allow us to improve the quality of the integration tests as we will be able to make assertions on the tokens returned in system.local.
This will be done in a follow up though.

The tokens were chosen by extracting them from the naturally randomly generated tokens of a real cluster.
Possibly 128 tokens per node is excessive and it would be easier to deal with if we reduced it to say 16per node, but I'll leave that for a potential follow up.

The dc1 -> Mars change in test_cassandra_peers_rewrite_cassandra4 was needed because its docker-compose.yml was setting CASSANDRA_DC: Mars which was previously unused by the bitnami image, but now that we are using the official image as a base the CASSANDRA_DC field is being properly picked up.

Single instance tests

The only change here was to set the command: cassandra -f -Dcassandra.skip_wait_for_gossip_to_settle=0 -Dcassandra.initial_token=0
I think this is an improvement as setting the command explicitly shows the mild hack these tests use to speed up initialization.

rukai force-pushed the initial_tokens branch 4 times, most recently from aedddd5 to 1c8a6fc Compare October 6, 2022 01:43

Cassandra clusters specify initial token for large startup speedup

1f49823

rukai force-pushed the initial_tokens branch from 5901adc to de59964 Compare October 6, 2022 03:11

Combine into a single image

49fb61a

rukai force-pushed the initial_tokens branch from de59964 to 49fb61a Compare October 6, 2022 03:30

rukai marked this pull request as ready for review October 6, 2022 04:06

rukai requested review from conorbros, benbromhead and XA21X October 6, 2022 04:06

conorbros approved these changes Oct 10, 2022

View reviewed changes

XA21X approved these changes Oct 10, 2022

View reviewed changes

Merge branch 'main' into initial_tokens

01133a2

rukai enabled auto-merge (squash) October 11, 2022 03:21

Merge branch 'main' into initial_tokens

eacbdff

rukai merged commit 95e2090 into shotover:main Oct 11, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cassandra clusters specify initial token for large startup speedup #844

Cassandra clusters specify initial token for large startup speedup #844

rukai commented Oct 6, 2022 •

edited

Cassandra clusters specify initial token for large startup speedup #844

Cassandra clusters specify initial token for large startup speedup #844

Conversation

rukai commented Oct 6, 2022 • edited

Cluster tests

Single instance tests

rukai commented Oct 6, 2022 •

edited