Reduce address comparisons for network topology replica calculation by mpenick · Pull Request #532 · apache/cassandra-cpp-driver

mpenick · 2022-08-15T19:33:23Z

This uses a DenseHashSet to keep prevent duplicate replicas instead of doing a linear scan through the existing replicas. I'm seeing around a 4.5x speed up for larger replication factors (rf = 54).

This uses a `DenseHashSet` to keep prevent duplicate replicas instead of doing a linear scan through the existing replicas. I'm seeing around a 4.5x speed up for larger replication factors (rf = 54).

mpenick · 2022-08-15T19:33:52Z

tests/src/unit/tests/test_token_map.cpp

-  size_t replication_factor = 3;
-  size_t total_replicas = std::min(num_hosts, replication_factor) * num_dcs;
+  size_t replication_factor = 54;
+  size_t total_replicas = replication_factor;


This should likely be reverted. This is a pathological use case though.

mpenick · 2022-08-15T19:45:16Z

tests/src/unit/test_token_map_utils.hpp


  static size_t size_of(const String& value) { return value.size(); }

+  static size_t size_of(const Address& value) { char buf[16]; return value.to_inet(buf); }


These changes fix test warnings.

zakalibit · 2022-08-15T20:00:05Z

src/token_map_impl.hpp

          skipped_endpoints_this_dc.push_back(curr_token_it);
        } else {
-          if (add_replica(replicas, Host::Ptr(host))) {
+          if (replicas_set.insert(host).second) {


why not just embed the changes in the add_replica()?
another option is to try keep replicas sorted and use binary search? i.e. use
std::sort(), or even std::stable_sort() that should be faster for sorted or, almost sorted data, then just use std::lower_bound() with custom Comparator to check if the host needs to be added.

The token order of replicas matters.

still could embed the logic in to add_replica()

src/host.hpp

…nStack to AWS

Michael Penick added 2 commits August 8, 2022 09:56

Test with large replication factor and large number of vnodes

257c9a9

Reduce address comparisons for network topology replica calculation

928ff70

This uses a `DenseHashSet` to keep prevent duplicate replicas instead of doing a linear scan through the existing replicas. I'm seeing around a 4.5x speed up for larger replication factors (rf = 54).

mpenick commented Aug 15, 2022

View reviewed changes

Remove debugging

74eb1b9

mpenick commented Aug 15, 2022

View reviewed changes

zakalibit reviewed Aug 15, 2022

View reviewed changes

mpenick commented Aug 15, 2022

View reviewed changes

src/host.hpp Outdated Show resolved Hide resolved

Michael Penick and others added 5 commits August 16, 2022 09:10

Use address set

5b35f2d

fix the missing OS_DISTRO env variable as a result of moving from Ope…

2501425

…nStack to AWS

Merge branch 'fix_cpp_aws' into v2.15.2-large-rf

f38c0a0

specifically use local variable for os distro

a78cc90

embed get_os_distro call in script block

402119e

absurdfarce closed this Oct 29, 2025

absurdfarce deleted the v2.15.2-large-rf branch October 29, 2025 20:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce address comparisons for network topology replica calculation#532

Reduce address comparisons for network topology replica calculation#532
mpenick wants to merge 8 commits intomasterfrom
v2.15.2-large-rf

mpenick commented Aug 15, 2022

Uh oh!

mpenick Aug 15, 2022 •

edited

Loading

Uh oh!

mpenick Aug 15, 2022

Uh oh!

zakalibit Aug 15, 2022

Uh oh!

mpenick Aug 15, 2022

Uh oh!

zakalibit Aug 17, 2022

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants


		static size_t size_of(const String& value) { return value.size(); }

		static size_t size_of(const Address& value) { char buf[16]; return value.to_inet(buf); }

Conversation

mpenick commented Aug 15, 2022

Uh oh!

mpenick Aug 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mpenick Aug 15, 2022

Choose a reason for hiding this comment

Uh oh!

zakalibit Aug 15, 2022

Choose a reason for hiding this comment

Uh oh!

mpenick Aug 15, 2022

Choose a reason for hiding this comment

Uh oh!

zakalibit Aug 17, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mpenick Aug 15, 2022 •

edited

Loading