cluster: small performance optimisations #780

wprzytula · 2023-07-31T13:21:13Z

This is a bunch of performance-related small optimisation of the cluster module, mainly focused on avoiding cloning where possible.

Pre-review checklist

I have split my patch into logically separate commits.
All commit messages clearly explain what they change and why.
~~[ ] I added relevant tests for new features and bug fixes.~~
All commits compile, pass static checks and pass test.
PR description sums up the changes and reasons why they should be introduced.
~~[ ] I have provided docstrings for the public items that I want to introduce.~~
~~[ ] I have adjusted the documentation in ./docs/source/.~~
~~[ ] I added appropriate Fixes: annotations to PR description.~~

Keyspace strategies were cloned and allocated into a Vec, just for creating an iterator to references to them in order to construct a `ReplicaLocator`. It was done due to lifetime issues (in order to send references to another thread, they must be 'static). The problem is solved by sending keyspaces to another thread for constructing `ReplicaLocator` and then moving it back to the calling thread afterwards. Clones are elided. Hooray!

By leveraging branched variable initialisation, cloning datacenters and racks is no longer necessary.

This one wasn't particularly hard.

There no reason why the method should be async.

wprzytula · 2023-08-02T07:57:45Z

v2: rebased on main.

scylla/src/transport/cluster.rs

scylla/src/transport/session.rs

scylla/src/transport/cluster.rs

There was a convoluted logic that can be easily expressed using itertools' `find_or_first()` iterator method.

havaker · 2023-08-04T09:22:54Z

scylla/src/transport/cluster.rs

+            .map(|node| node.get_working_connections())
+            .flatten_ok()
+        // By an invariant `self.known_peers` is nonempty, so the returned iterator
+        // is nonempty, too.


Why is that? If all node pools are in MaybePoolConnections::Broken state, every call to node.get_working_connections() will fail.

That's true, and hence the resulting iterator will consist of only Err variants.
flatten_ok() does not remove any Err variants; what it does is flattening the Ok variants; that is, Ok(Ok(x)) becomes Ok(x).

That's a quite large change in semantics. Previously, if all nodes had a broken connection pool then the function returned an error; otherwise you would get a list of current connections. Now, you are putting the responsibility for handling errors to the caller. Despite this change the code seems to be working due to how the current callers are handling the errors from the iterator.

I'd rather see the old semantics, i.e. the function should either fail immediately if there are no connections or return an iterator that returns just Arc<Connection>. Alternatively (but not preferably) you could document the meaning of what the iterator really returns.

scylla/src/transport/session.rs

havaker

Left a nit, LGTM

scylla/src/transport/cluster.rs

piodul

One request, LGTM otherwise.

piodul · 2023-08-21T13:57:40Z

scylla/src/transport/session.rs

+        let handles = connections_iter.map(|c| async {
+            match c {
+                Ok(c) => c.fetch_schema_version().await,
+                Err(err) => Err(err),
+            }
+        });


Would and_then instead of map work? Similarly above, lines 868-873.

You probably mean "instead of match". Then I don't think so, because the operation is done inside an async block, which is a known limitation of functional idioms in Rust.

piodul · 2023-08-21T14:20:08Z

scylla/src/transport/cluster.rs

+            .map(|node| node.get_working_connections())
+            .flatten_ok()
+        // By an invariant `self.known_peers` is nonempty, so the returned iterator
+        // is nonempty, too.


That's a quite large change in semantics. Previously, if all nodes had a broken connection pool then the function returned an error; otherwise you would get a list of current connections. Now, you are putting the responsibility for handling errors to the caller. Despite this change the code seems to be working due to how the current callers are handling the errors from the iterator.

I'd rather see the old semantics, i.e. the function should either fail immediately if there are no connections or return an iterator that returns just Arc<Connection>. Alternatively (but not preferably) you could document the meaning of what the iterator really returns.

wprzytula · 2023-08-22T08:10:25Z

Reworked iter_working_connections() so that now it returns either an error or an iterator with successful working connections (with at least one of them).

`get_working_connections()` not only allocated `Vec`s for all nodes, but it even cloned them to one big `Vec`. At least the latter could be avoided by returning an iterator over connections. The new replacement method, `iter_working_connections()`, was put on `ClusterData` instead of `Cluster`, because it borrows from `ClusterData`, so lifetime issues would be brought otherwise. The two uses of `get_working_connections()`, `Session::prepare()` and `Session::check_schema_agreement()`, were updated to use `iter_working_connections()`, and `get_working_connections()` was deleted as not needed anymore.

As `join_all()` accepts any `IntoIterator`, it accepts iterators in particular. Allocating a `Vec` is unnecessary.

wprzytula added the performance Improves performance of existing features label Jul 31, 2023

wprzytula requested review from piodul and havaker July 31, 2023 13:23

wprzytula force-pushed the cluster-small-optimisations branch from a27b28f to 095c6fb Compare July 31, 2023 13:24

wprzytula added 4 commits August 2, 2023 09:56

cluster: elide clones at PeerEndpoint construction

94557a8

By leveraging branched variable initialisation, cloning datacenters and racks is no longer necessary.

cluster: elide clone in update_rack_count()

7f9d5e7

This one wasn't particularly hard.

cluster: un-async get_working_connections()

00d37b3

There no reason why the method should be async.

wprzytula force-pushed the cluster-small-optimisations branch from 095c6fb to 54fe919 Compare August 2, 2023 07:57

havaker suggested changes Aug 3, 2023

View reviewed changes

scylla/src/transport/cluster.rs Show resolved Hide resolved

scylla/src/transport/session.rs Outdated Show resolved Hide resolved

scylla/src/transport/cluster.rs Show resolved Hide resolved

cluster: prepare: refactor for clarity

e81b93b

There was a convoluted logic that can be easily expressed using itertools' `find_or_first()` iterator method.

wprzytula force-pushed the cluster-small-optimisations branch from 54fe919 to 73373d2 Compare August 3, 2023 17:03

havaker reviewed Aug 4, 2023

View reviewed changes

havaker suggested changes Aug 4, 2023

View reviewed changes

scylla/src/transport/session.rs Outdated Show resolved Hide resolved

scylla/src/transport/session.rs Show resolved Hide resolved

wprzytula force-pushed the cluster-small-optimisations branch from 73373d2 to 8f9e619 Compare August 4, 2023 15:35

wprzytula requested a review from havaker August 4, 2023 17:03

havaker approved these changes Aug 15, 2023

View reviewed changes

scylla/src/transport/cluster.rs Show resolved Hide resolved

wprzytula requested a review from avelanarius August 18, 2023 15:19

piodul requested changes Aug 21, 2023

View reviewed changes

wprzytula force-pushed the cluster-small-optimisations branch from 8f9e619 to dbcd9e9 Compare August 22, 2023 08:09

wprzytula added 2 commits August 22, 2023 10:11

cluster: send_use_keyspace(): avoid Vec allocation

1bc5744

As `join_all()` accepts any `IntoIterator`, it accepts iterators in particular. Allocating a `Vec` is unnecessary.

wprzytula force-pushed the cluster-small-optimisations branch from dbcd9e9 to 1bc5744 Compare August 22, 2023 08:11

wprzytula requested review from piodul and havaker August 22, 2023 08:11

piodul approved these changes Aug 22, 2023

View reviewed changes

piodul merged commit decac4e into scylladb:main Aug 22, 2023
8 checks passed

wprzytula deleted the cluster-small-optimisations branch August 22, 2023 09:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cluster: small performance optimisations #780

cluster: small performance optimisations #780

wprzytula commented Jul 31, 2023

wprzytula commented Aug 2, 2023

havaker Aug 4, 2023

wprzytula Aug 4, 2023

piodul Aug 21, 2023

havaker left a comment

piodul left a comment

piodul Aug 21, 2023

wprzytula Aug 22, 2023

piodul Aug 21, 2023

wprzytula commented Aug 22, 2023

cluster: small performance optimisations #780

cluster: small performance optimisations #780

Conversation

wprzytula commented Jul 31, 2023

Pre-review checklist

wprzytula commented Aug 2, 2023

havaker Aug 4, 2023

Choose a reason for hiding this comment

wprzytula Aug 4, 2023

Choose a reason for hiding this comment

piodul Aug 21, 2023

Choose a reason for hiding this comment

havaker left a comment

Choose a reason for hiding this comment

piodul left a comment

Choose a reason for hiding this comment

piodul Aug 21, 2023

Choose a reason for hiding this comment

wprzytula Aug 22, 2023

Choose a reason for hiding this comment

piodul Aug 21, 2023

Choose a reason for hiding this comment

wprzytula commented Aug 22, 2023