message: match unknown tenants to the default tenant #13843

denesb · 2023-05-10T09:10:55Z

On connection setup, the isolation cookie of the connection is matched to the appropriate scheduling group. This is achieved by iterating over the known statement tenant connection types as well as the system connections and choosing the one with a matching name.

If a match is not found, it is assumed that the cluster is upgraded and the remote node has a scheduling group the local one doesn't have. To avoid demoting a scheduling group of unknown importance, in this case the default scheduling group is chosen.

This is problematic when upgrading an OSS cluster to an enterprise version, as the scheduling groups of the enterprise service-levels will match none of the statement tenants and will hence fall-back to the default scheduling group. As a consequence, while the cluster is mixed, user workload on old (OSS) nodes, will be executed under the system scheduling group and concurrency semaphore. Not only does this mean that user workloads are directly competing for resources with system ones, but the two workloads are now sharing the semaphore too, reducing the available throughput. This usually manifests in queries timing out on the old (OSS) nodes in the cluster.

This PR proposes to fix this, by recognizing that the unknown scheduling group is in fact a tenant this node doesn't know yet, and matching it with the default statement tenant. With this, order should be restored, with service-level connections being recognized as user connections and being executed in the statement scheduling group and the statement (user) concurrency semaphore.

I tested this manually, by creating a cluster of 2 OSS nodes, then upgrading one of the nodes to enterprise and verifying (with extra logging) that service level connections are matched to the default statement tenant after the PR and they indeed match to the default scheduling group before.

Fixes: #13841
Fixes: #12552

We have a set amount of connection types for each tenant. The amount of these connection types can change. Although currently these are hardcoded in a single place, soon (in the next patch) there will be yet another place where these will be used. To avoid duplicating these names, making future changes error prone, centralize them in a const array, generalizing the concept of a tenant connection type.

On connection setup, the isolation cookie of the connection is matched to the appropriate scheduling group. This is achieved by iterating over the known statement tenant connection types as well as the system connections and choosing the one with a matching name. If a match is not found, it is assumed that the cluster is upgraded and the remote node has a scheduling group the local one doesn't have. To avoid demoting a scheduling group of unknown importance, in this case the default scheduling group is chosen. This is problematic when upgrading an OSS cluster to an enterprise version, as the scheduling groups of the enterprise service-levels will match none of the statement tenants and will hence fall-back to the default scheduling group. As a consequence, while the cluster is mixed, user workload on old (OSS) nodes, will be executed under the system scheduling group and concurrency semaphore. Not only does this mean that user workloads are directly competing for resources with system ones, but the two workloads are now sharing the semaphore too, reducing the available throughput. This usually manifests in queries timing out on the old (OSS) nodes in the cluster. This patch proposes to fix this, by recognizing that the unknown scheduling group is in fact a tenant this node doesn't know yet, and matching it with the default statement tenant. With this, order should be restored, with service-level connections being recognized as user connections and being executed in the statement scheduling group and the statement (user) concurrency semaphore.

scylladb-promoter · 2023-05-10T12:09:34Z

CI state FAILURE - https://jenkins.scylladb.com/job/scylla-master/job/scylla-ci/1169/

denesb · 2023-05-10T14:07:29Z

CI state FAILURE - https://jenkins.scylladb.com/job/scylla-master/job/scylla-ci/1169/

Known flaky test #13211. Re-kicked.

scylladb-promoter · 2023-05-17T13:35:23Z

CI state ABORTED - https://jenkins.scylladb.com/job/scylla-master/job/scylla-ci/1286/

denesb · 2023-05-17T13:37:13Z

Looks like there is a genuine failure here, some tests never finish.

denesb · 2023-05-18T07:40:44Z

Looks like there is a genuine failure here, some tests never finish.

Locally I reproduced a known CI issue #13887, so retrying CI.

scylladb-promoter · 2023-05-18T10:54:17Z

CI state SUCCESS - https://jenkins.scylladb.com/job/scylla-master/job/scylla-ci/1308/

avikivity · 2023-05-21T18:08:25Z

message/messaging_service.cc

+    // recognize, but we know its a tenant, not a system connection.
+    // Fall-back to the default tenant in this case.
+    for (auto&& connection_prefix : _connection_types_prefix) {
+        if (isolation_cookie.find(connection_prefix.data()) == 0) {


Why not isolation_cookie == connection_prefix?

The find() thing needs deciphering. It's not that complicated, but "==" is much simper, no?

I doesn't work. isolation_cookie will be something like statement:default for the case this patch wants to solve.

Yes, it even says "prefix".

…tond Dénes On connection setup, the isolation cookie of the connection is matched to the appropriate scheduling group. This is achieved by iterating over the known statement tenant connection types as well as the system connections and choosing the one with a matching name. If a match is not found, it is assumed that the cluster is upgraded and the remote node has a scheduling group the local one doesn't have. To avoid demoting a scheduling group of unknown importance, in this case the default scheduling group is chosen. This is problematic when upgrading an OSS cluster to an enterprise version, as the scheduling groups of the enterprise service-levels will match none of the statement tenants and will hence fall-back to the default scheduling group. As a consequence, while the cluster is mixed, user workload on old (OSS) nodes, will be executed under the system scheduling group and concurrency semaphore. Not only does this mean that user workloads are directly competing for resources with system ones, but the two workloads are now sharing the semaphore too, reducing the available throughput. This usually manifests in queries timing out on the old (OSS) nodes in the cluster. This PR proposes to fix this, by recognizing that the unknown scheduling group is in fact a tenant this node doesn't know yet, and matching it with the default statement tenant. With this, order should be restored, with service-level connections being recognized as user connections and being executed in the statement scheduling group and the statement (user) concurrency semaphore. I tested this manually, by creating a cluster of 2 OSS nodes, then upgrading one of the nodes to enterprise and verifying (with extra logging) that service level connections are matched to the default statement tenant after the PR and they indeed match to the default scheduling group before. Fixes: #13841 Fixes: #12552 Closes #13843 * github.com:scylladb/scylladb: message: match unknown tenants to the default tenant message: generalize per-tenant connection types

…tond Dénes On connection setup, the isolation cookie of the connection is matched to the appropriate scheduling group. This is achieved by iterating over the known statement tenant connection types as well as the system connections and choosing the one with a matching name. If a match is not found, it is assumed that the cluster is upgraded and the remote node has a scheduling group the local one doesn't have. To avoid demoting a scheduling group of unknown importance, in this case the default scheduling group is chosen. This is problematic when upgrading an OSS cluster to an enterprise version, as the scheduling groups of the enterprise service-levels will match none of the statement tenants and will hence fall-back to the default scheduling group. As a consequence, while the cluster is mixed, user workload on old (OSS) nodes, will be executed under the system scheduling group and concurrency semaphore. Not only does this mean that user workloads are directly competing for resources with system ones, but the two workloads are now sharing the semaphore too, reducing the available throughput. This usually manifests in queries timing out on the old (OSS) nodes in the cluster. This PR proposes to fix this, by recognizing that the unknown scheduling group is in fact a tenant this node doesn't know yet, and matching it with the default statement tenant. With this, order should be restored, with service-level connections being recognized as user connections and being executed in the statement scheduling group and the statement (user) concurrency semaphore. I tested this manually, by creating a cluster of 2 OSS nodes, then upgrading one of the nodes to enterprise and verifying (with extra logging) that service level connections are matched to the default statement tenant after the PR and they indeed match to the default scheduling group before. Fixes: #13841 Fixes: #12552 Closes #13843 * github.com:scylladb/scylladb: message: match unknown tenants to the default tenant message: generalize per-tenant connection types (cherry picked from commit a7c2c9f)

denesb added 2 commits May 10, 2023 04:28

denesb mentioned this pull request May 10, 2023

Read query times out with reader_concurrency_semaphore during rolling upgrade on a mixed cluster #12552

Closed

2 tasks

denesb requested review from eliransin and avikivity May 10, 2023 09:11

avikivity reviewed May 21, 2023

View reviewed changes

scylladb-promoter merged commit a7c2c9f into scylladb:master May 23, 2023
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

message: match unknown tenants to the default tenant #13843

message: match unknown tenants to the default tenant #13843

denesb commented May 10, 2023 •

edited

scylladb-promoter commented May 10, 2023

denesb commented May 10, 2023

scylladb-promoter commented May 17, 2023

denesb commented May 17, 2023

denesb commented May 18, 2023

scylladb-promoter commented May 18, 2023

avikivity May 21, 2023

denesb May 22, 2023

avikivity May 22, 2023

message: match unknown tenants to the default tenant #13843

message: match unknown tenants to the default tenant #13843

Conversation

denesb commented May 10, 2023 • edited

scylladb-promoter commented May 10, 2023

denesb commented May 10, 2023

scylladb-promoter commented May 17, 2023

denesb commented May 17, 2023

denesb commented May 18, 2023

scylladb-promoter commented May 18, 2023

avikivity May 21, 2023

Choose a reason for hiding this comment

denesb May 22, 2023

Choose a reason for hiding this comment

avikivity May 22, 2023

Choose a reason for hiding this comment

denesb commented May 10, 2023 •

edited