
messaging: when upgrading OSS nodes to Enterprise, service-levels are matched to the default scheduling group #13841

Closed
denesb opened this issue May 10, 2023 · 4 comments · Fixed by #13843
denesb commented May 10, 2023

On connection setup, the connection's isolation cookie is matched to the appropriate scheduling group. This is done by iterating over the known statement tenant connection types, as well as the system connections, and choosing the one whose name matches the cookie.

If no match is found, it is assumed that the cluster is mid-upgrade and that the remote node has a scheduling group the local node doesn't have. To avoid demoting a connection of unknown importance, the default scheduling group is chosen in this case.
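
A rough illustration of this lookup is sketched below. All identifiers here are hypothetical, chosen for illustration only; the actual ScyllaDB code is structured differently.

```cpp
#include <string_view>
#include <vector>

// Hypothetical types for illustration; not the actual ScyllaDB identifiers.
struct connection_type {
    std::string_view name;    // name carried in the isolation cookie
    int scheduling_group;     // scheduling group to run the connection in
};

// Match the isolation cookie against the known statement tenant connection
// types and the system connections. If nothing matches, fall back to the
// default scheduling group, on the assumption that the remote node is from
// a newer version with a scheduling group this node does not know about.
int match_isolation_cookie(std::string_view cookie,
                           const std::vector<connection_type>& statement_tenants,
                           const std::vector<connection_type>& system_connections,
                           int default_scheduling_group) {
    for (const auto& ct : statement_tenants) {
        if (ct.name == cookie) {
            return ct.scheduling_group;
        }
    }
    for (const auto& ct : system_connections) {
        if (ct.name == cookie) {
            return ct.scheduling_group;
        }
    }
    // Unknown cookie: avoid demoting a workload of unknown importance.
    return default_scheduling_group;
}
```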

This is problematic when upgrading an OSS cluster to an Enterprise version: the scheduling groups of the Enterprise service levels will match none of the statement tenants and will hence fall back to the default scheduling group. As a consequence, while the cluster is mixed, user workload on the old (OSS) nodes will be executed under the system scheduling group and concurrency semaphore. Not only are user workloads then directly competing for resources with system ones, but the two workloads also share the semaphore, reducing the available throughput. This usually manifests as queries timing out on the old (OSS) nodes in the cluster.

@denesb denesb self-assigned this May 10, 2023
@mykaul mykaul added the Eng-3 label May 11, 2023
@mykaul mykaul modified the milestones: 5.x, 5.4 May 11, 2023
avikivity added a commit that referenced this issue May 22, 2023 (authored by Botond Dénes):

This PR proposes to fix this by recognizing that the unknown scheduling group in fact belongs to a tenant this node doesn't know yet, and matching it to the default statement tenant. With this, order should be restored: service-level connections are recognized as user connections and executed in the statement scheduling group, under the statement (user) concurrency semaphore.
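
A sketch of the proposed behavior, using the same hypothetical names as the sketch above (again, not the actual ScyllaDB code): an unmatched cookie is no longer sent to the default scheduling group but is treated as a statement tenant this node does not know yet and matched to the default statement tenant.

```cpp
#include <string_view>
#include <vector>

// Hypothetical types, as in the previous sketch.
struct connection_type {
    std::string_view name;
    int scheduling_group;
};

// After the fix: tenant and system connections are still matched by name,
// but an unknown cookie is assumed to belong to a statement tenant this
// node does not know yet (e.g. an Enterprise service level during a
// mixed-version upgrade) and is matched to the default statement tenant,
// so it runs as a user workload under the statement concurrency semaphore.
int match_isolation_cookie(std::string_view cookie,
                           const std::vector<connection_type>& statement_tenants,
                           const std::vector<connection_type>& system_connections,
                           const connection_type& default_statement_tenant) {
    for (const auto& ct : statement_tenants) {
        if (ct.name == cookie) {
            return ct.scheduling_group;
        }
    }
    for (const auto& ct : system_connections) {
        if (ct.name == cookie) {
            return ct.scheduling_group;
        }
    }
    return default_statement_tenant.scheduling_group;
}
```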

I tested this manually by creating a cluster of two OSS nodes, then upgrading one of them to Enterprise and verifying (with extra logging) that service-level connections are matched to the default statement tenant after the PR, whereas before the PR they indeed matched to the default scheduling group.

Fixes: #13841
Fixes: #12552

Closes #13843

* github.com:scylladb/scylladb:
  message: match unknown tenants to the default tenant
  message: generalize per-tenant connection types
kbr-scylla pushed a commit that referenced this issue May 22, 2023 (authored by Botond Dénes; same commit message as above).
mykaul commented Jul 12, 2023

I'm unsure why the backport candidate was removed here - @denesb?

denesb commented Jul 12, 2023

> I'm unsure why the backport candidate was removed here - @denesb?

The backport candidate label is on this issue, added by the bot, as it's supposed to do.

mykaul commented Jul 12, 2023

> I'm unsure why the backport candidate was removed here - @denesb?
>
> The backport candidate label is on this issue, added by the bot, as it's supposed to do.

err - you are right - wrong bug.

Need maintainers to backport I guess - it had the label set 1.5 months ago.

denesb pushed three commits that referenced this issue Jul 12, 2023, each cherry picked from commit a7c2c9f with the same commit message as above.
denesb commented Jul 12, 2023

Backported to 5.3, 5.2 and 5.1. No point in backporting to Enterprise releases; the fix only affects OSS.
