Scale and Performance testing for 1.26 k8s clusters #711

Closed
vivek-shilimkar opened this issue Apr 24, 2023 · 2 comments
vivek-shilimkar commented Apr 24, 2023

We need to run performance testing against large clusters to see how Rancher performs at scale.

Specific use cases:

  • Local and downstream 1.26 RKE1 and RKE2 clusters - individual cluster performance.
  • A large number of downstream RKE1/RKE2 Rancher-provisioned clusters.

Individual Cluster Performance

Rancher Cluster Config:

Rancher version: latest 2.7.5-rc1
Rancher cluster type: RKE1/RKE2 HA (3 all-roles nodes, 1 worker node tainted to run rancher-monitoring; see the taint sketch below)
Cluster Size: Medium
Docker version: 20.10
K8s version: v1.26.4-rancher2-1/v1.26.4+rke2r1
Log level: debug
cert-manager version: 1.11.0
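
A hedged sketch of the monitoring-node taint, assuming the kubernetes Python client with kubeconfig access to the local cluster; the node name and taint key/value are illustrative assumptions, and rancher-monitoring would then be installed with a matching toleration and node selector.

```python
# Taint the dedicated worker so only pods that tolerate the taint (i.e.
# rancher-monitoring, once configured with a matching toleration) land there.
# Node name and taint key/value are assumptions, not taken from this issue.
from kubernetes import client, config

config.load_kube_config()  # local (Rancher) cluster kubeconfig
v1 = client.CoreV1Api()

node_name = "worker-monitoring"  # hypothetical dedicated worker
taint = client.V1Taint(key="monitoring", value="true", effect="NoSchedule")

node = v1.read_node(node_name)
node.spec.taints = (node.spec.taints or []) + [taint]
v1.patch_node(node_name, node)
```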

Downstream Cluster Config:

Downstream cluster types: HA RKE1, RKE2, K3s (3 all-roles nodes)
Installed apps: N/A
Deployments: ranchertest/mytestcontainer
rke1 version = v1.26.4-rancher2-1
rke2 version = v1.26.4+rke2r1
k3s version = v1.26.4+k3s1

Method:

  1. Set up an HA Local (k8s version) cluster with Rancher
    1. Install monitoring
    2. Install other workloads/Apps if desired (SKIPPED)
  2. Provision HA Downstream (k8s version) clusters (1 of each RKE1, RKE2, K3s)
  3. Load each downstream cluster with a set of "objects" (see the loading sketch after this list)
    1. 100 secrets
    2. 10 projects
    3. 12 namespaces
    4. 300 users (users are created in the Local cluster)
    5. 1 project Role Template
    6. 300 project Role Template bindings
  4. Deploy desired pods onto each downstream cluster
    1. Deploy ranchertest/mytestcontainer until reaching the 330-pod limit
    2. Collect CPU and memory profile logs as scaling progresses
    3. Collect Rancher pod logs as scaling progresses
  5. Collect rancher-monitoring metrics artifacts
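
A minimal sketch of the cluster-side loading from steps 3 and 4, assuming the kubernetes Python client pointed at a downstream cluster's kubeconfig. Counts mirror the list above; Rancher-side objects (projects, users, role templates and bindings) go through the Rancher API instead and are omitted here. All names and the kubeconfig path are hypothetical.

```python
# Bulk-create namespaces and secrets (step 3), then deploy
# ranchertest/mytestcontainer and scale it toward the pod limit (step 4).
from kubernetes import client, config

config.load_kube_config(config_file="downstream.yaml")  # hypothetical path
core = client.CoreV1Api()
apps = client.AppsV1Api()

# Step 3: 12 namespaces and 100 secrets (placed in "default" for simplicity).
for i in range(12):
    core.create_namespace(
        client.V1Namespace(metadata=client.V1ObjectMeta(name=f"scale-ns-{i}")))
for i in range(100):
    core.create_namespaced_secret("default", client.V1Secret(
        metadata=client.V1ObjectMeta(name=f"scale-secret-{i}"),
        string_data={"key": f"value-{i}"}))

# Step 4: a single deployment, scaled up to approach the 330-pod limit.
labels = {"app": "mytestcontainer"}
deployment = client.V1Deployment(
    metadata=client.V1ObjectMeta(name="mytestcontainer"),
    spec=client.V1DeploymentSpec(
        replicas=1,
        selector=client.V1LabelSelector(match_labels=labels),
        template=client.V1PodTemplateSpec(
            metadata=client.V1ObjectMeta(labels=labels),
            spec=client.V1PodSpec(containers=[client.V1Container(
                name="mytestcontainer", image="ranchertest/mytestcontainer")]))))
apps.create_namespaced_deployment("default", deployment)
apps.patch_namespaced_deployment_scale(
    "mytestcontainer", "default", {"spec": {"replicas": 300}})
```

The profile and log collection in steps 4.2 and 4.3 runs against the Rancher pods in the local cluster's cattle-system namespace while this scale-up progresses.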

Scalability

Rancher Cluster Config:

Rancher version: latest 2.7.5-rc1
Rancher cluster type: RKE1/RKE2 HA (3 all-roles nodes, 1 worker node tainted to run rancher-monitoring)
Cluster Size: Medium
Docker version: 20.10
K8s version: v1.26.4-rancher2-1/v1.26.4+rke2r1
Log level: debug
cert-manager version: 1.11.0

Downstream Cluster Config:

Downstream cluster types: RKE1/RKE2 (1 all-roles node)
Installed apps: N/A
Deployments: N/A
rke1 version = v1.26.4-rancher2-1
rke2 version = v1.26.4+rke2r1

Method:

  1. Set up an HA Local (k8s version) cluster with Rancher
    1. Install monitoring
    2. Install other workloads/Apps if desired (SKIPPED)
  2. Let the cluster idle for 30 minutes
  3. Provision Downstream (k8s version) clusters (RKE1/RKE2)
    • Scale to half the documented cluster limit
  4. Collect rancher-monitoring metrics artifacts and review for unexpected behavior (see the collection sketch after this list)
  5. Provision Downstream (k8s version) clusters (RKE1/RKE2)
    • Scale to the full documented cluster limit
  6. Collect rancher-monitoring metrics artifacts and review for unexpected behavior
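
A hedged sketch of the metrics collection in steps 4 and 6, assuming a port-forward to Prometheus on the local cluster (e.g. `kubectl -n cattle-monitoring-system port-forward svc/rancher-monitoring-prometheus 9090`, per the rancher-monitoring chart's defaults as I understand them); the queries are illustrative, not the exact artifacts gathered for this issue.

```python
# Snapshot a few Prometheus queries to JSON files so they can be attached
# as artifacts. Assumes a running port-forward to Prometheus on :9090.
import json
import requests

PROM_URL = "http://localhost:9090/api/v1/query"
queries = {
    "rancher_cpu": 'rate(container_cpu_usage_seconds_total{namespace="cattle-system"}[5m])',
    "rancher_mem": 'container_memory_working_set_bytes{namespace="cattle-system"}',
}
for name, query in queries.items():
    resp = requests.get(PROM_URL, params={"query": query}, timeout=30)
    resp.raise_for_status()
    with open(f"{name}.json", "w") as out:
        json.dump(resp.json(), out, indent=2)
```
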
git-ival commented

This was blocked by the lack of PSACT support in the rancher2 Terraform provider; see rancher/terraform-provider-rancher2#1112 and rancher/terraform-provider-rancher2#908.

Unblocked by creating a PR for V1 clusters, then combining the following PRs and building the provider locally:

Testing is underway; RKE1 is complete.

git-ival commented Jun 6, 2023

Results have been logged internally.
