Skip to content
Switch branches/tags
Go to file
Cannot retrieve contributors at this time

As use-case we will use the Mercury Demo website. Mercury is a Sitecore Commerce accelerator ( and there is a Dockerized demonstration environment available on request from Having a fully Dockerized Sitecore Commerce webshop allows us to quickly get up and running in Kubernetes.

Start the Mercury demo in Kubernetes using:

PS> kubectl apply -f ./scale

Remove the xc9 deployment before running this example

Scale application

You can scale cluster nodes and pods.

To enable Node scaling (full details here), add the --enable-cluster-autoscaler and minimum and maximum number of nodes to your cluster create or update command, e.g:

--enable-cluster-autoscaler --min-count 1 --max-count 5

The cluster autoscaler has many more parameters, see here Run EnableNodeScaling.ps1 to add auto-scaling to a running cluster.

To enable Pod scaling (full details here and tutorial), specify resources per container in a Pod, e.g:

  requests: # Minimum
     cpu: 250m
  limits: # Maximum
     cpu: 250m

Above states that the container requests at least and at most 250 milli CPU (i.e. 25% CPU).

Next add a HorizontalPodAutoscaler, for example:

apiVersion: autoscaling/v2beta2
kind: HorizontalPodAutoscaler
  name: commerce
  namespace: default
    apiVersion: apps/v1
    kind: Deployment
    name: commerce
  minReplicas: 1
  maxReplicas: 3
  - type: Resource
      name: cpu
        type: Utilization
        averageUtilization: 50 # Target is 50% of the requested CPU accros all pods, i.e. 50% of 250 milli CPU in this case

Above spec requests to add a Pod when the CPU load for a single Pod, which is the minimum and initial situation, goes above 50% of the requested 250 milli CPU.

See here for full details.

Trigger load

To trigger some load we will use K6 Loadimpact.

In loadtest.js a single product is added to the cart.

To run this cart addition for 30 seconds and 100 virtual users do:

PS> docker run -i loadimpact/k6 run -u 100 -d 30s -< loadtest.js

Monitor the number of Pod replicas by:

PS> kubectl get pod -w

Inspect the Horizontal Pod Autoscaler:

PS> kubectl get hpa

and a full description of its actions:

PS> kubectl describe hpa

The autoscaler by default queries every 15 seconds for metrics. When needed it will upscale the number of Pods. Downscaling Pods will happen after a (default) cool down period of 5 minutes.