Implement Resource Consumer #11570

piosz · 2015-07-20T08:30:40Z

This is @socaa's intern project.

Background

We are working on autoscaling for Kubernetes in 3 dimensions:

cluster size autoscaling
horizontal autoscaling of pods: changing the size of the replication controller (WIP: auto-scaler proposal #2863)
vertical autoscaling of pods: changing their resource limits (Vertical pod auto-sizer #10782)

We need a tool that helps us write e2e tests for these kinds of autoscaling.

Resource Consumer

Resource Consumer will consist of two parts: a container that consumes resources and a library that manages this load.

Container

The container consumes a specified amount of resources (CPU/memory). It is a Docker container, uploaded to Google Container Registry, open sourced, and possibly written in Go (not a requirement, but strongly encouraged). The amount of resources to consume should be set and changed by sending an appropriate HTTP request to the container.

The interface should allow consuming a given amount of CPU/memory for a given period of time. It should consist of the following methods:

ConsumeCPU(millicores, duration_sec)
ConsumeMem(megabytes, duration_sec)
GetCurrentStatus()

Each HTTP request will be handled by spawning a new process that consumes the given amount of the resource and exits after the given timeout. The request cannot be cancelled.

Consuming CPU

Consuming a whole core is easy. The problem is how to consume only part of one. One possible way is to split time into small quanta, then perform a heavy compute operation during some percentage of the quanta and sleep during the rest. Figuring out the best way to consume CPU is part of the task.

Consuming memory

There are a few possible ways to handle memory consumption. We want to make sure the memory can be correctly freed. The one complication is that the container uses some memory itself, so this amount should be included in the calculations as well. Figuring out the best way to consume memory is part of the task.
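A minimal sketch of one way to consume memory in Go, assuming it is enough to allocate 1 MB chunks and write to every page so that the memory becomes resident. The function name `allocate` is hypothetical. Note that a garbage-collected runtime returns memory to the OS at its own discretion, which is one reason running the consumer as a separate process that simply exits is attractive for reliable freeing.

```go
package main

import (
	"fmt"
	"os"
	"runtime/debug"
)

const megabyte = 1 << 20

// allocate reserves the requested number of megabytes in 1 MB chunks and
// touches one byte per page so the pages are actually backed by memory.
// It does not account for the memory used by the process itself.
func allocate(megabytes int) [][]byte {
	if megabytes < 0 {
		megabytes = 0
	}
	pageSize := os.Getpagesize()
	chunks := make([][]byte, 0, megabytes)
	for i := 0; i < megabytes; i++ {
		chunk := make([]byte, megabyte)
		for off := 0; off < len(chunk); off += pageSize {
			chunk[off] = 1
		}
		chunks = append(chunks, chunk)
	}
	return chunks
}

func main() {
	chunks := allocate(16)
	fmt.Println("holding", len(chunks), "MB")

	// Drop the references and ask the runtime to hand memory back to the OS.
	// This is a request, not a guarantee.
	chunks = nil
	debug.FreeOSMemory()
	fmt.Println("released", len(chunks) == 0)
}
```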

Library

A client-side library written in Go will be linked with the e2e tests and should provide two kinds of methods: static consumption of resources and dynamic consumption.

Static consumption

This part of the library should allow consuming a constant amount of a resource over time:

ConsumeCPU(node, percentage)
ConsumeMem(node, percentage)
CreateConsumingPod(node) : PodID
ConsumeCPU(pod, percentageOfRequest)
ConsumeMem(pod, percentageOfRequest)
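The pod-level methods take a percentage of the pod's resource request, while the container interface takes absolute amounts, so the library has to convert between the two. A small sketch of that arithmetic, with a hypothetical helper name and integer rounding chosen for this example:

```go
package main

import "fmt"

// millicoresForPercentage converts a percentage of a pod's CPU request into
// an absolute number of millicores, rounding down. Percentages above 100 are
// allowed, since a test may want to exceed the request on purpose.
func millicoresForPercentage(requestMillicores, percentage int) int {
	if requestMillicores <= 0 || percentage <= 0 {
		return 0
	}
	return requestMillicores * percentage / 100
}

func main() {
	// A pod requesting 500 millicores, asked to consume 80% of its request.
	fmt.Println(millicoresForPercentage(500, 80)) // prints 400
}
```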

Dynamic consumption

This part of the library should allow creating a service on top of a replication controller and then requesting that all replicas together consume a given amount of resources, relying on service-level load balancing. It can be implemented by sending many consumption requests, each for a short period of time and a small amount of resources. The interface should consist of:

CreateConsumingService() : RcID
ConsumeCPU(rc, millicores)
ConsumeMem(rc, megabytes)
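The "many small requests" idea can be sketched as a function that splits a total amount into fixed-size pieces. Each piece would then be sent to the service as a separate short-lived consumption request, so the load balancer can spread them across replicas. The name `splitLoad` and the request size used below are assumptions for illustration only.

```go
package main

import "fmt"

// splitLoad breaks a total amount (millicores or megabytes) into requests of
// at most perRequest each. The last request carries the remainder.
func splitLoad(total, perRequest int) []int {
	if total <= 0 || perRequest <= 0 {
		return nil
	}
	parts := make([]int, 0, (total+perRequest-1)/perRequest)
	for remaining := total; remaining > 0; remaining -= perRequest {
		if remaining < perRequest {
			parts = append(parts, remaining)
		} else {
			parts = append(parts, perRequest)
		}
	}
	return parts
}

func main() {
	fmt.Println(splitLoad(250, 100)) // prints [100 100 50]
}
```

To keep the load constant, the caller would resend the whole batch each time the short request duration elapses.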

In both cases there might be an option to modify or stop consumption if needed.

Use cases

Cluster size autoscaling

consume more resources on each node than is specified for the autoscaler
observe that cluster size increased

Horizontal autoscaling of pod

create consuming RC and start consuming appropriate amount of resources
observe that RC has been resized
observe that usage on each replica decreased

Vertical autoscaling of pod

create consuming pod and start consuming appropriate amount of resources
observe that limits have been increased

Milestones

Dropped ideas

[optional] add possibility to specify initial consumption request
[optional] add a status page

Releases

| Date | Version | Image | Release notes |
| --- | --- | --- | --- |
| 8/14/15 | alpha | `gcr.io/google_containers/resource_consumer:alpha` | Support only for CPU consumption |
| 9/15/15 | beta | `gcr.io/google_containers/resource_consumer:beta` | Feature complete |
| 12/14/15 | beta2 | `gcr.io/google_containers/resource_consumer:beta2` | Added support for custom metrics |

piosz · 2015-07-20T08:30:59Z

cc @jszczepkowski @fgrzadkowski

wojtek-t · 2015-07-20T09:00:09Z

/cc me

bgrant0607 · 2015-08-01T00:01:50Z

cc @AnanyaKumar

vishh · 2015-08-01T00:16:22Z

/sub

derekwaynecarr · 2015-08-04T14:17:51Z

/sub

ramr · 2015-08-05T22:43:41Z

/sub

piosz · 2015-08-14T10:22:08Z

The alpha version is released as gcr.io/google_containers/resource_consumer:alpha. It supports consuming cpu only.

piosz · 2015-09-15T15:12:09Z

The beta version is released as gcr.io/google_containers/resource_consumer:beta. It supports consuming both cpu and memory.

cc @derekwaynecarr

piosz · 2015-10-05T09:50:37Z

Good job @socaa!

piosz added the sig/autoscaling label Jul 20, 2015

piosz self-assigned this Jul 20, 2015

piosz mentioned this issue Jul 22, 2015

Added cluster size autoscaling e2e test #11685

Merged

socaa mentioned this issue Jul 31, 2015

Added http API skeleton server. #12076

Merged

bgrant0607 mentioned this issue Aug 1, 2015

QoS proposal #11713

Merged

piosz mentioned this issue Aug 1, 2015

Added exporting autoscaling metrics in Heapster config #12103

Merged

socaa pushed a commit to socaa/kubernetes that referenced this issue Aug 4, 2015

Added http API skeleton server.

01798ba

Part of kubernetes#11570

socaa mentioned this issue Aug 5, 2015

Add docker file to Resource Consumer #12257

Merged

This was referenced Aug 11, 2015

Added consume cpu function to Resource Consumer #12517

Merged

Handling http POST requests added to Resource Consumer #12578

Merged

mbforbes added the priority/backlog and sig/cluster-lifecycle labels Aug 17, 2015

socaa mentioned this issue Aug 25, 2015

Changed Resource Consumer for correct parsing POST requests #13140

Merged

roberthbailey added the team/control-plane label Aug 27, 2015

socaa mentioned this issue Aug 27, 2015

Dynamic cpu consumption #13243

Merged

This was referenced Sep 7, 2015

Horizontal Pod Autoscaling e2e tests #13640

Merged

added possibility of memory consumption to Dockerfile #13736

Merged

Horizontal Pod Autoscaler is deleted along with namespace #13786

Merged

Resource Consumer Handler milicore changed to millicore #13788

Merged

This was referenced Sep 10, 2015

Memory consumption added to Resource Consumer #13789

Merged

Milicore to Millicore in autoscaling_utils.go #13793

Merged

This was referenced Sep 16, 2015

Memory dynamic consumption #14036

Merged

Memory Limit added to RCConfig in autoscaling_utils.go #14100

Merged

Static Consumption added to autoscaling_utils.go #14167

Merged

Use Resource Consumer for tests in autoscaling.go #14275

Merged

This was referenced Sep 23, 2015

WaitForService added to autoscaling_utils.go #14428

Merged

added README file to Resource Consumer #14645

Merged

piosz closed this as completed Oct 5, 2015
