Proposal: Subnet Scarcity #1635

rbtr · 2022-09-28T22:28:46Z

Reason for Change:

This starts the design train for the Subnet Scarcity feature, a solution to the issue raised in #1605.

Namely: in SWIFT, due to the batch-wise scaling of the IP Pool, CNS reserves an overhead of IPs from the Subnet on every Node. This can lead to artificial Subnet IP Exhaustion where there are insufficient unreserved IPs left in the Subnet for Nodes to join the cluster or for Pods to schedule, even when the real total Pod IP usage is less than the total Subnet IP Capacity.

Issue Fixed:

Requirements:

uses conventional commit messages
includes documentation
adds unit tests

Notes:

rbtr · 2022-09-28T22:31:24Z

docs/feature/subnet-scarcity/phase-1/1-subnetstate.md

+        '400':
+          description: bad input parameter


@rsagasthya can you fill in the missing response types and codes of this API as it has been written?

The response codes are 400 for invalid networkId or subnetName, 500 in cases of error in retrieving the cache from controller. Success is 200.

nairashu · 2022-09-29T00:23:09Z

docs/feature/subnet-scarcity/proposal.md

+When a Pod is created, the CNI will call with a request to assign an IP. If CNS is out of IPs and cannot honor that request, the CNI will return an error to the CRI, which will follow up by tearing down that Pod sandbox and starting over. Because of this stateless retrying, CNS can only reliable understand that it needs _at least one more_ IP, because it is impossible to tell if subsequent requests are retries for the same Pod, or many different Pods. If _many_ Pods have been scheduled, CNS will still only request a single additional batch of IPs, and assign those IPs one at a time until it runs out, then request a single additional batch of IPs...
+
+A more predictive method of IP Pool scaling will be added to CNS: CNS will watch Pods for its Node, and will request/release IPs immediately based on the number of Pods scheduled. The Batching behavior will be unchanged, and CNS will continue to request IPs in Batches $B$ based on the local IP usage.
+


Add in handling of the race condition that Ramiro brought up on Rahul's PR

noted, I have covered this in my local working draft and will include it in the next PR addition

Signed-off-by: Evan Baker <rbtr@users.noreply.github.com>

Signed-off-by: GitHub <noreply@github.com>

* stub docs/design for design proposals Signed-off-by: Evan Baker <rbtr@users.noreply.github.com> * feature proposal: subnet scarcity phase 1 Signed-off-by: Evan Baker <rbtr@users.noreply.github.com> * feature proposal: subnet scarcity phase 2 Signed-off-by: GitHub <noreply@github.com> * feature proposal: subnet scarcity phase 3 Signed-off-by: GitHub <noreply@github.com> Signed-off-by: Evan Baker <rbtr@users.noreply.github.com> Signed-off-by: GitHub <noreply@github.com>

rbtr added enhancement cns Related to CNS. docs Documentation only labels Sep 28, 2022

rbtr requested review from nairashu, neaggarwMS, rsagasthya and thatmattlong September 28, 2022 22:28

rbtr self-assigned this Sep 28, 2022

rbtr force-pushed the proposal/subnet-scarcity branch from 851e6e1 to b21e83f Compare September 28, 2022 22:34

rbtr commented Sep 28, 2022

View reviewed changes

nairashu reviewed Sep 29, 2022

View reviewed changes

nairashu previously approved these changes Sep 29, 2022

View reviewed changes

rbtr added 4 commits September 29, 2022 17:16

stub docs/design for design proposals

5d3fe5f

Signed-off-by: Evan Baker <rbtr@users.noreply.github.com>

feature proposal: subnet scarcity phase 1

f7e0e00

Signed-off-by: Evan Baker <rbtr@users.noreply.github.com>

feature proposal: subnet scarcity phase 2

ebdbdb6

Signed-off-by: GitHub <noreply@github.com>

feature proposal: subnet scarcity phase 3

d05709d

Signed-off-by: GitHub <noreply@github.com>

rbtr dismissed nairashu’s stale review via d05709d September 29, 2022 17:16

rbtr force-pushed the proposal/subnet-scarcity branch from b21e83f to d05709d Compare September 29, 2022 17:16

rsagasthya approved these changes Sep 29, 2022

View reviewed changes

rbtr merged commit 50771ed into Azure:master Sep 29, 2022

rbtr deleted the proposal/subnet-scarcity branch September 29, 2022 22:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Proposal: Subnet Scarcity #1635

Proposal: Subnet Scarcity #1635

Uh oh!

rbtr commented Sep 28, 2022

Uh oh!

rbtr Sep 28, 2022

Uh oh!

rsagasthya Sep 29, 2022

Uh oh!

nairashu Sep 29, 2022

Uh oh!

rbtr Sep 29, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		When a Pod is created, the CNI will call with a request to assign an IP. If CNS is out of IPs and cannot honor that request, the CNI will return an error to the CRI, which will follow up by tearing down that Pod sandbox and starting over. Because of this stateless retrying, CNS can only reliable understand that it needs _at least one more_ IP, because it is impossible to tell if subsequent requests are retries for the same Pod, or many different Pods. If _many_ Pods have been scheduled, CNS will still only request a single additional batch of IPs, and assign those IPs one at a time until it runs out, then request a single additional batch of IPs...

		A more predictive method of IP Pool scaling will be added to CNS: CNS will watch Pods for its Node, and will request/release IPs immediately based on the number of Pods scheduled. The Batching behavior will be unchanged, and CNS will continue to request IPs in Batches $B$ based on the local IP usage.

Proposal: Subnet Scarcity #1635

Proposal: Subnet Scarcity #1635

Uh oh!

Conversation

rbtr commented Sep 28, 2022

Uh oh!

rbtr Sep 28, 2022

Choose a reason for hiding this comment

Uh oh!

rsagasthya Sep 29, 2022

Choose a reason for hiding this comment

Uh oh!

nairashu Sep 29, 2022

Choose a reason for hiding this comment

Uh oh!

rbtr Sep 29, 2022

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants