iptables proxy reorg in preparation for minimizing iptables-restore #110266

danwinship · 2022-05-28T17:34:10Z

What type of PR is this?

/kind cleanup

What this PR does / why we need it:

This just moves code around in pkg/proxy/iptables/proxier.go with no real change to the logic of what rules we output. But the end result is that the loop in syncProxyRules is broken up into 3 phases:

1. figure out what chains will be needed for this servicePort, and mark them in activeNATChains
2. write rules to KUBE-SERVICES / KUBE-EXTERNAL-SERVICES / KUBE-NODEPORTS jumping to the servicePort-specific chains
3. write the servicePort-specific chains (KUBE-SVC-*, KUBE-SVL-*, KUBE-EXT-*, KUBE-FW-*, KUBE-SEP-*)
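
For readers who have not seen the loop, the three phases can be sketched roughly as follows. This is a simplified illustration, not the code in pkg/proxy/iptables/proxier.go: the `servicePort` struct, `chainName`, and `syncSketch` are hypothetical names, the chain names are not hashed the way the real ones are, and the rule text omits the load-balancing and masquerade rules that the real proxier writes.

```go
package main

import (
	"fmt"
	"strings"
)

// servicePort is a hypothetical, simplified stand-in for the proxier's
// per-port service information.
type servicePort struct {
	name      string   // e.g. "ns/svc:http"
	clusterIP string   // cluster IP of the service
	port      int      // service port
	endpoints []string // "ip:port" of each endpoint
}

// chainName is a hypothetical helper. The real proxier derives chain names
// from a hash; here the key is just made safe for display.
func chainName(prefix, key string) string {
	safe := strings.NewReplacer("/", "-", ":", "-", ".", "-").Replace(key)
	return prefix + strings.ToUpper(safe)
}

// syncSketch walks every servicePort and runs the three phases in order.
// It returns the set of chains marked active and the rules written.
func syncSketch(ports []servicePort) (active map[string]bool, rules []string) {
	active = map[string]bool{}
	for _, sp := range ports {
		// Phase 1: figure out which chains are needed and mark them active.
		svcChain := chainName("KUBE-SVC-", sp.name)
		active[svcChain] = true
		sepChains := make([]string, 0, len(sp.endpoints))
		for _, ep := range sp.endpoints {
			c := chainName("KUBE-SEP-", ep)
			active[c] = true
			sepChains = append(sepChains, c)
		}

		// Phase 2: write the jump rule from the shared chain.
		rules = append(rules, fmt.Sprintf(
			"-A KUBE-SERVICES -d %s/32 -p tcp --dport %d -j %s",
			sp.clusterIP, sp.port, svcChain))

		// Phase 3: write the servicePort-specific chains.
		for i, c := range sepChains {
			rules = append(rules, fmt.Sprintf("-A %s -j %s", svcChain, c))
			rules = append(rules, fmt.Sprintf(
				"-A %s -p tcp -j DNAT --to-destination %s", c, sp.endpoints[i]))
		}
	}
	return active, rules
}
```

Keeping phase 3 as a separate block at the bottom of the loop body is what makes it possible to skip it for a given servicePort without touching phases 1 and 2.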

A follow-up PR (#110268) will then skip the third step when the servicePort in question hasn't changed since the last sync, so as to minimize the input to iptables-restore.
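
As a rough sketch of that follow-up idea (not the code from #110268), the third phase could consult a set of changed servicePorts and emit nothing for the rest. The `portChains` type, the `changed` set, and `writeChangedChains` are hypothetical names, and the sketch assumes iptables-restore is invoked in a mode where chains absent from the input are left as they were.

```go
package main

import "fmt"

// portChains is a hypothetical record of one servicePort's own chain
// and the endpoints that chain should send traffic to.
type portChains struct {
	name      string   // servicePort name, used to look up "changed"
	chain     string   // the servicePort-specific chain
	endpoints []string // "ip:port" of each endpoint
}

// writeChangedChains emits phase-3 output only for servicePorts that are
// marked as changed. Unchanged servicePorts contribute no lines at all.
func writeChangedChains(ports []portChains, changed map[string]bool) []string {
	var out []string
	for _, p := range ports {
		if !changed[p.name] {
			continue // nothing to rewrite for this servicePort
		}
		// Declaring the chain in iptables-restore input resets its contents,
		// so the rules that follow fully replace the old ones.
		out = append(out, fmt.Sprintf(":%s - [0:0]", p.chain))
		for _, ep := range p.endpoints {
			out = append(out, fmt.Sprintf(
				"-A %s -p tcp -j DNAT --to-destination %s", p.chain, ep))
		}
	}
	return out
}
```

With two servicePorts of which only one changed, the output contains the chain declaration and rules for that one servicePort only, so the input grows with the amount of change rather than with the total number of services.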

Which issue(s) this PR fixes:

none

Does this PR introduce a user-facing change?

NONE

/sig network
/priority important-longterm

k8s-ci-robot · 2022-05-28T17:34:17Z

@danwinship: This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

danwinship · 2022-06-06T12:58:48Z

/retest-required

We figure out early on whether we're going to end up outputting no endpoints, so update the metrics then. (Also remove a redundant feature gate check; svcInfo already checks the ServiceInternalTrafficPolicy feature gate itself and so svcInfo.InternalPolicyLocal() will always return false if the gate is not enabled.)
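
To illustrate why the extra feature gate check is redundant, here is a minimal sketch. Only the method name `InternalPolicyLocal` comes from the commit message; the `svcInfoSketch` type, its fields, and the two helper functions are hypothetical, and the real svcInfo may well apply the gate at a different point than this sketch does.

```go
package main

// svcInfoSketch is a hypothetical stand-in for svcInfo. It is assumed to
// already account for the feature gate when reporting the policy.
type svcInfoSketch struct {
	gateEnabled       bool // whether ServiceInternalTrafficPolicy is enabled
	internalLocalSpec bool // whether the Service asks for a Local internal policy
}

// InternalPolicyLocal returns false whenever the gate is disabled.
func (s svcInfoSketch) InternalPolicyLocal() bool {
	return s.gateEnabled && s.internalLocalSpec
}

// withOuterCheck mirrors the old shape: the caller checks the gate again.
func withOuterCheck(gateEnabled bool, s svcInfoSketch) bool {
	return gateEnabled && s.InternalPolicyLocal()
}

// withoutOuterCheck mirrors the new shape: the caller trusts the method.
func withoutOuterCheck(s svcInfoSketch) bool {
	return s.InternalPolicyLocal()
}
```

Under that assumption the two helpers agree for every combination of gate state and Service policy, so the outer check can be dropped without changing behavior.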

Part of reorganizing the syncProxyRules loop to do:

1. figure out what chains are needed, mark them in activeNATChains
2. write servicePort jump rules to KUBE-SERVICES/KUBE-NODEPORTS
3. write servicePort-specific chains (SVC, SVL, EXT, FW, SEP)

This fixes the handling of the endpoint chains. Previously they were handled entirely at the top of the loop. Now we record which ones are in use at the top but don't create them and fill them in until the bottom.

Part of reorganizing the syncProxyRules loop to do:

1. figure out what chains are needed, mark them in activeNATChains
2. write servicePort jump rules to KUBE-SERVICES/KUBE-NODEPORTS
3. write servicePort-specific chains (SVC, SVL, EXT, FW, SEP)

This fixes the handling of the SVC and SVL chains. We were already filling them in at the end of the loop; this fixes it to create them at the bottom of the loop as well.

Part of reorganizing the syncProxyRules loop to do:

1. figure out what chains are needed, mark them in activeNATChains
2. write servicePort jump rules to KUBE-SERVICES/KUBE-NODEPORTS
3. write servicePort-specific chains (SVC, SVL, EXT, FW, SEP)

This fixes the handling of the EXT chain.

Part of reorganizing the syncProxyRules loop to do:

1. figure out what chains are needed, mark them in activeNATChains
2. write servicePort jump rules to KUBE-SERVICES/KUBE-NODEPORTS
3. write servicePort-specific chains (SVC, SVL, EXT, FW, SEP)

This fixes the jump rules for internal traffic. Previously we were handling "jumping from kubeServices to internalTrafficChain" and "adding masquerade rules to internalTrafficChain" in the same place.

Part of reorganizing the syncProxyRules loop to do:

1. figure out what chains are needed, mark them in activeNATChains
2. write servicePort jump rules to KUBE-SERVICES/KUBE-NODEPORTS
3. write servicePort-specific chains (SVC, SVL, EXT, FW, SEP)

This moves the FW chain creation to the end (rather than having it in the middle of adding the jump rules for the LB IPs).

danwinship · 2022-07-19T10:41:47Z

/retest-required

thockin

args used to be an optimization - I wonder if we have accidentally pessimized by breaking apart blocks that used to share one args slice?

Overall I see what this does and why it makes the next step possible. I find it a little harder to follow, personally, but I might just be too close to the current code. There are so many variables that are set in one place and then used hundreds of lines away.

We should look for ways to make this easier to read, still.

Thanks for good, self-contained commits. Exemplary.

/lgtm
/approve

k8s-ci-robot · 2022-07-27T00:06:33Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: danwinship, thockin

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~pkg/proxy/OWNERS~~ [danwinship,thockin]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-triage-robot · 2022-07-27T03:38:16Z

The Kubernetes project has merge-blocking tests that are currently too flaky to consistently pass.

This bot retests PRs for certain kubernetes repos according to the following rules:

The PR does not have any do-not-merge/* labels
The PR does not have the needs-ok-to-test label
The PR is mergeable (does not have a needs-rebase label)
The PR is approved (has cncf-cla: yes, lgtm, approved labels)
The PR is failing tests required for merge

You can:

Review the full test history for this PR
Prevent this bot from retesting with /lgtm cancel or /hold
Help make our tests less flaky by following our Flaky Tests Guide

/retest

k8s-triage-robot · 2022-07-27T06:05:17Z

/retest

k8s-triage-robot · 2022-07-27T08:32:17Z

/retest

k8s-triage-robot · 2022-07-27T10:54:17Z

/retest

danwinship · 2022-07-27T17:29:02Z

> args used to be an optimization - I wonder if we have accidentally pessimized by breaking apart blocks that used to share one args slice?

Yeah, this is why I filed #109481.
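
For context, the kind of optimization the quoted comment seems to describe is reusing one slice's backing array across many rules rather than allocating a fresh slice per rule. The following is a generic Go sketch of that pattern, not the proxier's code and not a description of what #109481 proposes; `buildRules` and the rule text are hypothetical.

```go
package main

import "strings"

// buildRules writes one rule per destination while reusing a single args
// slice. Truncating with args[:0] keeps the backing array, so appending the
// next rule's arguments does not allocate as long as capacity is sufficient.
func buildRules(chain string, dests []string) []string {
	args := make([]string, 0, 16) // shared scratch space for rule arguments
	rules := make([]string, 0, len(dests))
	for _, d := range dests {
		args = append(args[:0], "-A", chain, "-d", d, "-j", "ACCEPT")
		// strings.Join copies the contents into a new string, so later
		// reuse of args cannot alter rules that were already written.
		rules = append(rules, strings.Join(args, " "))
	}
	return rules
}
```

The saving only holds while the blocks that write rules share the same slice; splitting them apart so that each builds its own slice gives up the reuse, which is the concern raised in the review.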

k8s-ci-robot added the needs-triage label May 28, 2022

k8s-ci-robot requested review from dcbw and freehan May 28, 2022 17:34

k8s-ci-robot added the approved label May 28, 2022

danwinship mentioned this pull request May 28, 2022

minimize iptables-restore input #110268

Merged

danwinship force-pushed the minimize-prep-reorg branch from 3e0cfe6 to cfecfb4 Compare May 30, 2022 12:53

danwinship force-pushed the minimize-prep-reorg branch from cfecfb4 to 15953b3 Compare June 10, 2022 22:40

danwinship force-pushed the minimize-prep-reorg branch 2 times, most recently from 895624c to 81c55fb Compare June 29, 2022 20:46

k8s-ci-robot added the needs-rebase label Jul 7, 2022

danwinship added 6 commits July 9, 2022 06:50

danwinship force-pushed the minimize-prep-reorg branch from 81c55fb to 367f18c Compare July 9, 2022 11:12

k8s-ci-robot removed the needs-rebase label Jul 9, 2022

thockin reviewed Jul 27, 2022

k8s-ci-robot assigned thockin Jul 27, 2022

k8s-ci-robot added the lgtm label Jul 27, 2022

k8s-ci-robot merged commit ce433f8 into kubernetes:master Jul 27, 2022

k8s-ci-robot added this to the v1.25 milestone Jul 27, 2022

danwinship deleted the minimize-prep-reorg branch July 27, 2022 17:28

danwinship mentioned this pull request Oct 4, 2022

Minimizing iptables-restore input size kubernetes/enhancements#3454

Merged

danwinship mentioned this pull request Jul 7, 2023

fix sync_proxy_rules_iptables_total metric #119140

Merged

saiharshitachava mentioned this pull request Oct 9, 2023

kube-proxy constantly restarts with fatal: bad g in signal handler post 1.25 upgrade #121033

Closed

danwinship mentioned this pull request Nov 29, 2023

proxy chain creation cleanup #122111

Merged
