Pr/v1.12 backports #23260
Conversation
[ upstream commit 1db1156 ] With cilium/cilium-cli#962 in place in cilium-cli v0.12.0 and the CI update to use that version in #20617, the connectivity tests cover all functionality tested by the tests in l7_demos.go. Moreover, the cilium-cli connectivity tests can be run against arbitrary clusters with Cilium deployed, while this test is specific for the test setup based on vagrant VMs. Thus, drop this test. Signed-off-by: Tobias Klauser <tobias@cilium.io> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit bc14b15 ] [ Backporter's notes: ProxyRequestContext did not have the DataSource field in 1.12 so had to be dropped. ] A few changes are made here:
- `cilium_fqdn_semaphore_rejected_total` wasn't being updated correctly, due to an incorrect error check. This is fixed now.
- Based on the discussion [here](403790a#r936034590), the field `scope:datapathTime` in `cilium_proxy_upstream_reply_seconds` was split into two different scopes: `policyGenerationTime` (for updating the DNS caches and policy caches) and `datapathTime` (which includes the async policy map updates and the identity cache updates).
Signed-off-by: Rahul Joshi <rkjoshi@google.com> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit 7a6b4d9 ] Update the EKS helm install guide to patch the aws-node daemonset instead of deleting it prior to installation. That aligns with how the cilium cli install works and also the EKS tests. Also a minor style fix - remove the `` around aws-node because they don't get rendered properly in that section (i.e. they're written as-is). Suggested-by: Paul Chaignon <paul@cilium.io> Signed-off-by: Nikolay Aleksandrov <nikolay@isovalent.com> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit 82b9a8e ] Add a note about aws-node flushing Linux routing tables which could cause connectivity issues if Cilium is uninstalled through the cli. Also suggest that deleting aws-node daemonset prior to Cilium installation is recommended. Suggested-by: Paul Chaignon <paul@cilium.io> Signed-off-by: Nikolay Aleksandrov <nikolay@isovalent.com> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit 4bc5629 ] [ Backporter's notes: strange patch without any functional changes, and of course didn't apply cleanly. Not sure if this will work. ] Signed-off-by: Patrice Chalin <chalin@cncf.io> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit ae9dbd8 ] [ Backporter's notes: rev_nodeport_lb6 changed quite a bit, hope this change still makes sense. ] Pass the `monitor` feedback from the CT lookup to __encap_with_nodeid(). A previous commit already added this for rev_nodeport_lb4(), so aim for commonality here. Fixes: 428abc9 ("bpf: pipe forwarding reason into traces for TO_OVERLAY") Signed-off-by: Julian Wiedmann <jwi@isovalent.com> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit 1aa45b3 ] Agent monitor supports both consumers and listeners. Unlike consumers, a listener expects an event to be first JSON encoded, then binary encoded using Gob. Since the whole encoding operation may be expensive, we avoid that if there are no active listeners at the moment. Signed-off-by: Fabio Falzoi <fabio.falzoi@isovalent.com> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit a9abe31 ] Currently, the Log() method holds the LogRecord lock for the entire record operation. The record operation can be broken up into the following parts:
- add metadata to the record
- check if the notifier is set
- call NewProxyLogRecord on the notifier if not nil
The lock is needed both to copy metadata and to get the currently set notifier. Instead, the call to NewProxyLogRecord can be executed without holding the lock if the concrete type that satisfies the LogRecordNotifier interface supports concurrent notifications. Currently there is only one concrete implementation of that interface: the agent monitor. The agent monitor supports two kinds of clients: listeners and consumers. Since listeners expect the notifications to be first JSON marshaled and then Gob encoded, the log record operation is not negligible in terms of either time taken or computational resources. So, in case of:
- a very high rate of FQDN-related events
- low computational resources available for the pod
the queue of goroutines waiting on that lock may grow significantly. In that scenario, the total memory used by all the enqueued goroutines may grow unacceptably. Since the agent monitor already supports concurrent event handling, this commit shortens the critical section, allowing for concurrent execution of the actual log entry recording. Doing so lowers the number of goroutines waiting for the LogRecord lock in high-load/low-resource scenarios. Signed-off-by: Fabio Falzoi <fabio.falzoi@isovalent.com> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit 5c4776b ] The benchmarks evaluate the impact of the previous two commits on the events log notification performance. Specifically, the benchmarks mimic what happens in case of a high rate of DNS-related notifications and the consequent logging of those events, as done in the `notifyOnDNSMsg` callback. To better understand the scalability improvements, the benchmarks run with various numbers of concurrent notifications: from a single one up to 1000. The first benchmark, called LogNotifierWithNoListeners, shows the improvement that comes from commit `monitor: Do not marshal events without active listeners`, which avoids encoding notifications altogether if no listeners are currently active. As expected, the improvement is remarkable, since no (useless) encoding takes place at all. The second one, called LogNotifierWithListeners, shows the improvement that comes from commit `logger: Avoid holding the lock for the entire log operation`, where we release the lock earlier to enable concurrent encoding of log notifications. As expected, the performance improvement is more evident as the load increases, thanks to the higher level of concurrency. Below are the results of running these benchmarks without the two commits (shown as `old`) and with them (shown as `new`), compared with benchstat.
```
$ benchstat old.txt new.txt
name                                          old time/op    new time/op    delta
LogNotifierWithNoListeners/OneRecord-8        27.8µs ± 1%     2.6µs ± 3%    -90.49%  (p=0.008 n=5+5)
LogNotifierWithNoListeners/TenRecords-8        284µs ± 1%      34µs ±12%    -88.03%  (p=0.008 n=5+5)
LogNotifierWithNoListeners/HundredRecords-8   2.80ms ± 1%    0.21ms ± 4%    -92.65%  (p=0.008 n=5+5)
LogNotifierWithNoListeners/ThousandRecords-8  28.2ms ± 1%     1.8ms ± 1%    -93.63%  (p=0.016 n=5+4)
LogNotifierWithListeners/OneRecord-8          33.5µs ± 6%    48.6µs ± 2%    +45.13%  (p=0.008 n=5+5)
LogNotifierWithListeners/TenRecords-8          352µs ± 1%     330µs ± 1%     -6.30%  (p=0.016 n=5+4)
LogNotifierWithListeners/HundredRecords-8     3.47ms ± 2%    2.92ms ± 3%    -16.06%  (p=0.008 n=5+5)
LogNotifierWithListeners/ThousandRecords-8    35.3ms ± 1%    31.5ms ± 2%    -10.83%  (p=0.008 n=5+5)

name                                          old alloc/op   new alloc/op   delta
LogNotifierWithNoListeners/OneRecord-8        10.2kB ± 0%     0.9kB ± 0%         ~   (p=0.079 n=4+5)
LogNotifierWithNoListeners/TenRecords-8        102kB ± 0%       9kB ± 0%    -91.19%  (p=0.008 n=5+5)
LogNotifierWithNoListeners/HundredRecords-8   1.02MB ± 0%    0.09MB ± 0%    -91.20%  (p=0.008 n=5+5)
LogNotifierWithNoListeners/ThousandRecords-8  10.2MB ± 0%     0.9MB ± 0%    -91.25%  (p=0.008 n=5+5)
LogNotifierWithListeners/OneRecord-8          14.4kB ± 0%    14.4kB ± 0%     +0.05%  (p=0.024 n=5+5)
LogNotifierWithListeners/TenRecords-8          143kB ± 0%     123kB ± 0%    -13.92%  (p=0.008 n=5+5)
LogNotifierWithListeners/HundredRecords-8     1.43MB ± 0%    1.17MB ± 1%    -18.51%  (p=0.008 n=5+5)
LogNotifierWithListeners/ThousandRecords-8    14.3MB ± 0%    11.1MB ± 1%    -22.37%  (p=0.008 n=5+5)

name                                          old allocs/op  new allocs/op  delta
LogNotifierWithNoListeners/OneRecord-8           169 ± 0%        9 ± 0%     -94.67%  (p=0.008 n=5+5)
LogNotifierWithNoListeners/TenRecords-8        1.69k ± 0%     0.08k ± 0%    -95.20%  (p=0.008 n=5+5)
LogNotifierWithNoListeners/HundredRecords-8    16.9k ± 0%      0.8k ± 0%    -95.26%  (p=0.008 n=5+5)
LogNotifierWithNoListeners/ThousandRecords-8    169k ± 0%        8k ± 0%    -95.28%  (p=0.008 n=5+5)
LogNotifierWithListeners/OneRecord-8             196 ± 0%      196 ± 0%          ~   (all equal)
LogNotifierWithListeners/TenRecords-8          1.95k ± 0%     1.83k ± 0%     -6.53%  (p=0.016 n=4+5)
LogNotifierWithListeners/HundredRecords-8      19.5k ± 0%     17.8k ± 0%     -8.68%  (p=0.008 n=5+5)
LogNotifierWithListeners/ThousandRecords-8      195k ± 0%      175k ± 0%    -10.57%  (p=0.008 n=5+5)
```
Signed-off-by: Fabio Falzoi <fabio.falzoi@isovalent.com> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit 9f6fc7e ] We currently attempt to suppress tracing when handle_nat_fwd() returns, assuming that the inner function (ie. handle_nat_fwd_ipv*()) was inlined and already raised a trace message. But handle_nat_fwd() can return CTX_ACT_OK without even going down the IPv4 / IPv6 path, in which case we end up missing the TRACE_TO_NETWORK. So just clean up things for good by wiring up an alternative function for the inline config. Don't write trace messages from that function. Note that the tracing in to-overlay still looks completely confused, but that's a separate issue. Fixes: 322510d ("bpf: Add missing packet tracing for handle_nat_fwd") Signed-off-by: Julian Wiedmann <jwi@isovalent.com> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit baf7f34 ] Fredrik Björkman has reported that BPF L7 Ingress (and thus Gateway API) does not work when Cilium's XDP is enabled. A quick glimpse into nodeport_lb4/6 revealed that in the case of an L7 service, ctx_redirect_to_proxy_hairpin_ipv4/6() is called. The latter invokes ctx_redirect() to the cilium_host. Unfortunately, the redirect does not work when called from bpf_xdp, as cilium_host doesn't have any XDP prog attached. Fix this by offloading the L7 handling to bpf_host at the TC BPF ingress. Reported-by: Fredrik Björkman <fredrik.bjorkman1@ingka.ikea.com> Signed-off-by: Martynas Pumputis <m@lambda.lt> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit 91d52e4 ] Avoid operator crashing from trying to access a field of a nil CESTracker. This can happen rarely with two consecutive CEP deletions that belong to the same CES. While the first CES update is being processed, the second CEP deletion causes the CES in cache to become empty, and CES update gets enqueued anyway, before the first CES update is finished. The first CES update then deletes CES tracker as the CES has 0 CEPs in it. Finally, the second CES update runs into the error. Signed-off-by: Dorde Lapcevic <dordel@google.com> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit a0a04db ] When listing a table or chain with '-L', iptables will by default try to perform a reverse IP lookup for each IP referenced in the ruleset that is being listed. This adds a useless dependency on DNS, which can lead to an increased initialization time for the agent in case the DNS server is slow to respond. As the reverse lookup is not needed, switch to the '-S' command, which does not perform any reverse lookup. Signed-off-by: Gilberto Bertin <jibi@cilium.io> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit 2c9c8c1 ] When a service backend endpoint connects to itself via its service cluster IP address, the traffic is hairpin'd using a loopback IP address. We skip policy enforcement on egress, as a pod is allowed to connect to itself, and users don't have to specify additional policies to allow the hairpin'd traffic. We have a similar expectation on the ingress side. The ingress side was broken because of a regression due to which replies were dropped on the ingress policy enforcement path. The patch that introduced the regression split per-packet load-balancing logic into a separate tail call, where (limited) LB state is stored in packet context and restored later while handling the rest of the packet processing, including conntrack and policy enforcement. When a service pod connects to itself via its clusterIP, post the forward service translation, the conntrack state update is done on the conntrack entry with the loopback address (the restored state). As a result, when a reply is reverse SNAT'd and DNAT'd, the policy verdict for the `CT_NEW` entry is denied, and the reply packet is dropped. Prior to the regression, the original conntrack entry would have the updated state and loopback flag set. Fix: When hairpin'd traffic is reverse translated and sent for local delivery in the case of bpf_lxc, we should directly redirect it to the endpoint, thereby skipping the tail call with ingress policy enforcement altogether. Here is the packet flow:
Request path: podA -> clusterIP --> svc xlate (dAddr=podA) --> SNAT using loopback address (saddr=loopback IP) --> conntrack entry created (loopback IP, backend IP) with loopback flag state
Reply path: cluster IP -> podA --> svc rev-xlate (saddr=podA address) --> SNAT using loopback (daddr=loopback IP) --> CT_REPLY match --> local_delivery
Fixes: 7575ba0 ("datapath: Reduce from LXC complexity") Signed-off-by: Aditi Ghag <aditi@cilium.io> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit a064973 ] The test is meant to check that a TCPv4 packet addressed to FRONTEND_IP:FRONTEND_PORT gets DNATed to BACKEND_IP:BACKEND_PORT and TXed back out. But right now we actually assert that the destination port *doesn't* get DNATed, and the test still passes! The reason is that the .frag_off field in the IPv4 header isn't built correctly (it has the wrong endianness), so the LB code in lb4_xlate() believes that the packet doesn't have an L4 header and consequently also doesn't apply L4 DNAT on the packet. Fix both aspects. Fixes: df1bc96 ("bpf: ported existing unit tests to new testing framework") Signed-off-by: Julian Wiedmann <jwi@isovalent.com> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit 5c522b2 ] DSR NAT entries don't have the .host_local flag set, see eg. snat_v4_create_dsr(). This was presumably just copy&pasted from the other NAT entries in the file. It currently doesn't make a difference for the test, but let's still fix it. Fixes: 7904650 ("ctmap: add support for GC of DSR orphaned entries") Signed-off-by: Julian Wiedmann <jwi@isovalent.com> Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit b006a84 ] When clustermesh-apiserver Pod is deployed on the IPv6 single-stack cluster, etcd fails to startup with the error like this. ``` invalid value "https://127.0.0.1:2379,https://a:1:0:3::fc75:2379" for flag -listen-client-urls: URL address does not have the form "host:port": https://a:1:0:3::fc75:2379 ``` This happens because we don't put brackets around the IPv6 address. Fix Helm template to correctly handle that. Fixes: #22952 Signed-off-by: Yutaro Hayakawa <yutaro.hayakawa@isovalent.com> Signed-off-by: Timo Beckers <timo@isovalent.com>
This requirement was removed in 7435adb, but new test files are now trickling down into 1.12 through backports. Skip checking tags here as well. Signed-off-by: Timo Beckers <timo@isovalent.com>
[ upstream commit 6554eff ] Don't leave dangling IP routes behind, they can impact subsequent tests. Signed-off-by: Julian Wiedmann <jwi@isovalent.com> Signed-off-by: André Martins <andre@cilium.io>
[ upstream commit 87a916c ] This most likely just papers over bugs in other tests that fail to tear down their custom routes. Signed-off-by: Julian Wiedmann <jwi@isovalent.com> Signed-off-by: André Martins <andre@cilium.io>
The BPF unit/integration tests had some trouble running on older kernels (<=5.8). That is because the test runner always attempted to load all programs in the ELF. This fix makes it so we only attempt to load XDP and TC programs, both of which have `BPF_PROG_RUN` support. This commit also removes the rlimit memory lock which is required on pre-5.11 kernels. Fixes: #22779 Fixes: #22780 Signed-off-by: Dylan Reimerink <dylan.reimerink@isovalent.com>
This commit bumps cilium/ebpf to v0.10.0, as this version accommodates a change to the minimum eBPF program input size newer kernels will accept. Signed-off-by: Louis DeLosSantos <louis.delos@isovalent.com>
GitHub recently rolled out Docker buildx version v0.10.0 on their builders, which transparently changed the MediaType of docker images to OCI v1 and added provenance attestations. Unfortunately, various tools we use in CI like SBOM tooling and docker manifest inspect do not properly support some aspect of the new image formats. This resulted in breaking CI, with some messages like this: level=fatal msg="generating doc: creating SPDX document: generating SPDX package from image ref quay.io/cilium/docker-plugin-ci:XXX: generating image package" This could also lead CI to fail while waiting for image builds to complete, because the command we use to test whether the image is available did not support the image types. This commit attempts to revert buildx back to v0.9.1 to prevent it from generating the images in a format that other tooling doesn't expect. Over time we can work on migrating to buildx v0.10, testing various parts of our CI as we do so. This is a quick-and-dirty hack to stabilize CI for the short term, then we can figure out over time how to properly resolve the conflict between these systems. Signed-off-by: Joe Stringer <joe@cilium.io>
re-opened per: #23190 (comment)
My changes are dropped, and in a separate PR, so LGTM.
/test-backport-1.12
Job 'Cilium-PR-K8s-1.17-kernel-4.9' failed.
If it is a flake and a GitHub issue doesn't already exist to track it, comment
I guess we need to backport #18306?
oh wait, the failure is from
Okay yeah, a whole slew of v1.13 tests tried to run and failed, so we'll ignore those.
/mlh new-flake Cilium-PR-K8s-1.17-kernel-4.9 👍 created #23264 |
/test-1.17-4.9 |
Force-pushed from eb9f6ce to 2dd8d82.
All tests have passed. This PR has a commit on top to allow us to use the GHA workflow during this PR. I'm going to remove this commit now and then immediately merge this PR, since we know tests are OK.
@ldelossa awesome, thanks! Please be sure to follow up on running the command under
One more hint: For each PR that was not backported, I would suggest adding the
-> please double-check if this makes sense, rev_nodeport_lb6 changed quite a bit
Once this PR is merged, you can update the PR labels via:
PRs dropped from this round:
-> dropped, maintenance branches have different CODEOWNERS
-> relies on missing node.GetCiliumEndpointNodeIP()
-> generated quite a large diff I wasn't sure what to do with
-> can probably be made to fit with some work from the author; do we really want/need to backport this?
Manual fix added to skip check-missing-tags-in-tests.sh in the v1.12 branch due to 7435adb.
Manually modified cherry-pick of #22980 to avoid eBPF unit test failures.
Manual fix added to avoid eBPF unit test failures: #23092 (comment)