
fix(bpf): added function supported by arm64 #5

Merged · 21 commits · Oct 31, 2022

Conversation


@lukidzi commented Oct 20, 2022

It looks like fexit and fentry are not supported on arm64:

Running:  /kuma/ebpf/mb_netns_cleanup --bpffs /sys/fs/bpf
libbpf: prog 'net_ns_net_exit': failed to attach: ERROR: strerror_r(-524)=22
libbpf: prog 'net_ns_net_exit': failed to auto-attach: -524
attaching mb_netns_cleanup program failed with error: -524

I've also tried to run an example application with fentry/fexit, and it didn't work. I've created a macro that depends on the architecture: on arm64 it uses a kretprobe, and on other architectures fexit (sketched below).

bpftrace/bpftrace#1833
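For illustration, a minimal sketch of that arch-dependent switch (the CLEANUP_SEC macro name is hypothetical, not from this PR; __TARGET_ARCH_arm64 follows libbpf's bpf_tracing.h convention and must be defined when compiling for arm64):

```c
// Sketch only: choose the attach point at compile time.
#if defined(__TARGET_ARCH_arm64)
// arm64: fall back to a kretprobe; fentry/fexit attachment fails
// there with -524, the kernel's internal -ENOTSUPP
#define CLEANUP_SEC SEC("kretprobe/net_ns_net_exit")
#else
// everywhere else: keep the fexit trampoline
#define CLEANUP_SEC SEC("fexit/net_ns_net_exit")
#endif
```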

Signed-off-by: Łukasz Dziedziak <lukidzi@gmail.com>
bpf/mb_netns_cleanup.c
@@ -36,7 +36,12 @@ struct {
__uint(pinning, LIBBPF_PIN_BY_NAME);
} local_pod_ips SEC(".maps");

// arm64 doesn't support fexit/fentry


Can you maybe add a link to explain why kretprobe is the right substitute?

SEC("fexit/net_ns_net_exit")
#endif
int BPF_PROG(net_ns_net_exit, struct net *net, long ret)
lukidzi (Author)

Probably for kprobe we need to change the type and parameters to BPF_KPROBE(net_ns_net_exit, struct net *net)
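For reference, a rough sketch of how the handler signature differs between the attach types (the helper macros come from libbpf's <bpf/bpf_tracing.h>; the suffixed program names are mine, just to keep the sketch self-consistent):

```c
// Sketch only, not the final PR code.
#include "vmlinux.h"
#include <bpf/bpf_helpers.h>
#include <bpf/bpf_tracing.h>

// fexit: typed access to both the arguments and the return value
SEC("fexit/net_ns_net_exit")
int BPF_PROG(net_ns_net_exit, struct net *net, long ret)
{
    return 0;
}

// kprobe (function entry): typed access to the arguments only
SEC("kprobe/net_ns_net_exit")
int BPF_KPROBE(net_ns_net_exit_entry, struct net *net)
{
    return 0;
}

// kretprobe (function return): the entry arguments are no longer
// reliably available; BPF_KRETPROBE only exposes the return value
SEC("kretprobe/net_ns_net_exit")
int BPF_KRETPROBE(net_ns_net_exit_ret)
{
    return 0;
}

char LICENSE[] SEC("license") = "GPL";
```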

instead of cleaning up ourselves we can use an LRU_HASH map:
after it reaches max size, the oldest element is removed

Signed-off-by: Łukasz Dziedziak <lukidzi@gmail.com>
-    __uint(type, BPF_MAP_TYPE_HASH);
-    __uint(max_entries, 1024);
+    __uint(type, BPF_MAP_TYPE_LRU_HASH);
+    __uint(max_entries, 65535);
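Put together with the fields visible in the earlier hunk, the updated definition would look roughly like this (the key/value types are my assumption; this hunk doesn't show them):

```c
struct {
    __uint(type, BPF_MAP_TYPE_LRU_HASH);
    __uint(max_entries, 65535);
    __type(key, __u32);                   // assumption: pod IP as the key
    __type(value, struct pod_config);     // assumption: per-pod config value
    __uint(pinning, LIBBPF_PIN_BY_NAME);  // carried over from the earlier hunk
} local_pod_ips SEC(".maps");
```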


Might be worth saying where this number is coming from

Signed-off-by: Łukasz Dziedziak <lukidzi@gmail.com>
// The map can hold configuration for 65535 unique
// pods; if a new one appears, the oldest not-accessed
// entry is removed from the map by the kernel. This
// ensures that configuration of pods that no longer
// exist is eventually removed.


Random question: could there be a problem with a pod receiving 0 traffic?

lukidzi (Author)

It shouldn't be a problem. If a pod doesn't receive traffic, the only situation in which its entry might be removed is when we deploy more than 65535 pods in the cluster.


Yes, but we're not talking about simultaneous pods, right? For example, in a cluster that runs batch jobs we create new, short-lived pods all the time. Could this potentially be a problem?

This doesn't have to be fixed now, as it's an edge case, but it might be worth at least understanding whether it's possible.

lukidzi (Author)

Yes, that could be an edge case. 65535 is the number of pods per node, not per cluster. But true: if we have a lot of short-lived pods, their configuration piles up in the map, and a pod that receives no traffic at all (not even health checks) might then be evicted.
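One detail that bounds this edge case: the kernel marks an LRU_HASH entry as recently used on every successful lookup from BPF program context, so any pod whose entry the datapath actually consults is moved to the fresh end of the LRU; only entries that are never looked up can age out. A minimal sketch, assuming the datapath looks up local_pod_ips by pod IP:

```c
// Sketch only: pod_config and the __u32 key carry over the
// assumptions from the map sketch above.
static __always_inline struct pod_config *get_pod(__u32 pod_ip)
{
    // A successful bpf_map_lookup_elem() from BPF context sets the
    // entry's LRU "referenced" bit, so pods that see traffic are
    // not eviction candidates.
    return bpf_map_lookup_elem(&local_pod_ips, &pod_ip);
}
```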
