feat: accept protobuf responses where possible from kube-api (#18603) (PR #18602)

Open
cnmcavoy wants to merge 5 commits into master from cmcavoy/kube-api-protobuf

Conversation

@cnmcavoy commented Jun 11, 2024

Checklist:

  • Either (a) I've created an enhancement proposal and discussed it with the community, (b) this is a bug fix, or (c) this does not need to be in the release notes.
  • The title of the PR states what changed and the related issue number (used for the release note).
  • The title of the PR conforms to the Toolchain Guide
  • I've included "Closes [ISSUE #]" or "Fixes [ISSUE #]" in the description to automatically close the associated issue.
  • I've updated both the CLI and UI to expose my feature, or I plan to submit a second PR with them.
  • Does this PR require documentation updates?
  • I've updated documentation as required by this PR.
  • I have signed off all my commits as required by DCO
  • I have written unit and/or e2e tests for my change. PRs without these are unlikely to be merged.
  • My build is green (troubleshooting builds).
  • My new feature complies with the feature status guidelines.
  • I have added a brief description of why this PR is necessary and/or what this PR solves.
  • Optional. My organization is added to USERS.md.
  • Optional. For bug fixes, I've indicated what older releases this fix should be cherry-picked into (this may or may not happen depending on risk/complexity).

Fixes: #18603

@cnmcavoy requested a review from a team as a code owner on June 11, 2024 at 20:47
@cnmcavoy changed the title from "Accept protobuf responses where possible from kube-api" to "feat: accept protobuf responses where possible from kube-api (#18603)" on Jun 11, 2024
@cnmcavoy force-pushed the cmcavoy/kube-api-protobuf branch 2 times, most recently from b3a9a17 to 81b6846 on June 11, 2024 at 21:50

codecov bot commented Jun 11, 2024

Codecov Report

Attention: Patch coverage is 17.33333% with 62 lines in your changes missing coverage. Please review.

Project coverage is 50.34%. Comparing base (1aa898c) to head (5a30eb7).
Report is 5 commits behind head on master.

File | Patch % | Lines
cmd/argocd-server/commands/argocd_server.go | 0.00% | 37 Missing ⚠️
pkg/apis/application/v1alpha1/types.go | 0.00% | 6 Missing and 1 partial ⚠️
cmd/argocd-dex/commands/argocd_dex.go | 0.00% | 6 Missing ⚠️
...ntroller/commands/argocd_application_controller.go | 0.00% | 4 Missing ⚠️
cmd/argocd-notification/commands/controller.go | 0.00% | 4 Missing ⚠️
...t-controller/commands/applicationset_controller.go | 66.66% | 1 Missing and 1 partial ⚠️
controller/appcontroller.go | 80.00% | 1 Missing ⚠️
controller/cache/cache.go | 50.00% | 1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           master   #18602      +/-   ##
==========================================
- Coverage   50.36%   50.34%   -0.02%     
==========================================
  Files         315      315              
  Lines       43190    43219      +29     
==========================================
+ Hits        21752    21758       +6     
- Misses      18954    18975      +21     
- Partials     2484     2486       +2     


@cnmcavoy requested a review from a team as a code owner on June 11, 2024 at 22:40
@cnmcavoy force-pushed the cmcavoy/kube-api-protobuf branch 2 times, most recently from f838bf9 to 9c5f234 on June 11, 2024 at 22:48
@jannfis (Member) commented Jun 11, 2024

I wonder how big of an improvement this is, @cnmcavoy. Do you have any relevant numbers please?

@chaochn47 commented Jun 13, 2024

This is the result I got when experimenting on a 15k-pod cluster before giving the advice to tweak the content type to proto.

Expect the difference to grow at least linearly as the number of pods grows.

(24-06-13 1:08:36) <0> [~/workplace/EKSKubernetes/src/EKSDataPlaneKubernetes/staging/src/k8s.io/client-go/examples/out-of-cluster-client-configuration]
dev-dsk-chaochn-2c-a26acd76 % ./app --use-proto=true
I0613 01:08:51.106629  140283 main.go:80] took 3.154472032s, pods resourceVersion 700889256, 15025 pods

(24-06-13 1:08:51) <0> [~/workplace/EKSKubernetes/src/EKSDataPlaneKubernetes/staging/src/k8s.io/client-go/examples/out-of-cluster-client-configuration]
dev-dsk-chaochn-2c-a26acd76 % ./app --use-proto=false
I0613 01:09:07.317080  140898 main.go:80] took 8.381112862s, pods resourceVersion 700895115, 15025 pods

(24-06-13 1:09:07) <0> [~/workplace/EKSKubernetes/src/EKSDataPlaneKubernetes/staging/src/k8s.io/client-go/examples/out-of-cluster-client-configuration]
dev-dsk-chaochn-2c-a26acd76 % ./app --use-proto=true
I0613 01:19:23.438321  149026 main.go:80] took 2.846576869s, pods resourceVersion 701210537, 15025 pods

(24-06-13 1:19:23) <0> [~/workplace/EKSKubernetes/src/EKSDataPlaneKubernetes/staging/src/k8s.io/client-go/examples/out-of-cluster-client-configuration]
dev-dsk-chaochn-2c-a26acd76 % ./app --use-proto=false
I0613 01:19:33.774061  149092 main.go:80] took 8.52089375s, pods resourceVersion 701214264, 15025 pods

(24-06-13 1:19:33) <0> [~/workplace/EKSKubernetes/src/EKSDataPlaneKubernetes/staging/src/k8s.io/client-go/examples/out-of-cluster-client-configuration]
dev-dsk-chaochn-2c-a26acd76 % ./app --use-proto=true
I0613 01:30:38.632484  157860 main.go:80] took 2.890275373s, pods resourceVersion 701554289, 15025 pods

(24-06-13 1:30:38) <0> [~/workplace/EKSKubernetes/src/EKSDataPlaneKubernetes/staging/src/k8s.io/client-go/examples/out-of-cluster-client-configuration]
dev-dsk-chaochn-2c-a26acd76 % ./app --use-proto=false
I0613 01:30:48.910746  157947 main.go:80] took 8.345562795s, pods resourceVersion 701557468, 15025 pods
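
The benchmark binary itself isn't included here, but a minimal client-go sketch of the --use-proto toggle it appears to exercise (the flag name is taken from the output above; everything else is illustrative, not the actual program) looks roughly like this:

package main

import (
    "context"
    "flag"
    "fmt"
    "time"

    metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
    "k8s.io/client-go/kubernetes"
    "k8s.io/client-go/tools/clientcmd"
)

func main() {
    kubeconfig := flag.String("kubeconfig", "", "path to kubeconfig")
    useProto := flag.Bool("use-proto", true, "request protobuf instead of JSON from the API server")
    flag.Parse()

    config, err := clientcmd.BuildConfigFromFlags("", *kubeconfig)
    if err != nil {
        panic(err)
    }
    if *useProto {
        // Prefer protobuf-encoded responses for built-in types, with JSON as a fallback.
        config.AcceptContentTypes = "application/vnd.kubernetes.protobuf,application/json"
        config.ContentType = "application/vnd.kubernetes.protobuf"
    }

    clientset, err := kubernetes.NewForConfig(config)
    if err != nil {
        panic(err)
    }

    // Time a full pod list across all namespaces, mirroring the log lines above.
    start := time.Now()
    pods, err := clientset.CoreV1().Pods("").List(context.TODO(), metav1.ListOptions{})
    if err != nil {
        panic(err)
    }
    fmt.Printf("took %s, pods resourceVersion %s, %d pods\n", time.Since(start), pods.ResourceVersion, len(pods.Items))
}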

@crenshaw-dev (Collaborator) commented:

Tested against an internal instance, and I'm not seeing a significant difference in resource usage (memory, cpu, network).

I think part of the problem is that the PR only sets content types on some of the clients, mostly the ones used to get resources the component immediately needs from the local k8s API (e.g. Secrets and ConfigMaps).

The heavy lifting of the application controller is done with separately-initialized k8s clients in the cluster cache code. I tried setting those to use protobuf as well (b98df97), but so far I'm not seeing significant wins.
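
The general shape of that kind of change, assuming the cluster cache clients are built from a *rest.Config (the exact wiring in b98df97 may differ), is to adjust the content-negotiation fields before the clients are constructed, keeping JSON as an accepted fallback because CRD-backed resources are only served as JSON:

package clientutil // illustrative package name

import (
    "strings"

    "k8s.io/apimachinery/pkg/runtime"
    "k8s.io/client-go/rest"
)

// withProtobuf returns a copy of config that asks the kube-apiserver for
// protobuf-encoded responses while still accepting JSON, since custom
// resources have no protobuf encoding.
func withProtobuf(config *rest.Config) *rest.Config {
    cfg := rest.CopyConfig(config)
    cfg.ContentType = runtime.ContentTypeProtobuf // "application/vnd.kubernetes.protobuf"
    cfg.AcceptContentTypes = strings.Join([]string{
        runtime.ContentTypeProtobuf,
        runtime.ContentTypeJSON, // fallback for resources without protobuf support
    }, ",")
    return cfg
}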

@crenshaw-dev (Collaborator) commented:

Here's unchanged vs. this PR vs. this PR + cluster cache change:

[screenshot: resource usage comparison of unchanged vs. this PR vs. this PR + cluster cache change]

I'm surprised not to see any difference after the cluster cache change, but maybe there's just not enough activity on the instance for it to matter.

@cnmcavoy force-pushed the cmcavoy/kube-api-protobuf branch 2 times, most recently from e0162be to 8eb4284 on June 18, 2024 at 18:22
@cnmcavoy (Author) commented:

> I think part of the problem is that the PR only sets content types on some of the clients, mostly the ones used to get resources the component immediately needs from the local k8s API (e.g. Secrets and ConfigMaps).

I updated the PR to ensure that the protobuf headers get set in the other places, and I also tried to wire up the user agents correctly. The cluster cache and several controllers were defaulting to the stock user agent, which made it hard to trace who was making the k8s API calls.
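
The user-agent side of that is essentially setting rest.Config.UserAgent per component before the clients are built; a rough sketch (the helper and component string below are illustrative, not the exact code in this PR):

package clientutil // illustrative package name

import "k8s.io/client-go/rest"

// withUserAgent returns a copy of config whose requests identify the calling
// component (e.g. in apiserver logs and audit events) instead of using the
// stock client-go user agent.
func withUserAgent(config *rest.Config, component string) *rest.Config {
    cfg := rest.CopyConfig(config)
    cfg.UserAgent = component + " " + rest.DefaultKubernetesUserAgent()
    return cfg
}

A component would then build its clientset from something like withUserAgent(withProtobuf(baseConfig), "argocd-application-controller"), making its requests both protobuf-preferring and traceable in API server logs.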

cnmcavoy and others added 4 commits July 1, 2024 16:53
Signed-off-by: Cameron McAvoy <cmcavoy@indeed.com>
Signed-off-by: Cameron McAvoy <cmcavoy@indeed.com>
Signed-off-by: Michael Crenshaw <350466+crenshaw-dev@users.noreply.github.com>
Signed-off-by: Michael Crenshaw <350466+crenshaw-dev@users.noreply.github.com>
…ults

Signed-off-by: Cameron McAvoy <cmcavoy@indeed.com>

Successfully merging this pull request may close these issues: Accept protobuf responses where possible from kube-api (#18603).

4 participants