[FEATURE] Enable resource profiling for IM #6377

innobead · 2023-07-24T03:31:20Z

          > How many bad backups is it?

168

Would it be interesting to take a look at profiling data to determine which part of the controller is eating CPU? https://longhorn.io/kb/troubleshooting-generate-pprof-runtime-profiling-data/

This will definitely be useful to me going forward. Thanks for the call out. However, it looks like most of the CPU is eaten up by instance-manager, which we don't appear to have enabled pprof for.

Originally posted by @ejweber in #6358 (comment)

The text was updated successfully, but these errors were encountered:

innobead · 2023-07-24T03:32:02Z

Even though IM is system managed pod by Longhorn, but it still deserves profiling at runtime.

shuo-wu · 2023-07-25T15:10:15Z

One quick question, does this mean that we need a HTTP server inside IM pod for pprof?

innobead · 2023-09-14T07:05:52Z

It seems @Vicente-Cheng is working on this.

Vicente-Cheng · 2023-09-15T08:04:51Z

Maybe we could make it related to #6282
These would be similar.

innobead · 2023-11-13T02:32:48Z

@derekbit is working on it.

longhorn-io-github-bot · 2023-11-13T02:58:06Z

Pre Ready-For-Testing Checklist

Where is the reproduce steps/test steps documented?
The reproduce steps/test steps are at:
Does the PR include the explanation for the fix or the feature?
Have the backend code been merged (Manager, Engine, Instance Manager, BackupStore etc) (including backport-needed/*)?
The PR is at

longhorn/longhorn-instance-manager#307

Which areas/issues this PR might have potential impacts on?
Area: profiling
Issues

roger-ryao · 2023-12-29T03:47:52Z

Verified on master-head 20231229

longhorn master-head 07bdb41
longhorn-instance-manager master-head longhorn/longhorn-instance-manager@34c40f0

The test steps

Forward the port 6060 from the instance-manager pod to local port 6060:

kubectl port-forward ${instance-manager-pod-name} -n longhorn-system 6060:6060

see the pprof debug web page: http://localhost:6060/debug/pprof/

Result Passed

pprof service works as expected.

innobead assigned ejweber Jul 24, 2023

innobead added this to the v1.6.0 milestone Jul 24, 2023

innobead added component/longhorn-instance-manager Longhorn instance manager (interface between control and data plane) priority/2 Nice to fix in this release (managed by PO) area/troubleshoot Troubleshoot related labels Jul 24, 2023

ejweber mentioned this issue Aug 4, 2023

[BUG][1.5.0] Longhorn Manager High Memory Consumption #6315

Open

innobead added priority/1 Highly recommended to fix in this release (managed by PO) and removed priority/2 Nice to fix in this release (managed by PO) labels Sep 14, 2023

derekbit mentioned this issue Nov 9, 2023

Add pprof server in instance-manager pod longhorn/longhorn-instance-manager#307

Merged

innobead assigned derekbit and unassigned ejweber Nov 13, 2023

innobead modified the milestones: v1.6.0, v1.7.0 Nov 29, 2023

innobead modified the milestones: v1.7.0, v1.6.0 Dec 27, 2023

innobead assigned roger-ryao Dec 27, 2023

roger-ryao closed this as completed Dec 29, 2023

shuo-wu mentioned this issue Mar 14, 2024

[BUG] Replica rebuild failed #8091

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Enable resource profiling for IM #6377

[FEATURE] Enable resource profiling for IM #6377

innobead commented Jul 24, 2023

innobead commented Jul 24, 2023

shuo-wu commented Jul 25, 2023

innobead commented Sep 14, 2023

Vicente-Cheng commented Sep 15, 2023

innobead commented Nov 13, 2023

longhorn-io-github-bot commented Nov 13, 2023 •

edited by derekbit

roger-ryao commented Dec 29, 2023

[FEATURE] Enable resource profiling for IM #6377

[FEATURE] Enable resource profiling for IM #6377

Comments

innobead commented Jul 24, 2023

innobead commented Jul 24, 2023

shuo-wu commented Jul 25, 2023

innobead commented Sep 14, 2023

Vicente-Cheng commented Sep 15, 2023

innobead commented Nov 13, 2023

longhorn-io-github-bot commented Nov 13, 2023 • edited by derekbit

Pre Ready-For-Testing Checklist

roger-ryao commented Dec 29, 2023

longhorn-io-github-bot commented Nov 13, 2023 •

edited by derekbit