Search before asking
Motivation
When pushing Pulsar clusters to maximum performance, it is important to understand which metric is triggering bundle splits and unloads. The existing metrics only show that a split or unload occurred, not why:
- bundle unload metrics
- bundle split metrics
The logs do include the values involved, but they are hard to reason about and hard to monitor.
Solution
Add per-reason counts for load-balancing events (bundle splits and unloads), with one count for each of the following triggers:
- Bandwidth
- CPU
- Direct Memory
- Memory
- Admin
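To make the request concrete, here is a minimal sketch of what per-reason counting could look like. Every name in it (`LoadBalanceReasonCounters`, `Event`, `Reason`, `record`, `get`) is hypothetical and chosen for illustration only; none of it is existing Pulsar API, and a real implementation would need to hook into the broker's load manager and its metrics exporter.

```java
import java.util.EnumMap;
import java.util.Map;
import java.util.concurrent.atomic.LongAdder;

// Hypothetical sketch: one counter per (event, reason) pair.
// Not part of Pulsar; names are assumptions for illustration.
final class LoadBalanceReasonCounters {

    // The two load-balancing events named in this issue.
    public enum Event { SPLIT, UNLOAD }

    // The reasons listed above.
    public enum Reason { BANDWIDTH, CPU, DIRECT_MEMORY, MEMORY, ADMIN }

    private final Map<Event, Map<Reason, LongAdder>> counts =
            new EnumMap<>(Event.class);

    LoadBalanceReasonCounters() {
        // Pre-populate every pair so the maps are never modified after
        // construction; only the LongAdder values change afterwards.
        for (Event event : Event.values()) {
            Map<Reason, LongAdder> perReason = new EnumMap<>(Reason.class);
            for (Reason reason : Reason.values()) {
                perReason.put(reason, new LongAdder());
            }
            counts.put(event, perReason);
        }
    }

    // Call at the point where a split or unload decision is made.
    void record(Event event, Reason reason) {
        counts.get(event).get(reason).increment();
    }

    // Current count for one (event, reason) pair.
    long get(Event event, Reason reason) {
        return counts.get(event).get(reason).sum();
    }

    public static void main(String[] args) {
        LoadBalanceReasonCounters counters = new LoadBalanceReasonCounters();
        counters.record(Event.SPLIT, Reason.CPU);
        counters.record(Event.SPLIT, Reason.CPU);
        counters.record(Event.UNLOAD, Reason.ADMIN);

        // Print every pair, e.g. as input for a metrics endpoint.
        for (Event event : Event.values()) {
            for (Reason reason : Reason.values()) {
                System.out.println(event + " " + reason + " = "
                        + counters.get(event, reason));
            }
        }
    }
}
```

Exposed as metrics, counts like these would let operators alert on, for example, a sustained rise in CPU-triggered unloads without parsing logs.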
Alternatives
Make the logs much clearer about which metric triggered a bundle split or unload.
Anything else?
This is related to #16691
Are you willing to submit a PR?