mds: add new performance and subvolume utilization metrics#66551
vshankar
left a comment
Dropping some more comments. I'm still reviewing this closely and should be done by tomorrow.
vshankar
left a comment
Minor nits. Otherwise LGTM and would be ready to run through QA tests once the nits are addressed. Nice work @salieri11
Nice. Can we expect a backport to Tentacle or Squid?
Sorry! This is based on a biggish feature that will be available for Umbrella.
This PR is under test in https://tracker.ceph.com/issues/75073.
Perf metrics: CPU% and number of open requests. Subvolume utilization metrics: quota info and current size. Signed-off-by: Igor Golikov <igolikov@redhat.com> Fixes: https://tracker.ceph.com/issues/74135 Fixes: https://tracker.ceph.com/issues/73700
Signed-off-by: Igor Golikov <igolikov@redhat.com> Fixes: https://tracker.ceph.com/issues/74135
Test for CPU utilization and number of open requests. Signed-off-by: Igor Golikov <igolikov@redhat.com> Fixes: https://tracker.ceph.com/issues/73700
Add comprehensive tests to validate correct quota and current size metrics for subvolumes. Signed-off-by: Igor Golikov <igolikov@redhat.com> Fixes: https://tracker.ceph.com/issues/74135
docs for subvolume utilization and MDS perf metrics Signed-off-by: Igor Golikov <igolikov@redhat.com> Fixes: https://tracker.ceph.com/issues/74135 Fixes: https://tracker.ceph.com/issues/73700
…ifecycle() Signed-off-by: Venky Shankar <vshankar@redhat.com>
* refs/pull/66551/head:
Added a fix to use safe_while to avoid a racy check, and squashed the commits.
force pushed -- no changes
This is an automated message by src/script/redmine-upkeep.py. I have resolved the following tracker ticket due to the merge of this PR: No backports are pending for the ticket. If this is incorrect, please update the tracker. Update Log: https://github.com/ceph/ceph/actions/runs/22619802892
Summary
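This PR adds new observability metrics for CephFS MDS: performance metrics (CPU usage and number of open requests) and subvolume utilization metrics (quota info and current size).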
Fixes: https://tracker.ceph.com/issues/73700
Fixes: https://tracker.ceph.com/issues/74135
Changes
New per-rank counters exposed via `mds_rank_perf` labeled perf counters:
- `cpu_usage`
- `open_requests`

These metrics are sampled periodically and aggregated by rank 0 for cluster-wide visibility.
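As a rough illustration of how a periodically sampled CPU figure and a rank-0 roll-up can fit together, here is a minimal Python sketch. It is not taken from this PR. `CpuSample`, `cpu_usage_percent`, and `aggregate_by_rank` are hypothetical names, and deriving `cpu_usage` from the delta of cumulative CPU time over elapsed wall-clock time is an assumption about the approach, not a description of the MDS code.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Tuple


@dataclass
class CpuSample:
    """One periodic sample (hypothetical structure, for illustration only)."""
    cpu_seconds: float   # cumulative process CPU time (user + system)
    wall_seconds: float  # monotonic timestamp when the sample was taken


def cpu_usage_percent(prev: CpuSample, curr: CpuSample) -> float:
    """CPU% over the interval between two samples.

    Assumption: usage is the CPU time consumed divided by the wall-clock
    time elapsed between samples, expressed as a percentage.
    """
    elapsed = curr.wall_seconds - prev.wall_seconds
    if elapsed <= 0:
        # No time has passed (or the clock went backwards): report idle
        # rather than dividing by zero.
        return 0.0
    return 100.0 * (curr.cpu_seconds - prev.cpu_seconds) / elapsed


def aggregate_by_rank(
    reports: Iterable[Tuple[int, float, int]]
) -> Dict[int, Dict[str, float]]:
    """Keep the most recent (cpu_usage, open_requests) report per rank.

    Models, very loosely, one rank collecting reports from all ranks so
    that a single place holds the cluster-wide view.
    """
    latest: Dict[int, Dict[str, float]] = {}
    for rank, cpu_usage, open_requests in reports:
        latest[rank] = {"cpu_usage": cpu_usage, "open_requests": open_requests}
    return latest
```

In this sketch a later report from the same rank simply overwrites the earlier one; whether the real aggregation keeps history or only the latest value is not stated in the PR description.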
Subvolume Utilization Metrics
New fields added to subvolume metrics:
- `quota_bytes`
- `used_bytes` (`rstat.rbytes`)

Key implementation details:
- Quota values are kept in the `MetricsHandler::subvolume_quota` map, updated via `MDCache::broadcast_quota_to_client`
- Entries are dropped after `2 × subv_metrics_window_interval` of inactivity to prevent unbounded memory growth
- `used_bytes` is sourced from cached broadcast values with fallback to dynamic `rstat` fetch

Important behavioral note:
- `used_bytes` updates on next data I/O or quota broadcast
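The quota-cache behavior described under Subvolume Utilization Metrics (cached values refreshed on quota broadcast, a fallback to a dynamic `rstat` fetch, and eviction after twice the metrics window of inactivity) can be sketched in Python as follows. This is an illustrative model only, not the MDS implementation: `SubvolumeQuotaCache`, `QuotaEntry`, `on_quota_broadcast`, `evict_inactive`, and the `fetch_rstat` callback are hypothetical names, and treating "activity" as "last quota broadcast" is an assumption.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class QuotaEntry:
    quota_bytes: int
    used_bytes: Optional[int]  # None if no size was carried by the broadcast
    last_seen: float           # time of the last broadcast for this subvolume


class SubvolumeQuotaCache:
    """Toy model of a per-subvolume quota cache with inactivity eviction."""

    def __init__(self, window_interval: float) -> None:
        # Entries expire after 2 x the metrics window, mirroring the
        # "2 x subv_metrics_window_interval" rule in the PR description.
        self.ttl = 2 * window_interval
        self.entries: Dict[str, QuotaEntry] = {}

    def on_quota_broadcast(self, path: str, quota_bytes: int,
                           used_bytes: Optional[int], now: float) -> None:
        """Record the values carried by a quota broadcast."""
        self.entries[path] = QuotaEntry(quota_bytes, used_bytes, now)

    def used_bytes(self, path: str,
                   fetch_rstat: Callable[[str], int]) -> int:
        """Prefer the cached value; otherwise fall back to a dynamic fetch."""
        entry = self.entries.get(path)
        if entry is not None and entry.used_bytes is not None:
            return entry.used_bytes
        return fetch_rstat(path)

    def evict_inactive(self, now: float) -> List[str]:
        """Drop entries idle for at least the TTL; return the evicted paths."""
        stale = [p for p, e in self.entries.items()
                 if now - e.last_seen >= self.ttl]
        for p in stale:
            del self.entries[p]
        return stale
```

Because the cached value is only replaced when a new broadcast arrives, a reader of this cache can see a stale `used_bytes` until the next update, which is the behavior the note above calls out.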