mgr/prometheus: fix pool_objects_repaired and daemon_health_metrics format #51090
Looks great. Thanks @banuchka for taking care of this. The Signed-off-by is missing from the commit message, and so is the Fixes: line. If you can change that and paste the current output of /metrics, it would be awesome.
Btw, just curious, are repeated headers an actual issue or just cumbersome?
Btw @banuchka, I see you targeted this PR at quincy only. Our normal workflow is to first create the PR against main and merge it there, then cherry-pick those commits to stable branches like
@pereman2 It is not a valid output format for the Prometheus scraper, as in the example below:
Do I need to create a new PR, or is it possible to change the target on the current one? (I can't find how to do that, my bad.)
3031f19 to 9c6258d
This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved.
You should set the backport field in the tracker ticket (I did it), and then when this PR is merged to the main branch and the ticket status is changed to "Pending backport" the backport tickets will be created automatically.
@banuchka The update looks good, please squash it into the original commit and:
123f59c to 806e600
@idryomov done
I would say that it's a separate issue (and we still need to hear from @pereman2 on why this metric deviated from the rest in this regard).
@idryomov I honestly didn't give much thought to it. I remember pushing those PRs quickly, so poolid was a clear mistake. I think it is great news if this can be fixed in this PR.
@pereman2 Could you please respond on #48843 (comment)? Then perhaps @banuchka could address all these issues in this PR and we could backport in one go.
806e600 to ad365fb
mgr/prometheus: fix pool_objects_repaired and daemon_health_metrics format

- fix "error reading metrics for http://****:***/metrics: reading text format failed: text format parsing error in line 2010: second HELP line for metric name "ceph_pool_objects_repaired" error
- rename label name "poolid" to "pool_id" like all other metrics
- change type for the "daemon_health_metrics" to gauge

Fixes: https://tracker.ceph.com/issues/59505
Signed-off-by: banuchka <tyrchenok@gmail.com>
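For anyone who wants to reproduce the "second HELP line" error from the commit message locally, here is a minimal sketch of a check over the text of a /metrics response. The function name `duplicate_help_metrics` is hypothetical and is not part of Ceph or of any Prometheus client library; it only counts `# HELP` lines per metric name and does not validate anything else in the exposition format.

```python
import re
from collections import Counter

# Matches the metric name in a "# HELP <name> <text>" line.
HELP_RE = re.compile(r"^# HELP (\S+)")


def duplicate_help_metrics(text):
    """Return the sorted metric names that have more than one '# HELP' line."""
    counts = Counter()
    for line in text.splitlines():
        match = HELP_RE.match(line)
        if match:
            counts[match.group(1)] += 1
    return sorted(name for name, n in counts.items() if n > 1)
```

Feeding it the saved body of a /metrics request should return an empty list once each metric family has a single header.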
ad365fb to 95d5303
@idryomov Now it should be better, I hope.
Dismissing since the new version needs to be retested. @pereman2 please test as you did before and ideally post the output here.
# HELP ceph_pool_objects_repaired Number of objects repaired in a pool
# TYPE ceph_pool_objects_repaired counter
ceph_pool_objects_repaired{pool_id="9"} 0.0
ceph_pool_objects_repaired{pool_id="8"} 0.0
ceph_pool_objects_repaired{pool_id="7"} 0.0
ceph_pool_objects_repaired{pool_id="2"} 0.0
ceph_pool_objects_repaired{pool_id="1"} 0.0
ceph_pool_objects_repaired{pool_id="3"} 0.0
ceph_pool_objects_repaired{pool_id="4"} 0.0
ceph_pool_objects_repaired{pool_id="5"} 0.0
ceph_pool_objects_repaired{pool_id="6"} 0.0
# HELP ceph_daemon_health_metrics Health metrics for Ceph daemons
# TYPE ceph_daemon_health_metrics gauge
ceph_daemon_health_metrics{type="SLOW_OPS",ceph_daemon="mon.a"} 0.0
ceph_daemon_health_metrics{type="SLOW_OPS",ceph_daemon="mon.b"} 0.0
ceph_daemon_health_metrics{type="SLOW_OPS",ceph_daemon="mon.c"} 0.0
ceph_daemon_health_metrics{type="SLOW_OPS",ceph_daemon="osd.0"} 0.0
ceph_daemon_health_metrics{type="PENDING_CREATING_PGS",ceph_daemon="osd.0"} 0.0
ceph_daemon_health_metrics{type="SLOW_OPS",ceph_daemon="osd.1"} 0.0
ceph_daemon_health_metrics{type="PENDING_CREATING_PGS",ceph_daemon="osd.1"} 0.0
ceph_daemon_health_metrics{type="SLOW_OPS",ceph_daemon="osd.2"} 0.0
ceph_daemon_health_metrics{type="PENDING_CREATING_PGS",ceph_daemon="osd.2"} 0.0
looks good.
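The shape of the output above (one HELP line and one TYPE line per metric, followed by every sample) can be sketched as below. This is only an illustration, not the code in mgr/prometheus: `render_family` is a hypothetical helper, and it assumes label values need no escaping.

```python
def render_family(name, help_text, mtype, samples):
    """Render one metric family: a single HELP/TYPE header, then all samples.

    `samples` is a list of (labels_dict, value) pairs. Label values are
    assumed to contain no characters that need escaping.
    """
    lines = [f"# HELP {name} {help_text}", f"# TYPE {name} {mtype}"]
    for labels, value in samples:
        label_str = ",".join(f'{k}="{v}"' for k, v in labels.items())
        lines.append(f"{name}{{{label_str}}} {float(value)}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_family(
        "ceph_pool_objects_repaired",
        "Number of objects repaired in a pool",
        "counter",
        [({"pool_id": "9"}, 0), ({"pool_id": "8"}, 0)],
    ), end="")
```

Emitting the header once per family, rather than once per sample, is what avoids the repeated HELP lines that the scraper rejected.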
jenkins test make check
jenkins test dashboard
I created a bug report on Ceph Tracker - BUG #59505
This PR is a fix for: