Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Expose prometheus metrics per node for current total pins queued or errored #1470

Closed
olizilla opened this issue Sep 15, 2021 · 1 comment · Fixed by #1637
Closed

Expose prometheus metrics per node for current total pins queued or errored #1470

olizilla opened this issue Sep 15, 2021 · 1 comment · Fixed by #1637
Labels
effort/days Estimated to take multiple days, but less than a week exp/intermediate Prior experience is likely helpful need/triage Needs initial labeling and prioritization P1 High: Likely tackled by core team if no one steps up status/ready Ready to be worked

Comments

@olizilla
Copy link
Contributor

We'd like to be able to scrape totals from each cluster node for pins that are in pin_queued or pin_error states, so we can chart them over time and alert on them when the numbers get high.

Right now we run ipfs-cluster-ctl manually when someone reports degraded service, and those are the things we check for, so it would be great to be able to automate it.

@olizilla olizilla added the need/triage Needs initial labeling and prioritization label Sep 15, 2021
@hsanjuan hsanjuan added effort/days Estimated to take multiple days, but less than a week exp/intermediate Prior experience is likely helpful P1 High: Likely tackled by core team if no one steps up status/ready Ready to be worked labels Sep 15, 2021
@hsanjuan hsanjuan added this to the Release v1.0.0 milestone Apr 22, 2022
hsanjuan added a commit that referenced this issue Apr 22, 2022
@olizilla
Copy link
Contributor Author

🎉 nice!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
effort/days Estimated to take multiple days, but less than a week exp/intermediate Prior experience is likely helpful need/triage Needs initial labeling and prioritization P1 High: Likely tackled by core team if no one steps up status/ready Ready to be worked
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants