Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Added a volume heartbeat graph to grafana dashboard #2511

Merged
merged 2 commits into from
Dec 15, 2021

Conversation

ErikCoder07
Copy link
Contributor

In the current dashboard there is no way to know the status of
volume servers, as prometheus only accepts push information, if
any server volume goes offline, the last information sent will
be read continuously by grafana.

Added a graph that calculates the delta between the information
of the last push timestamp with the timestamp of 15 seconds ago,
if the timestamp has not been changed, server state will be 0,
if volume server sent any information the server state will be 1.

-This graph will have to be modified if the time interval of pushs
is modified, a suggestion for improvement is to send the interval
in seconds along with the metrics.

-Only local tests were run, maybe if the latency between pushgateway
and volume server is too high, the delta calculation could be
affected

grafana_heartbeat_example
.

In the current dashboard there is no way to know the status of
volume servers, as prometheus only accepts push information, if
any server volume goes offline, the last information sent will
be read continuously by grafana.

Added a graph that calculates the delta between the information
of the last push timestamp with the timestamp of 15 seconds ago,
if the timestamp has not been changed, server state will be 0,
if volume server sent any information the server state will be 1.

-This graph will have to be modified if the time interval of pushs
is modified, a suggestion for improvement is to send the interval
in seconds along with the metrics.

-Only local tests were run, maybe if the latency between pushgateway
and volume server is too high, the delta calculation could be
affected.
@chrislusf
Copy link
Collaborator

Nice. Could this be merged with existing dashboards?

@ErikCoder07
Copy link
Contributor Author

Yes, I used the "seaweedfs/k8s/helm_charts2/dashboards/seaweedfs-grafana-dashboard.json" as a base, just added the graph in the volume server information. I created a new file because as I mentioned in the commit, it may be that in a situation of use that takes a little more than 15 seconds to push the metrics, in this case the delta calculation may not work, in the print I sent the volumes are in the same environment than the master, if everything is ok even with external volume servers it can be merged!

@chrislusf chrislusf merged commit f0492ac into seaweedfs:master Dec 15, 2021
chrislusf added a commit that referenced this pull request Dec 15, 2021
@chrislusf
Copy link
Collaborator

thanks and merged. I adjusted the file name to be consistent. Could you please add a README.md file there also?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants