Skip to content

fasrc/prometheus-slurm-exporter

Repository files navigation

prometheus-slurm-exporter

Prometheus Exporter for Slurm. Uses the prometheus python implementation.

Description

These collectors are intended to be used with prometheus to ship stats. Each collector collects data on a different aspect of slurm. Feel free to add or update these collectors to suit your needs.

SlurmSchedStatsCollector

This collector is a prometheus version of this:

http://giovannitorres.me/graphing-sdiag-with-graphite.html

This collector will collect sdiag stats allowing you to chart your scheduler performance over time.

SlurmSshareCollector

This collector grabs the current sshare data for users. This assumes that you are using a two tier simple fairshare system of accounts and users of those accounts.

SlurmClusterStatusCollector

This collector pulls the current state of all the nodes in the cluster and then computes overall stats of the cluster such as number of nodes down, number of nodes in use, etc.

SlurmSeasStatsCollector

This collector pulls stats for Harvard SEAS.

Usage

Simply build the rpm via rpmbuild -ba prometheus-slurm-exporter.spec and install then use systemd to run the various unit files to get the exporters running.

Dashboards

You can example dashboards for the various collectors in the dashboards directory.

About

Prometheus Exporter for Slurm

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published