A collection of diamond collectors for slurm.
Branch: master
Clone or download
Latest commit bd97367 Jan 7, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
README.md
slurm_cluster_status_diamond.py Fixing bug. Jan 7, 2019
slurm_job_leaderboard.py
slurm_sched_stats_diamond.py
slurm_sshare_diamond.py

README.md

slurm-diamond-collectors

A collection of custom diamond collectors to gather various slurm stats.

Description

These collectors are intended to be used with diamond to ship stats to graphite. Each collector collects data on a different aspect of slurm. Feel free to add or update these collectors to suit your needs.

SlurmSchedStatsCollector

This collector is a diamond version of this:

http://giovannitorres.me/graphing-sdiag-with-graphite.html

As described in the above page it will require you to install pyslurm. This collector will collect sdiag stats allowing you to chart your scheduler performance over time.

SlurmSshareCollector

This collector grabs the current sshare data for users. This assumes that you are using a two tier simple fairshare system of accounts and users of those accounts.

SlurmClusterStatusCollector

This collector pulls the current state of all the nodes in the cluster and then computes overall stats of the cluster such as number of nodes down, number of nodes in use, etc.

SlurmJobLeaderBoard

This collector pulls in the current job information for the last hour. It then summarizes the data per user to be plugged into a leaderboard for the top users.

Usage

Simply add them to /usr/share/diamond/collectors and then activate them in diamond and you should be good to go.