Skip to content

This script uses the sampling facilities built into Qumulo Core to provide an overview of capacity consumption first by user, and then by area of the tree.

License

Notifications You must be signed in to change notification settings

patjakdev/capacity_by_user

 
 

Repository files navigation

Capacity By User

What's the point?

This script uses the sampling facilities built into Qumulo Core to provide an overview of capacity consumption first by user, and then by area of the tree.

Scenario

Imagine you have a hundred or a thousand artists or researchers all busily collaborating on a project. They create data, and sometimes they forget to go clean it up. You want to get your army of talented professionals to reduce their capacity consumption. What do you tell them? This script breaks down a tree by user and then shows, for each user, where that user consumes space in the tree.

Usage

usage: capacity_by_user.py [-h] [-U USER] [-P PASSWORD] [-C CLUSTER] [-p PORT]
                           [-s SAMPLES] [-c CONCURRENCY] [-m MIN_SAMPLES]
                           [-x MAX_LEAVES] [-D DOLLARS_PER_TERABYTE] [-i] [-A]
                           path

positional arguments:
  path                  Filesystem path to sample

optional arguments:
  -h, --help            show this help message and exit
  -U USER, --user USER  The user to connect as (default: admin)
  -P PASSWORD, --password PASSWORD
                        The password to connect with (default: admin)
  -C CLUSTER, --cluster CLUSTER
                        The hostname of the cluster to connect to (default:
                        qumulo)
  -p PORT, --port PORT  The port to connect to (default: 8000)
  -s SAMPLES, --samples SAMPLES
                        The number of samples to take (default: 2000)
  -c CONCURRENCY, --concurrency CONCURRENCY
                        The number of threads to query with (default: 10)
  -m MIN_SAMPLES, --min-samples MIN_SAMPLES
                        The minimum number of samples to show at a leaf in
                        output (default: 5)
  -x MAX_LEAVES, --max-leaves MAX_LEAVES
                        The maximum number of leaves to show per user
                        (default: 30)
  -D DOLLARS_PER_TERABYTE, --dollars-per-terabyte DOLLARS_PER_TERABYTE
                        Show capacity in dollars. Set conversion factor in
                        $/TB/month
  -i, --confidence-interval
                        Show 95% confidence intervals
  -A, --allow-self-signed-server
                        Silently connect to self-signed servers

About

This script uses the sampling facilities built into Qumulo Core to provide an overview of capacity consumption first by user, and then by area of the tree.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 93.5%
  • Shell 6.5%