SLURM Cluster Via Docker

This is a minimal and bad, but working, example of a Slurm (aka slurm-wlm, aka Slurm LLNL) compute cluster. It should work on bare-metal Ubuntu 18.04 machines, or in Docker.

This isn't anywhere near production ready; treat it as useful information on your own Slurm sysadmin quest.

I couldn't get Slurm to compile reliably, so I'm using the version from Ubuntu's repos. Ubuntu 18.04 has a reasonably new version.

Slurm images

There are three images

  • slurm_ctld - Slurm controller
  • slurm_node - Slurm compute node
  • hpc_frontend - has the Slurm command-line tools, simulating how you might set up the front end of your own cluster
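
As a rough way to sanity-check the hpc_frontend container, something like the sketch below could be run inside it. It is not part of this repo: the function names are made up for illustration, and it assumes the standard Slurm client commands (sinfo, squeue, sbatch, srun) are what the Ubuntu packages put on the PATH.

```python
import shutil
import subprocess

# Assumption: these are the client tools expected on the front end.
SLURM_TOOLS = ("sinfo", "squeue", "sbatch", "srun")


def missing_slurm_tools(tools=SLURM_TOOLS, which=shutil.which):
    """Return the tools from `tools` that cannot be found on the PATH."""
    return [tool for tool in tools if which(tool) is None]


def run_hostname_job(run=subprocess.run):
    """Run `hostname` on one compute node via srun and return its output."""
    result = run(
        ["srun", "-N1", "hostname"],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if srun exits non-zero
    )
    return result.stdout.strip()


if __name__ == "__main__":
    missing = missing_slurm_tools()
    if missing:
        print("Missing Slurm tools:", ", ".join(missing))
    else:
        print("Job ran on node:", run_hostname_job())
```

If the controller and at least one node are up, the script should print the hostname of whichever compute node picked up the job.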

LDAP

An annoying trait of Slurm is that if user Bob runs a job, Bob must have an account on every node that the job runs on.
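
To see whether a given node knows about a user, a small check like the one below can be run on each node. This is only a sketch: `user_exists` is a name invented here, and "bob" is a placeholder username. It relies on Python's standard `pwd` module, which asks the system's account database, so it should also see LDAP users once the node is set up as an LDAP client through NSS.

```python
import pwd


def user_exists(username, lookup=pwd.getpwnam):
    """Return True if this node can resolve `username` to an account."""
    try:
        lookup(username)  # pwd.getpwnam raises KeyError for unknown users
    except KeyError:
        return False
    return True


if __name__ == "__main__":
    name = "bob"  # placeholder; substitute a real user from your directory
    status = "known" if user_exists(name) else "NOT known"
    print(f"User {name!r} is {status} on this node")
```

A job for Bob can only run on nodes where this check succeeds.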

I've taken the LDAP route to sync users across the cluster. The Docker image ldap_host hosts an LDAP server with a PHP web interface (LAM, "LDAP Account Manager", because phpldapadmin is old and busted).

Each of the other Docker images is configured as an LDAP client. If you are using this for your own infrastructure, it shouldn't be too hard to drop my ldap_host image and use your own LDAP server.

No Warranty

I've wasted far too many weekends getting Slurm to work.

I hope you find this useful.

Don't expect this to work forever.

About

Minimum working example of a slurm cluster, using LDAP and Docker
