Consolidate vagrant setup for module validation on a virtual cluster #17

Closed
Falkor opened this issue Feb 1, 2019 · 0 comments

Falkor commented Feb 1, 2019

The Vagrant-based deployment currently focuses on a single VM and quickly runs a few test manifests, but it does not really allow testing the slurm module in an environment as close as possible to a real cluster.
In particular, the flexibility that Hiera could offer for custom checks in a multi-VM deployment (each VM with a dedicated role) is not yet addressed.

Objectives

  1. We should allow for a full virtual cluster setup by default, including:

    • 1 Slurm controller (including the Slurm accounting DB)
    • 1 login node
    • 2 or more compute nodes
  2. Provisioning of each VM should be handled by vagrant/puppet using the current module and sample profile classes which, coupled with Hiera, would illustrate how the module is used in the ULHPC control repo.

  3. Modifying/customizing the default deployment setup (number of compute nodes, hosting the Slurm accounting DB on a separate host, etc.) should be made flexible through changes in a single config.yaml file that overrides the default settings.

  4. A Hiera hierarchy with default settings for the module should be proposed (to illustrate again the way we use this module) and, more importantly, should also allow quick custom tests to be performed by placing hieradata/custom.yaml (not tracked in the Git repository) at the highest level of the hierarchy, so that changes can be applied quickly across all or a subset of the deployed VMs with vagrant provision --provision-with puppet
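
As a rough illustration of objective 3, the Ruby sketch below shows one way a Vagrantfile could layer an optional config.yaml over built-in defaults and expand the result into a list of VMs. It is only a sketch under stated assumptions: the key names (`slurm`, `count`, `with_slurmdbd`), the generated host names, and the helpers `deep_merge`, `load_settings` and `cluster_nodes` are all hypothetical and are not taken from the module or its actual Vagrantfile.

```ruby
require 'yaml'

# Hypothetical defaults mirroring the default virtual cluster described above:
# 1 controller (hosting the accounting DB), 1 login node, 2 compute nodes.
DEFAULT_SETTINGS = {
  'slurm' => {
    'controller' => { 'count' => 1, 'with_slurmdbd' => true },
    'login'      => { 'count' => 1 },
    'compute'    => { 'count' => 2 }
  }
}.freeze

# Recursively merge `override` into `base`; values from `override` win,
# and nested hashes are merged key by key instead of being replaced.
def deep_merge(base, override)
  base.merge(override) do |_key, old, new|
    old.is_a?(Hash) && new.is_a?(Hash) ? deep_merge(old, new) : new
  end
end

# Load the defaults, overridden by config.yaml when that file exists.
# An empty file yields nil/false from YAML, hence the `|| {}` fallback.
def load_settings(path = 'config.yaml')
  custom = File.exist?(path) ? (YAML.load_file(path) || {}) : {}
  deep_merge(DEFAULT_SETTINGS, custom)
end

# Expand the settings into a flat list of VMs, each with a name and a role.
# A dedicated slurmdbd VM is added only when the DB is split off the controller.
def cluster_nodes(settings)
  roles = settings['slurm']
  nodes = roles.flat_map do |role, opts|
    (1..opts['count']).map { |i| { name: "#{role}-#{i}", role: role } }
  end
  nodes << { name: 'slurmdbd-1', role: 'slurmdbd' } unless roles['controller']['with_slurmdbd']
  nodes
end

if __FILE__ == $PROGRAM_NAME
  cluster_nodes(load_settings).each { |n| puts "#{n[:name]} (#{n[:role]})" }
end
```

With such a layout, a config.yaml containing only `slurm: { compute: { count: 4 } }` would change the number of compute nodes while leaving every other default untouched, which is the kind of single-file customization the objective asks for.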

Falkor self-assigned this Feb 1, 2019
Falkor added this to the 1.2 milestone Feb 1, 2019
Falkor pushed a commit that referenced this issue Feb 1, 2019
- Bugfix #17
- config.yaml can overwrite default settings
- role-based deployment
- By default, deployment of:
    * 1 Slurm controller (including the Slurm accounting DB)
    * 1 login node (no daemon -- see #7)
    * 2 compute nodes

Signed-off-by: Sebastien Varrette <Sebastien.Varrette@uni.lu>
Falkor closed this as completed Feb 1, 2019
Falkor pushed a commit that referenced this issue Feb 3, 2019
TRES, scheduler parameters and custom content management

- Bugfix #2: slurmdbd on separate host bug
- Bugfix #7: slurm-login nodes without daemons
- Bugfix #12: slurm-17.11.3 install fails bug
- Bugfix #13: Add SchedulerParameters to slurm.conf.erb template
- Bugfix #14: Prevent improper use of SchedMD resources
- Bugfix #16: Allow for TRESBillingWeights enhancement
- Bugfix #17: Consolidate vagrant setup for module validation on a virtual cluster enhancement
- Bugfix #18: Allow for custom content enhancement
uvNikita pushed a commit to uvNikita/puppet-slurm that referenced this issue Apr 5, 2019
- Bugfix ULHPC#17
- config.yaml can overwrite default settings
- role-based deployment
- By default, deployment of:
    * 1 Slurm controller (including the Slurm accounting DB)
    * 1 login node (no daemon -- see ULHPC#7)
    * 2 compute nodes

Signed-off-by: Sebastien Varrette <Sebastien.Varrette@uni.lu>