[FEATURE] Build slurm for distribution of the submitter #177

Closed
cartalla opened this issue Nov 29, 2023 · 0 comments · Fixed by #182

Is your feature request related to a problem? Please describe.

If you configure a submitter that uses a different OS distribution than the cluster, the modulefile for that distribution doesn't exist and the
cluster's Slurm binaries are incompatible with the submitter.

Describe the solution you'd like

Detect this, build a version of Slurm on the submitter instance, and then create a modulefile for the submitter's distribution.
Save the results on the cluster's shared file system so that they can be used by all submitters with the same OS.
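
A minimal sketch of the detection step described above, assuming the distribution is read from os-release style text and that each build is stored in a per-distribution directory on the shared file system. The function names, the directory layout, and the `bin/sbatch` marker are hypothetical and not taken from the project.

```python
from pathlib import Path


def parse_os_release(text: str) -> dict:
    """Parse os-release style KEY=value lines into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        fields[key] = value.strip().strip('"').strip("'")
    return fields


def distribution_tag(fields: dict) -> str:
    """Build a tag such as 'rocky-8' from ID and the major part of VERSION_ID."""
    distro = fields.get("ID", "unknown")
    major = fields.get("VERSION_ID", "").split(".")[0]
    return f"{distro}-{major}" if major else distro


def needs_build(shared_root: Path, tag: str) -> bool:
    """Return True if no build for this distribution exists under shared_root."""
    # Hypothetical layout: <shared_root>/<tag>/bin/sbatch marks a finished build.
    return not (shared_root / tag / "bin" / "sbatch").exists()
```

A submitter would call `parse_os_release(Path("/etc/os-release").read_text())`, derive its tag, and build Slurm only when `needs_build` returns `True`, so that later submitters with the same OS reuse the saved result.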

cartalla self-assigned this Nov 29, 2023
cartalla added a commit that referenced this issue Nov 30, 2023
So that it can submit jobs to a cluster that uses a different OS.

Resolves #177
cartalla added a commit that referenced this issue Dec 1, 2023
So that it can submit jobs to a cluster that uses a different OS.

Fix errors in configuration scripts.

Resolves #177
cartalla added a commit that referenced this issue Dec 3, 2023
So that it can submit jobs to a cluster that uses a different OS.

Automate the configuration of the cluster manager and submitters in RES.

Fix errors in configuration scripts.

Add a parameter to the ParallelCluster custom resource that has a hash
of all of the config file contents so that the resource gets updated
whenever anything changes.
The current issue with this implementation is that the ParallelCluster
still doesn't change, so even though the custom resource is updated,
the cluster isn't, because its configuration is unchanged.
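
A minimal sketch of the hashing idea, assuming the config files are plain files on disk and that SHA-256 is an acceptable digest. The function name is hypothetical, and this is not the project's actual implementation.

```python
import hashlib
from pathlib import Path


def config_files_hash(paths):
    """Return one SHA-256 hex digest covering the names and contents of all files."""
    digest = hashlib.sha256()
    # Sort so that the result does not depend on the order the paths are given in.
    for path in sorted(Path(p) for p in paths):
        digest.update(path.name.encode("utf-8"))
        digest.update(b"\0")
        digest.update(path.read_bytes())
        digest.update(b"\0")
    return digest.hexdigest()
```

Passing the returned string as a property of the custom resource changes that property whenever any file changes, which is what triggers the resource update.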

Fix a bug in configuring licenses.
After initial configuration the playbook was failing on subsequent updates.

Resolves #177
Resolves #178
Resolves #180
Resolves #181
cartalla added a commit that referenced this issue Dec 5, 2023
So that it can submit jobs to a cluster that uses a different OS.

Automate the configuration of the cluster manager and submitters in RES.

Fix errors in configuration scripts.

Add a parameter to the ParallelCluster custom resource that has a hash
of all of the config file contents so that the resource gets updated
whenever anything changes.
The current issue with this implementation is that the ParallelCluster
still doesn't change, so even though the custom resource is updated,
the cluster isn't, because its configuration is unchanged.

Fix a bug in configuring licenses.
After initial configuration the playbook was failing on subsequent updates.

Support a custom munge key, which must be specified in Secrets Manager.

Try to allow RES /home to be mounted, but this currently causes a validation error.

Resolves #177
Resolves #178
Resolves #180
Resolves #181
cartalla linked a pull request Dec 5, 2023 that will close this issue
cartalla added a commit that referenced this issue Dec 8, 2023
So that it can submit jobs to a cluster that uses a different OS.

Automate the configuration of the cluster manager and submitters in RES.

Fix errors in configuration scripts.

Add a parameter to the ParallelCluster custom resource that has a hash
of all of the config file contents so that the resource gets updated
whenever anything changes.
The current issue with this implementation is that the ParallelCluster
still doesn't change, so even though the custom resource is updated,
the cluster isn't, because its configuration is unchanged.

Fix a bug in configuring licenses.
After initial configuration the playbook was failing on subsequent updates.

Support a custom munge key, which must be specified in Secrets Manager.

Try to allow RES /home to be mounted, but this currently causes a validation error.

Resolves #177
Resolves #178
Resolves #180
Resolves #181
cartalla added a commit that referenced this issue Dec 9, 2023
So that it can submit jobs to a cluster that uses a different OS.

Automate the configuration of the cluster manager and submitters in RES.

Fix errors in configuration scripts.

Add a parameter to the ParallelCluster custom resource that has a hash
of all of the config file contents so that the resource gets updated
whenever anything changes.
The current issue with this implementation is that the ParallelCluster
still doesn't change, so even though the custom resource is updated,
the cluster isn't, because its configuration is unchanged.

Fix a bug in configuring licenses.
After initial configuration the playbook was failing on subsequent updates.

Support a custom munge key, which must be specified in Secrets Manager.

Try to allow RES /home to be mounted, but this currently causes a validation error.

Update EDA instance types to the latest so that all memory sizes get built.

Resolves #177
Resolves #178
Resolves #180
Resolves #181
cartalla added a commit that referenced this issue Dec 11, 2023
So that it can submit jobs to a cluster that uses a different OS.

Automate the configuration of the cluster manager and submitters in RES.

Fix errors in configuration scripts.

Add a parameter to the ParallelCluster custom resource that has a hash
of all of the config file contents so that the resource gets updated
whenever anything changes.
The current issue with this implementation is that the ParallelCluster
still doesn't change, so even though the custom resource is updated,
the cluster isn't, because its configuration is unchanged.

Fix a bug in configuring licenses.
After initial configuration the playbook was failing on subsequent updates.

Support a custom munge key, which must be specified in Secrets Manager.

Try to allow RES /home to be mounted, but this currently causes a validation error.

Update EDA instance types to the latest so that all memory sizes get built.

Resolves #177
Resolves #178
Resolves #180
Resolves #181
cartalla added a commit that referenced this issue Dec 12, 2023
So that it can submit jobs to a cluster that uses a different OS.

Automate the configuration of the cluster manager and submitters in RES.

Fix errors in configuration scripts.

Add a parameter to the ParallelCluster custom resource that has a hash
of all of the config file contents so that the resource gets updated
whenever anything changes.
The current issue with this implementation is that the ParallelCluster
still doesn't change, so even though the custom resource is updated,
the cluster isn't, because its configuration is unchanged.

Fix a bug in configuring licenses.
After initial configuration the playbook was failing on subsequent updates.

Support a custom munge key, which must be specified in Secrets Manager.

Try to allow RES /home to be mounted, but this currently causes a validation error.

Update EDA instance types to the latest so that all memory sizes get built.

Restrict lambda function IAM roles and clean up cfn_nag errors and warnings.

Resolves #177
Resolves #178
Resolves #180
Resolves #181
cartalla added a commit that referenced this issue Dec 13, 2023
So that it can submit jobs to a cluster that uses a different OS.

Automate the configuration of the cluster manager and submitters in RES.

Fix errors in configuration scripts.

Add a parameter to the ParallelCluster custom resource that has a hash
of all of the config file contents so that the resource gets updated
whenever anything changes.
The current issue with this implementation is that the ParallelCluster
still doesn't change, so even though the custom resource is updated,
the cluster isn't, because its configuration is unchanged.

Fix a bug in configuring licenses.
After initial configuration the playbook was failing on subsequent updates.

Support a custom munge key, which must be specified in Secrets Manager.

Try to allow RES /home to be mounted, but this currently causes a validation error.

Update EDA instance types to the latest so that all memory sizes get built.

Restrict lambda function IAM roles and clean up cfn_nag errors and warnings.

Resolves #177
Resolves #178
Resolves #180
Resolves #181
cartalla added a commit that referenced this issue Dec 14, 2023
So that it can submit jobs to a cluster that uses a different OS.

Automate the configuration of the cluster manager and submitters in RES.

Fix errors in configuration scripts.

Add a parameter to the ParallelCluster custom resource that has a hash
of all of the config file contents so that the resource gets updated
whenever anything changes.
The current issue with this implementation is that the ParallelCluster
still doesn't change, so even though the custom resource is updated,
the cluster isn't, because its configuration is unchanged.

Fix a bug in configuring licenses.
After initial configuration the playbook was failing on subsequent updates.

Support a custom munge key, which must be specified in Secrets Manager.

Try to allow RES /home to be mounted, but this currently causes a validation error.

Update EDA instance types to the latest so that all memory sizes get built.

Restrict lambda function IAM roles and clean up cfn_nag errors and warnings.

Resolves #177
Resolves #178
Resolves #180
Resolves #181
cartalla added a commit that referenced this issue Dec 15, 2023
So that it can submit jobs to a cluster that uses a different OS.

Automate the configuration of the cluster manager and submitters in RES.

Fix errors in configuration scripts.

Add a parameter to the ParallelCluster custom resource that has a hash
of all of the config file contents so that the resource gets updated
whenever anything changes.
The current issue with this implementation is that the ParallelCluster
still doesn't change, so even though the custom resource is updated,
the cluster isn't, because its configuration is unchanged.

Fix a bug in configuring licenses.
After initial configuration the playbook was failing on subsequent updates.

Support a custom munge key, which must be specified in Secrets Manager.

Try to allow RES /home to be mounted, but this currently causes a validation error.

Update EDA instance types to the latest so that all memory sizes get built.

Restrict lambda function IAM roles and clean up cfn_nag errors and warnings.

Resolves #177
Resolves #178
Resolves #180
Resolves #181
cartalla added a commit that referenced this issue Dec 16, 2023
So that it can submit jobs to a cluster that uses a different OS.

Automate the configuration of the cluster manager and submitters in RES.

Fix errors in configuration scripts.

Add a parameter to the ParallelCluster custom resource that has a hash
of all of the config file contents so that the resource gets updated
whenever anything changes.
The current issue with this implementation is that the ParallelCluster
still doesn't change, so even though the custom resource is updated,
the cluster isn't, because its configuration is unchanged.

Fix a bug in configuring licenses.
After initial configuration the playbook was failing on subsequent updates.

Support a custom munge key, which must be specified in Secrets Manager.

Try to allow RES /home to be mounted, but this currently causes a validation error.

Update EDA instance types to the latest so that all memory sizes get built.

Restrict lambda function IAM roles and clean up cfn_nag errors and warnings.

Resolves #177
Resolves #178
Resolves #180
Resolves #181