Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make an sbatch launcher class #12

Merged
merged 8 commits into from
Mar 13, 2018
Merged

Conversation

mdpiper
Copy link
Member

@mdpiper mdpiper commented Mar 12, 2018

The new CSDMS cluster, blanca, uses Slurm as a job scheduler instead of Torque, which implies a new Launcher class.

This launcher, an analogue to the qsub and bash launcher classes,
will be used on blanca.

I hardcoded the Slurm `module load` call into the class. This needs
to be changed, perhaps read from some environment setting.
The purpose of the run script is to call `module load slurm/blanca`,
which has to done before calling the launch script (which includes
SBATCH commands), and which can't be called through subprocess.call.
To do this, I added a new class attribute, `_run_script`, a new attr
to SbatchLauncher, `run_script_path`, and I overrode several methods.
This was causing jobs to fail on the CSDMS compute node on
blanca. (They would run on the login node.)

@mcflugen pointed out that this may be a hardware issue--the compute
nodes may vary. I'll try to isolate this and notify RC.
@mdpiper
Copy link
Member Author

mdpiper commented Mar 12, 2018

*software, not hardware, issue in 3517098. We likely just need Qt installed.

I experimented with os.system to see whether it behaved differently.
Answer=no.
@mdpiper mdpiper merged commit f3c58e8 into master Mar 13, 2018
@mdpiper mdpiper deleted the mdpiper/make-sbatch-launcher branch March 13, 2018 21:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant