Tutorial for SmartSim on Slurm #30

Closed
5 tasks done
Spartee opened this issue Apr 6, 2021 · 5 comments
Labels
area: docs Issues related to documentation

Comments

@Spartee
Contributor

Spartee commented Apr 6, 2021

Description

Add a tutorial for using SmartSim on Slurm-based systems.

Justification

Currently, we have only two tutorials, and neither covers Slurm functionality.

Implementation Strategy

The following items should be covered in a SmartSim-Slurm tutorial:

  • Launching jobs in a previously obtained interactive Slurm allocation (using SrunSettings)
  • Launching batch jobs with SbatchSettings in ensembles
  • Getting and releasing allocations through the Slurm interface
  • Launching on allocations obtained through the smartsim.slurm interface
  • Creating and launching the SlurmOrchestrator on interactive allocations and as a batch
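
As a rough starting point for the first four items, the cell below sketches what such a tutorial might contain. It assumes the SmartSim 0.3.x Python interface (`Experiment`, `SrunSettings`, `SbatchSettings`, `slurm.get_slurm_allocation`, `slurm.release_slurm_allocation`); names may differ in other releases. The experiment and model names, node counts, and the `slurm_ready` guard are illustrative choices, not part of SmartSim.

```python
import importlib.util
import shutil

# Illustrative workload shared by every example below.
HELLO = {"exe": "echo", "exe_args": "Hello World!"}


def slurm_ready():
    """Return True only if SmartSim is importable and srun is on PATH."""
    return (
        importlib.util.find_spec("smartsim") is not None
        and shutil.which("srun") is not None
    )


def run_on_allocation(exp, alloc_id=None):
    """Launch one model with srun.

    With alloc_id=None the job runs in the allocation the notebook itself
    was started in (the interactive case); otherwise it runs on alloc_id.
    """
    from smartsim.settings import SrunSettings  # assumed 0.3.x import path

    srun = SrunSettings(alloc=alloc_id, **HELLO)
    srun.set_nodes(1)
    srun.set_tasks(1)
    model = exp.create_model("hello_world", srun)
    exp.start(model, block=True, summary=True)
    return exp.get_status(model)


def run_with_managed_allocation(exp, nodes=1):
    """Obtain an allocation, launch on it, and always release it."""
    from smartsim import slurm  # assumed 0.3.x function names below

    alloc_id = slurm.get_slurm_allocation(nodes=nodes)
    try:
        return run_on_allocation(exp, alloc_id)
    finally:
        slurm.release_slurm_allocation(alloc_id)


def run_batch_ensemble(exp, replicas=4):
    """Submit an ensemble as a single sbatch job with one node per member."""
    from smartsim.settings import SbatchSettings, SrunSettings

    sbatch = SbatchSettings(nodes=replicas, time="00:10:00")
    srun = SrunSettings(**HELLO)
    srun.set_nodes(1)
    srun.set_tasks(1)
    ensemble = exp.create_ensemble(
        "hello_ensemble",
        batch_settings=sbatch,
        run_settings=srun,
        replicas=replicas,
    )
    exp.start(ensemble, block=True, summary=True)
    return exp.get_status(ensemble)


# Only touch the scheduler when SmartSim and Slurm are both present.
if slurm_ready():
    from smartsim import Experiment

    exp = Experiment("slurm_tutorial", launcher="slurm")
    print(run_with_managed_allocation(exp))
```

The SmartSim imports are deferred into the functions so that the cell can be loaded on a machine without SmartSim or Slurm; the orchestrator item is left out of this sketch.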

The tutorial should be implemented as a Jupyter Notebook and hosted as part of the documentation using nbsphinx.
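
For the nbsphinx side, a Sphinx configuration along these lines is one possible setup. The file location (`conf.py` in the docs directory) and the choice not to execute notebooks at build time are assumptions; the latter simply reflects that a documentation build machine is unlikely to have Slurm.

```python
# Fragment of a Sphinx conf.py; the rest of the file is assumed to exist.

extensions = [
    "nbsphinx",  # renders .ipynb notebooks as documentation pages
]

# Keep Jupyter's autosave copies out of the build.
exclude_patterns = ["_build", "**.ipynb_checkpoints"]

# Use the outputs stored in the notebook instead of re-running cells,
# since the build host probably cannot launch Slurm jobs.
nbsphinx_execute = "never"
```

With this setting, the notebook would need to be run once on a Slurm system and committed with its outputs saved.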

@Spartee added the area: Slurm and area: docs labels on Apr 6, 2021
@neuralvis

I have started some examples based on your pointers, @Spartee: https://github.com/neuralvis/SmartSim/tree/slurm_examples/tutorials/slurm/HelloSmartSim

They still need to be cleaned up a bit, and additional examples would be good too. But hopefully they cover the first two items on your list!

@Spartee
Contributor Author

Spartee commented Apr 7, 2021

@neuralvis these look great! A couple of notes:

  • It looks like you're using 0.2.0. You should update to 0.3.0 so that you are on a stable version and don't need to worry about the SMARTSIMHOME path. You can follow the instructions for the quick install (or the full install if you want GPU support), but clone the repo instead of building from the release. The pip install instructions work the same from source as from the release tarball. The interface should be mostly the same, so the examples don't need to change much.
  • We have been shifting to doing tutorials in Jupyter and "sphinxing" them with nbsphinx. It would be great if we could put these examples in a notebook, like this tutorial we did.

Let me know if I can help with anything.

Thanks!

@neuralvis

Thanks for the notes, @Spartee! I'll port these starter examples into a notebook format. Are the examples expected to work from Jupyter as well? Or are they merely "written" in notebook style to "sphinxify" them?

@Spartee
Contributor Author

Spartee commented Apr 12, 2021

All the examples should work from Jupyter. If you haven't set up Jupyter manually on an XC (or EX :) before, I can also help with that.

@Spartee
Contributor Author

Spartee commented Aug 12, 2021

Closing, as the README now has Slurm examples.

@Spartee closed this as completed on Aug 12, 2021

2 participants