Skip to content

kageback/sge-python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

68 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

sge-python

SGE-Python makes it easy to run python code on a GridEngine (GE) cluster and help you manage the results, getting rid of the need to create shell script or manually submitting jobs to the queue .

SGE-Python is designed to make the underlying execution transperent so that you can run the same code when developing/debugging localally as when running the your experiment on the cluster. Further, it lets you shedule the entire experiment at ones even when there are dependencies between the jobs and GE will subseqently run each job in parallel by stepping through the job dependency graph. This makes hyperparameter search in partally shared pipelines, e.g. where some of the initial steps in the pipeline is shared between experiments, easy to set up and efficent whichout loosing traceablitity of each processing step taken for each individual hyperparameter setting.

Note: SGE-Python was developed and used by me during my PhD in Machine Learning so the use cases it has been tested for are mainly connected to that area.

Example Use cases - Hyper Param Search

installation

(1) Clone/download the repositry to some place you can import from. E.g. add it as a submodule in the root folder of your project by running: git submodule add https://github.com/kageback/sge-python.git sge_python

(2) Optional: If you are going to run the experiment from another computer than the cluster head node you need to set up a shared ssh key between the compter running the experiment and the cluster head node.

GPU allocation

This lib has support for allocating GPU resources on the grid. However, it assumes that the SGE instance has been set up using using to handle GPUs as a resource. E.g. using https://github.com/kageback/sge-gpuprolog

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages