The FRNN code workflow is similar to that of typical distributed deep learning projects. First, the raw data is preprocessed and normalized. The preprocessing step involves cutting, resampling, and structuring the data, as well as determining and validating the disruptive properties of the shots considered. Several normalization options are implemented.
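As a simplified illustration of the cutting, resampling, and normalization steps (the function names and the particular normalization shown are assumptions for illustration, not FRNN's actual API):

```python
import numpy as np

def cut_and_resample(t, y, t_min, t_max, dt):
    # Clip the signal to the window of interest and resample it
    # onto a uniform time base.
    mask = (t >= t_min) & (t <= t_max)
    t_uniform = np.arange(t_min, t_max, dt)
    return t_uniform, np.interp(t_uniform, t[mask], y[mask])

def z_normalize(y):
    # One possible normalization: zero mean, unit variance per signal.
    return (y - np.mean(y)) / (np.std(y) + 1e-8)
```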
Second, during distributed data-parallel training, the model parameters are checkpointed to disk after each epoch in HDF5 file format. Finally, for the cross-validation and prediction step on unlabeled data, it is planned to also implement a hyperparameter tuning approach using a random search algorithm.
The results are stored as HDF5 files, including the final neural network model parameters together with statistical summaries of the variables used during training, allowing researchers to produce learning curves and performance summary plots.
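A minimal sketch of such per-epoch HDF5 checkpointing with h5py (the function and file layout here are assumptions for illustration, not FRNN's actual output format):

```python
import h5py

def checkpoint(model, epoch, loss_history, path='results'):
    # Write the current Keras model weights plus the training-loss history
    # to one HDF5 file per epoch; the dataset layout is illustrative.
    with h5py.File('%s/epoch_%03d.h5' % (path, epoch), 'w') as f:
        for i, w in enumerate(model.get_weights()):
            f.create_dataset('weights/layer_%03d' % i, data=w)
        f.create_dataset('stats/training_loss', data=loss_history)
```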
The Fusion Recurrent Neural Net (FRNN) deep learning code is implemented as a Python package consisting of 4 core modules:
- models: Python classes to construct, train, and optimize deep RNN models, including a distributed data-parallel implementation of mini-batch gradient descent with MPI (see the gradient-averaging sketch after this list).
- preprocessors: signal preprocessing and normalization classes, including the methods necessary to prepare physical data for stateful RNN training.
- primitives: domain-specific abstractions implemented as Python classes. For instance, Shot represents a measurement of plasma current as a function of time; a Shot object carries attributes for the unique shot identifier, the disruption time in milliseconds, the time profile of the shot converted to time-to-disruption values, the validity of the shot (whether the plasma current reaches a certain value during the shot), etc. A minimal sketch of such a class follows the list.
- utilities: a set of auxiliary functions for preprocessing, performance evaluation, and learning-curve analysis.
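To make the distributed training in the models module concrete, here is a minimal sketch of data-parallel gradient averaging with mpi4py; it illustrates the general technique rather than FRNN's actual training loop:

```python
from mpi4py import MPI
import numpy as np

comm = MPI.COMM_WORLD

def average_gradients(local_grads):
    # Each worker computes gradients on its own mini-batch shard;
    # Allreduce sums them across workers, and dividing by the worker
    # count gives every worker the same averaged update to apply.
    averaged = []
    for g in local_grads:
        total = np.zeros_like(g)
        comm.Allreduce(g, total, op=MPI.SUM)
        averaged.append(total / comm.Get_size())
    return averaged
```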
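And a minimal sketch of the Shot primitive, with attribute and method names assumed for illustration (see the primitives module for the actual definition):

```python
import numpy as np

class Shot(object):
    """Illustrative sketch of the Shot primitive; not FRNN's actual class."""

    def __init__(self, number, t_disrupt, signals, min_current=1.0e5):
        self.number = number        # unique shot identifier
        self.t_disrupt = t_disrupt  # disruption time in ms, or None
        self.signals = signals      # dict: signal name -> time-series array
        # A shot is valid if the plasma current reaches a threshold
        # (the threshold value and signal key are assumed here).
        self.valid = np.max(np.abs(signals['plasma_current'])) >= min_current

    def is_disruptive(self):
        return self.t_disrupt is not None

    def time_to_disruption(self, t):
        # Convert a time profile into time-to-disruption values.
        return self.t_disrupt - t if self.is_disruptive() else None
```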
This is a pure Python implementation for Python versions 2.6 and 2.7.
The package comes with a standard setup script and a list of dependencies, which include mpi4py, Theano, Keras, h5py, and Pathos. It also requires a standard set of CUDA drivers to run on GPUs.
Run:

```bash
pip install -i https://testpypi.python.org/pypi plasma
```

Optionally add --user to install in a home directory.
Alternatively, use the setup script:

```bash
python setup.py install
```

Add sudo if superuser permissions are needed, or --home=~ to install in a home directory. The latter option requires an appropriate PYTHONPATH.
On a cluster using environment modules, load the dependencies and install within a conda environment:

```bash
module load anaconda cudatoolkit/7.5 cudann openmpi/intel-16.0/1.8.8/64
source activate environment
python setup.py install
```

where environment should contain the Python packages listed in the requirements.txt file.
Next, preprocess the data:

```bash
python guarantee_preprocessed.py
```

Use the Slurm scheduler to perform batch or interactive analysis on the Tiger cluster.
For batch analysis, make sure to allocate 1 process per GPU:
```bash
#SBATCH -N X
#SBATCH --ntasks-per-node=4
#SBATCH --ntasks-per-socket=2
#SBATCH --gres=gpu:4
```

where X is the number of nodes for distributed data-parallel training.
Submit the batch job with:

```bash
sbatch slurm.cmd
```

For interactive analysis, the workflow is to request an interactive session:
```bash
salloc -N [X] --ntasks-per-node=16 --ntasks-per-socket=8 --gres=gpu:4 -t 0-6:00
```

where the total number of GPUs is X * 4.
Then launch the application from the command line:
```bash
cd plasma-python
mpirun -npernode 4 python examples/mpi_learn.py
```

Note: Theano compilation takes place during the first epoch, which distorts timing. For testing, it is recommended to set num_epochs >= 2 in conf.py.
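As a sketch of why at least two epochs are needed for meaningful timing (train_one_epoch is a hypothetical placeholder, not part of FRNN):

```python
import time

def train_one_epoch():
    # Hypothetical placeholder for one full MPI training pass.
    time.sleep(0.1)

num_epochs = 2  # per the recommendation above
epoch_times = []
for epoch in range(num_epochs):
    start = time.time()
    train_one_epoch()
    epoch_times.append(time.time() - start)

# Discard the first epoch: it includes one-time Theano compilation overhead.
steady_state = epoch_times[1:]
print("mean epoch time: %.2f s" % (sum(steady_state) / len(steady_state)))
```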