Skip to content

Latest commit

 

History

History

DeepLearning-GPU

Deep Learning and GPU Examples

SWAN Open in Colab

This folder contains Jupyter notebook examples of common deep learning tools, running on GPU resources.
To use GPU resources in SWAN, you need to

  • Access SWAN from you browser: https://swan.cern.ch
  • Select a software stack with GPU
    • To get the latest version of the tools used here select the 'bleeding edge' software stack

Contact: Luca.Canali@cern.ch

Getting started with Deep Learning

These notebooks implement a classifier for digit recognition using the MNIST dataset, it is a sort of "Hello World!" for Deep Learning.

Deep Learning and basic Data pipelines

These notebooks provide examples of how to integrate Deep Learning frameworks with some basic data pipelines using Pandas to feed data into the DL training step.
They implement a Particle classifier using different DL frameworks. The data is stored in Parquet format, which is a columnar format that is very efficient for reading data, it processed using Pandas, and then fed into the DL training step.

More advanced Data pipelines

These examples show some more advanced data pipelines, useful for training with large data sets. They show how to use the Petastorm library to read data from Parquet files with TensorFlow and PyTorch, and how to use the TFRecord format with TensorFlow.

Additional complexity with models and data

These examples implement the same particle classifier as in the previous examples, but with a more complex model and bigger data set.

  • Data in TFRecord, TensorFlow on GPU, GRU-based model
    • Description: This notebook showcases the training process for the Inclusive Particle Classifier model, using data stored in TFRecord format.
    • TensorFlow is configured to run on a GPU, and a GRU-based model architecture is employed.
  • Data in TFRecord, TensorFlow on GPU, Transformer-based model
    • Description: This notebook showcases the training process for the Inclusive Particle Classifier model, using data stored in TFRecord format.
    • TensorFlow is configured to run on a GPU, and a Transformer-based model architecture is employed.