Pytorch Convolutional Neural Network Starter Template

Summary

After building out many Convolutional Neural Networks (CNNs) for several research projects I decided to make a template to make getting started easy while still having access to all the functions for per project customization if needed.

This template tackles most of the formats I have needed to load data and models so far (See next sections). If it doesn't cover your use case the built functions should provide a good starting point.

This template also contains a bare bones flask app that returns json response with the pytorch models predicted class.

Loading Data

Visit the Example Jupyter Notebook for examples of how to use the data loading functions. Or read next section on how to use data with train.py

pre-built functions exsist for loading image data in two different ways:

See the functions in helpers/data_loader.py

Directory (Images in folders seperated by class)

Directory data loader expects a root directory path containing sub folders for each class.

Example:

root/dog/xxx.png
root/cat/xxx.png
root/panda/xxx.png

if this root directory exsists in the same directory as this template:

use the path: root/

CSV

The CSV data loader expects a root directory and CSV file containing two columns:

relative path to image from root directory
label (class) of image ( Cat, Dog, Panda, etc..)

Example:

images within root directory

root/dog_xxx.png
root/cat_xxx.png
root/panda_xxx.png

labels.csv:

FilePath	Label
cat_xxx.png	cat
dog_xxx.png	dog
panda_xxx.png	Panda

if this root directory exsists in the same directory as this template:

use the path: root/

Default transforms

By default the only transforms applied to images is resize, toTensor, and normalization based on imagenet data set.

See the image_transforms function in helpers/data_loader.py to make changes

read about pytorch transforms here

img_transforms = transforms.Compose(
        [transforms.Resize(size=(img_size,img_size)),
                    transforms.ToTensor(),
                    transforms.Normalize(
                    mean=[0.485, 0.456, 0.406],
                    std=[0.229, 0.224, 0.225] )])

Training a Model

Visit the Example Jupyter Notebook for examples of how to use the individual pre-built training functions.

note: By default the template is set to use a pre-trained resnet50, VGG, or a model programmed from scratch. Reference the models folder for how a model is built. It is fairly straight foward to add more pre-built models and even add them as options to train.py. Experiement with your own by writing it in /models/scratch.py

All layers but output in Resnet and VGG are frozen by default

Using train.py:

train.py is an option to train directly from your terminal by passing in arguments. The argument options are very robust accounting for many data input options.

Argument Options for python train.py

--data_dir

Root directory for dataset. Required for all formats of data sets
default=None
Example: --data_dir=../datasets/animals/

--val_data

Validation dataset path. If none provided 20% of training data will be randomly taken for validation. If CSV label same roor directory is expected.
default=None
Example: --val_data=../datasets/dog_breeds/valid

--save_path

Path to save model and training history. folder and file name for wights and file name expacted. Extensions .pt & .csv wil be added.
default=None
example: --save_path=trained_models/dogbreeds_vgg

--csv_labels

Path to trainig data & labels if they are in a CSV. Expected to contain columns FilePath & Label
default=None
example: --csv_labels=../datasets/animals/labels.csv

--model_type

CNN model architecture to train with data
Options: vgg16, scratch, resnet50
default='resnet50'
example: --model_type=vgg16

--num_classes

Number of classes for model to predict
default=2
example: --num_classes=3

--n_epochs

Number of epochs to train model weights for
default=3
example: --n_epochs=30

--learn_rate

Learning rate for training model weights
default=0.001
example: --learn_rate=0.0005

--img_size

Dimensions to resize images for training.
default=244
Example: --img_size=64

--batch_size

batch size for training model
default=8
Example: --batch_size=32

--verbose

Option for a more verbose output while loading the data and models Most notably this will show a preview of images from the training and validation sets. Good for verifying your data is what you expect.
default=False
Example: --verbose=True

--device

Device for running model computations on. see CUDA semantics in pytorch
default='cpu'
Example: --device=cuda

--num_workers

number of workers for data loaders. Guidelines
default=0
Example: num_workers=2

Common examples:

Train with CSV Labels

Example loading data from a csv file with NO validation set

python train.py --verbose=True --device=cuda --num_classes=3 --n_epochs=30 
--learn_rate=0.001 --save_path=trained_models/test_train
--csv_labels=../datasets/animals/labels.csv --img_size=244 --batch_size=32 
--num_workers=0 --data_dir=../datasets/animals/

Example loading data from a csv file WITH validation set. Expected validation dataset is in same root dir as training dataset

python train.py --verbose=True --device=cuda --num_classes=3 --n_epochs=30 
--learn_rate=0.001 --save_path=trained_models/test_train
--csv_labels=../datasets/animals/labels.csv --img_size=244 --batch_size=32 
--num_workers=0 --data_dir=../datasets/animals/ 
--val_data=../datasets/test_animals/test_labels.csv

Training with Directory Labels:

Example loading data from a directory with NO validation set

python train.py --verbose=True --device=cuda --num_classes=3 --n_epochs=30 
--learn_rate=0.001 --save_path=trained_models/test_train --img_size=244 
--batch_size=32 --num_workers=0 --data_dir=../datasets/animals/

Example loading data from a directory WITH validation set

python train.py --device=cuda --num_classes=133 --n_epochs=30 
--learn_rate=0.001 --save_path=trained_models/test_train 
--img_size=244 --batch_size=32 --num_workers=0 
--data_dir=../datasets/dog_breeds/train --model_type=vgg16 
--val_data=../datasets/dog_breeds/valid

Predict Classes with models

Visit the Example Jupyter Notebook for examples of how to use the individual pre-built training functions.

Using test.py:

test.py is an option to test a model directly from your terminal by passing in arguments. Both testing from directory or CSV labels are options.

Run test.py

--model_type

CNN model architecture to load weights into
Options: vgg16, scratch, resnet50
default='resnet50'
example: --model_type=vgg16

--weights

Trained weights to use for prediction. Weights must match model type.
default=None
Example: trained_models/test_train.pt

--data_csv

Path to data & labels if they are in a CSV.
default=None
Example: --data_csv=../datasets/test_animals/test_labels.csv

--data_dir

Root directory for dataset. Required for all formats of data sets
default=None
Example: ../datasets/test_animals/

--num_classes

Number of classes the model was trained to predict
default=2
example: --num_classes=3

--device

Device for running model computations on. see CUDA semantics in pytorch
default='cpu'
Example: --device=cuda

Common examples:

prediction with CSV labeled data

python test.py --device=cuda --weights=trained_models/test_train.pt
--data_csv=../datasets/test_animals/test_labels.csv
--data_dir=../datasets/test_animals/ --num_classes=3 --model_type=vgg16

prediction with directory labeled data

python test.py --device=cuda --weights=trained_models/test_train_tmp.pt
--data_dir=../datasets/test_animals/ --num_classes=3 --model_type=vgg16

Run Flask Server

This is a very bare bones fask set up to help get you started hosting your model You can upload an image and get back a response with the prediction index.

The pytorch docs have a good flask example here.

I'll be building a more robust example in a different repo soon, but hopefully this will be enough to get you started!

app.py contains the flask app

flask_test.py contains some example requests you can modify to quickly check your app behavior on different images.

Windows:

set FLASK_ENV=development

set FLASK_APP=app.py

Mac/linux:

export FLASK_ENV=development

export FLASK_APP=app.py

flask run

Requirments

requirements.txt

The versions of libraries I used / tested on

python 3.7.5
cudatoolkit 10.1.243
flask 1.1.1
matplotlib 3.1.2
numpy 1.17.2
pandas 0.25.2
pillow 6.2.1
pytorch 1.3.1
seaborn 0.9.0
torchvision 0.4.2
jupyter 1.0.0 (optional: only if you want to load in notebook)
click 7.1.1
tqdm 4.41.1

Acknowledgements

Inspiration for building this came while I was doing Udacity's Deep Learning Nanodegree and working on the Dog Breed classification project. You can find their prompt here. This goes beyond the scope they covered but I want to thank them for a great intro to Pytorch.

And of course the offcial Pyorch docs were handy. Specifically the docs on creating a custom data loader, and running a model on flask

Name		Name	Last commit message	Last commit date
Latest commit History 118 Commits
examples		examples
helpers		helpers
models		models
.gitignore		.gitignore
README.md		README.md
animal_load_train_csv.py		animal_load_train_csv.py
app.py		app.py
flask_test.py		flask_test.py
requirements.txt		requirements.txt
shap_test.py		shap_test.py
test.py		test.py
train.py		train.py

sagecodes/cnn-pytorch-starter

Folders and files

Latest commit

History

Repository files navigation

Pytorch Convolutional Neural Network Starter Template

Summary

Loading Data

Directory (Images in folders seperated by class)

CSV

Default transforms

Training a Model

Using train.py:

Argument Options for python train.py

--data_dir

--val_data

--save_path

--csv_labels

--model_type

--num_classes

--n_epochs

--learn_rate

--img_size

--batch_size

--verbose

--device

--num_workers

Common examples:

Train with CSV Labels

Training with Directory Labels:

Predict Classes with models

Using test.py:

--model_type

--weights

--data_csv

--data_dir

--num_classes

--device

Common examples:

prediction with CSV labeled data

prediction with directory labeled data

Run Flask Server

Requirments

Acknowledgements

Improvments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages