A journey to explore neural style algorithm: From GPU to Mobile!
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.


A journey into Neural Style


This repository contains some work i did on the neural style algorithm to get a better sense of it. You can find all the different script to download, pretrain and train model. Using tensorflow, tensorboard and Keras.

You potentially can change the Keras backend to Theano, but you might have to tweak some stuff here and there then.


  • data: Holds all images input/outputs
  • docker: Holds the dockerfile used to run those experiments (see the docker section for more information)
  • mobile_app: Holds all the mobile files and README to make your tensorflow model work on IOS
  • models: Holds models/layers python python files + models architecture/weights output files
    • ATrouConvolution layer for Keras
    • ConvolutionTranspose2D layer for Keras
    • ScaledSigmoid layer for Keras
  • perf: Holds very simple perf scripts to have an idea on how much you loose when you go from the titan X to an iphone 6s...
  • tests: Holds some tests sherlocks
  • torch: Holds some work done in torch, especially a keras neural net importer
  • utils: Holds utils to do preprocessing/training etc.
  • vgg: Holds vgg19 data and script files
  • Python files:
    • alpha.py: See the impact of alpha parameter
    • layer_reconstruction.py: Reconstruct data with iteration from the VGG
    • layer_influence.py: See the influence of each layer against each layer on VGG
    • ltv.py: See the influence of loss total variation
    • predict.py: Generate new image from a trained model
    • pretrain_model.py: Pretrain model on overfit data (see below)
    • train.py: Train a pretrain model to achieve deep art
    • .sh: files: Some script to do batch work


You should know if you want to use GPU or CPU (you might be an immortal dude, who knows) and if you want everything installed locally or on a container

  • Use pip install -r requirements.txt to have everything locally (use virtualenv, you fool)
  • Use nvidia-docker to run my docker-image (Beware you need cuda 7.5 and cudnn5 locally):
# port 6006 used for tensorboard
nvidia-docker run -it -d -p 6006:6006 -v /my/aboslute/path/to/deepback:/root/deepback --name deepback morgangiraud/dlsak

I want to create deep art

First: Download data! (Be aware, this will download around 20GB of data...)

# From data directory

Second: Download the VGG19 and preprocess it!

# From vgg19 directory

Third: Build some overfit data to pretrain some models! (You can control how many overfit images you want)

# From root directory

Fourth: Pretrain models

If you only want to initialise a model weights, you can do it by setting the --nb_epoch parameter to 0 manually

# From root directory

Fifth: Finally, train your model!

# From root directory
python train.py --model_dir models/path/to/model

sixth: Generate some images!

# From root directory
./predict.sh path/to/my/model


Making it happen in your Iphone:

First: Export your trained model

python export_keras_model.py --model_dir models/path/to/model

Second: Copy the pb file into mobile_app/data directory

Third: See the Mobile Readme


Tensorboard feature is built-in only in the train.py file.

While you are training our model, you can see in real-time your loss evolution and intermediate images result (train/cross val):

docker exec -it my-nvidia-docker-name bash
tensorboard --logdir /path/to/my/parent/models/folder --reload_interval 20
# If you launch tensorbaord in an already populated folder, it might take time before it can 
# squash all the data

Access your board: http://my-dns.com:6006 and get something like that: Tensorboard

Tensorboard doc here

Performance result

We can see a x703 time slower in a tensorflow CPU non-quantized implementation versus the AWS GPU

Infrastructure Time
tensorflow GPU K420 AWS: ~0.0329s (batching) 0.026s (looping)
tensorflow mac os: 4.13s (batching) 3.31s (looping)
tensorflow mac os simulateur: 5.86s (float32) 16.9 (quantized, this is weird)
tensorflow iphone 6s CPU: 18.30s (float32)

Image size: 600x600

batching: all image are processed in parallel

looping: one image at a time

quantized: Using tensorflow quantization system, (Shouldn't be slower, probably needs a cleaner tf graph)

Other stuffs

Using ffmpeg to create unstable deep art videos

Resize: ffmpeg -i test_video.MOV -filter:v "crop=600:600:70:400" out.mp4 video to frames: ffmpeg -i out.mp4 -r 30/1 $filename%03d.jpeg predict frames HERE frames to video: ffmpeg -framerate 30/1 -i %03d.jpeg -c:v libx264 -r 30 -pix_fmt yuv420p out.mp4