Improving Performance To Allow Larger Images #22

Closed
ProGamerGov opened this issue Apr 8, 2017 · 9 comments

@ProGamerGov

ProGamerGov commented Apr 8, 2017

As you probably already know, deep-photo-styletransfer is based on the Neural-Style code, and thus shares the same strengths and weaknesses. The current deep-photo-styletransfer code in both neuralstyle_seg.lua and deepmatting_seg.lua is based on an older, less memory-efficient version of Neural-Style.

I am posting this proposal here because luanfujun is unlikely to change his code: doing so would make it differ from what was used to create the images in the research paper. The changes I am proposing are not very drastic, and they will allow larger images to be created.

In December 2016, this commit restructured the style loss and content loss functions into a more efficient form: jcjohnson/neural-style@ea75cbc

Specifically, as outlined in the commit message:

Lots of changes to enable much bigger images:

  1. Use modular content and style loss modules similar to fast-neural-style
    for cleaner logic around network setup.
  2. More memory-efficient Gram matrix implementation similar to
    fast-neural-style.
  3. Multi-gpu support! Use nn.GPU decorators to compute different layers
    of the loss network on different GPUs.
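
To make item 2 concrete, here is a minimal sketch of the memory-saving idea behind the fast-neural-style-style Gram matrix (an illustration of the technique, not the exact neural-style module): the C x H x W feature map is viewed as a C x (H*W) matrix and multiplied with its own transpose directly into preallocated output and gradient tensors, avoiding the large temporaries the older code created.

  require 'torch'
  require 'nn'

  -- Sketch only: a memory-lean Gram matrix module for a single C x H x W input
  -- (no batching); move it to the GPU with :cuda() like any other nn.Module.
  local GramSketch, parent = torch.class('nn.GramSketch', 'nn.Module')

  function GramSketch:__init()
    parent.__init(self)
  end

  function GramSketch:updateOutput(input)
    local C, H, W = input:size(1), input:size(2), input:size(3)
    local x_flat = input:contiguous():view(C, H * W)
    self.output:resize(C, C)
    self.output:mm(x_flat, x_flat:t())  -- G = F * F^T, written in place
    return self.output
  end

  function GramSketch:updateGradInput(input, gradOutput)
    local C, H, W = input:size(1), input:size(2), input:size(3)
    local x_flat = input:contiguous():view(C, H * W)
    self.gradInput:resize(C, H * W)
    self.gradInput:mm(gradOutput, x_flat)         -- dL/dF  = dL/dG   * F
    self.gradInput:addmm(gradOutput:t(), x_flat)  --        + dL/dG^T * F
    self.gradInput = self.gradInput:view(C, H, W)
    return self.gradInput
  end

The point is that the flattened view plus the in-place mm/addmm calls avoid the extra resize-and-copy steps of the older implementation.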

Quantifying these changes: the newer, more efficient code uses up to 2GB less GPU memory than the old code while performing the same task. The savings will likely grow further as the image size increases beyond the maximum of 1536 used in my tests.

I graphed the results of my experiments below, with blue as the old code, and red as the new code:

  • The GPU usage on the graph is measured in MiB.

  • An extra 2-4 MiB are used by the system for other things, depending on the chosen -image_size value, but I felt this amount was too small to factor into the graph.

  • You can find the two versions (old and new) of Neural-Style, here: https://gist.github.com/ProGamerGov/34cb206a1f0fa8d7e7a1d7aed0048554

  • The experiments were performed with a Tesla K80 GPU.

  • The -image_size values used were 256, 512, 1000, 1280, and 1536.

  • The old code failed at -image_size 1536 due to lack of memory, so its GPU usage there is likely well above 12GB.

  • The content image dimensions were: 1897x2441

  • The style image dimensions were: 3000x1688

The command I used for the experiments (with value replaced by each tested -image_size):

th neural_style_version.lua -image_size value -init image -output_image out.png -content_image in/tar1.png -style_image style/tar1.png -save_iter 50 -print_iter 50 -seed 876 -backend cudnn -cudnn_autotune 

If we update the deep-photo-styletransfer code to the new format, we should be able to create larger images on the same hardware.

Currently, I have gotten everything but the segmentation/masks working with the newer code, here. I need help getting the multi-color segmentation working.

@martinbenson
Owner

I'm definitely on board for this - in fact I'd noticed that there seemed to be a bunch of new stuff in Johnson's code and so had intended to spend some time today looking at merging his latest version. Presumably that's what you started in the gist above?

@ProGamerGov
Author

@martinbenson Yes, the gist has the Laplacian part set up with the newer code, but it's written so that the Laplacian code is only used if you supply a Laplacian.

@subzerofun

@ProGamerGov @martinbenson Thank you so much for your effort to extend the original code! I wish I could help, but unfortunately I'm just taking beginner courses in ML and don't have much experience with Torch or the Lua language.

And thanks for the info – I didn't know that the Neural-Style code had been improved to support larger resolutions; I will have to look at the project again! So with the updated code I should be able to reach 1000–1280 px images on a 6GB card, and up to 1536 px once I get my 1080 Ti – that would be nice.

BTW: I know you are probably Linux users, but I just read yesterday that with the launch of the new Titan XP, Nvidia will also release Mac drivers for the whole 10XX series. So hopefully more Mac users in the ML community can benefit from the faster cards with more VRAM!

@ProGamerGov
Author

@subzerofun With the updated code, in addition to the increased memory efficiency, you could also use your current 6GB card and your 11GB 1080 Ti at the same time. To do so, you would use the -multigpu_strategy parameter to split the workload so that each card is used to its fullest potential. This means you could go larger than 1536, though I don't know by how much.
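
As a rough example, a two-card run with the updated neural_style.lua might look like the command below. This is only a sketch: the -multigpu_strategy value (the layer index where the network is split between GPUs) and the -image_size are illustrative and would need tuning for a 6GB + 11GB pair.

th neural_style.lua -gpu 0,1 -multigpu_strategy 4 -image_size 1800 -backend cudnn -cudnn_autotune -content_image in/tar1.png -style_image style/tar1.png -output_image out.png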

@subzerofun

subzerofun commented Apr 8, 2017

@ProGamerGov Oh thanks, I didn't even think about that. But I would need to change my power supply for a second card (only 500W atm). Fortunately I have a 1000W power supply lying around :-).

I still have my old GTX 770 with 2GB VRAM – do you think it would help to combine it with my GTX 780 (6GB)?
Or would the 2 GB have a negative effect, because it could stall (or halt) the processing where more VRAM is needed?

So if i want to test the multi-gpu function, i could try the file neural_style_post-dec.lua from your gist?
https://gist.github.com/ProGamerGov/34cb206a1f0fa8d7e7a1d7aed0048554#file-neural_style_post-dec-lua

The only things missing from rewriting the original code now are the segmentation functions – or is there something else?

@ProGamerGov
Author

ProGamerGov commented Apr 8, 2017

@subzerofun neural_style_post-dec.lua is the most recent version of neural_style.lua from: https://github.com/jcjohnson/neural-style/blob/master/neural_style.lua

I still have my old GTX 770 with 2GB VRAM – do you think it would help to combine it with my GTX 780 (6GB)?
Or would the 2 GB have a negative effect, because it could stall (or halt) the processing where more VRAM is needed

I haven't seen anyone experiment with that combination, and I don't have multiple GPUs, so I can't say whether or not that would help; you'll probably have to experiment with it yourself.

The only things missing from rewriting the original code now are the segmentation functions – or is there something else?

Yes, as far as I know, only the segmentation related code is missing.

For a long time I have been trying to port the segmentation from NeuralImageSynthesis into the current version of Neural-Style, but I still haven't been able to get it working (specifically the style and content loss function related code). Then deep-photo-styletransfer came along, with masks that support multiple colors, but its code was built on an older, less efficient version of Neural-Style.
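
For anyone who wants to experiment with this in the meantime, the core idea behind a masked style loss (a simplified sketch of the general technique, not the NeuralImageSynthesis or deep-photo-styletransfer code) is to weight a layer's feature maps by the segmentation mask for one region, resized to that layer's resolution, before computing the Gram matrix, and to normalize by the mask area instead of the full H*W:

  -- Sketch: masked Gram matrix for one segmentation region.
  -- features: C x H x W feature map from a style layer
  -- mask:     H x W tensor in [0, 1], the region mask downsampled to this layer
  local function masked_gram(features, mask)
    local C, H, W = features:size(1), features:size(2), features:size(3)
    local mask_exp = mask:view(1, H, W):expand(C, H, W)  -- broadcast over channels
    local masked = torch.cmul(features, mask_exp)
    local F = masked:view(C, H * W)
    local G = torch.mm(F, F:t())
    local area = mask:sum()
    if area > 0 then
      G:div(C * area)  -- normalize by masked area rather than H * W
    end
    return G
  end

One such Gram matrix (and style loss term) would be computed per mask color, for both the style and generated images, which is roughly what the masked losses in neuralstyle_seg.lua do in the older code layout.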

@martinbenson
Owner

As a first step to this, I've moved to the newer version of Justin's neural style code.

I've not tried to sort out masking yet though - so as of now that functionality is removed/broken.

@martinbenson
Owner

Now done!

@ProGamerGov
Author

@martinbenson I found another way to further optimize the code: #32.

It seems that the way the mask regions are set up wastes a lot of precious GPU resources.
