GPU efficient Densenets #797
Conversation
update before transform
update 9/03/19
11/03/19
12/03/10 10:34pm
Codecov Report

@@            Coverage Diff             @@
##           master     #797      +/-   ##
==========================================
- Coverage   60.03%   51.87%    -8.16%
==========================================
  Files          64       34       -30
  Lines        5054     3352     -1702
  Branches      754      534      -220
==========================================
- Hits         3034     1739     -1295
+ Misses       1817     1484      -333
+ Partials      203      129       -74
@soumith Is the PR fine?
@soumith Is there anything else that needs to be done?
`efficient` as a flag is a bit misleading. Can you rename it to `memory_efficient`?
            self.add_module('denselayer%d' % (i + 1), layer)

    def forward(self, init_features):
        features = [init_features]
        for name, layer in self.named_children():
Is the ordering of this correct across all Python versions?
Worked on Python 3.6. Will check on 2.7 as well.
@soumith Works on Python 2.7.
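For reference, `nn.Module` stores submodules in an ordered dict, so `named_children()` yields them in registration order regardless of Python version. A minimal sketch of the naming scheme used in this dense block (`nn.Identity` is just a stand-in layer for illustration):

```python
import torch.nn as nn

block = nn.Module()
for i in range(3):
    # same naming pattern as the dense block in this PR
    block.add_module('denselayer%d' % (i + 1), nn.Identity())

# submodules are kept in an ordered dict, so iteration order
# matches the order in which they were registered
names = [name for name, _ in block.named_children()]
print(names)  # ['denselayer1', 'denselayer2', 'denselayer3']
```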
@@ -16,8 +17,17 @@
}


def _bn_function_factory(norm, relu, conv):
    def bn_function(*inputs):
I forgot why this concatenation is needed for checkpointing, can you remind me?
In a DenseNet block, the outputs of all previous layers are concatenated with the current input before being passed through a layer. Checkpointing saves memory by not storing these intermediate activations in the computation graph for the backward pass; instead, it recomputes them during the backward pass, which makes training slower.
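A rough sketch of how the checkpointed concatenation works. The factory mirrors the `_bn_function_factory` shown in the diff; the channel sizes and tensor shapes here are made up purely for illustration:

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint

def _bn_function_factory(norm, relu, conv):
    # concatenate all prior feature maps, then BN -> ReLU -> 1x1 conv
    def bn_function(*inputs):
        concated_features = torch.cat(inputs, 1)
        return conv(relu(norm(concated_features)))
    return bn_function

norm = nn.BatchNorm2d(8)
relu = nn.ReLU(inplace=True)
conv = nn.Conv2d(8, 4, kernel_size=1, bias=False)
bn_function = _bn_function_factory(norm, relu, conv)

# two "previous outputs" with 3 + 5 = 8 channels in total
x1 = torch.randn(2, 3, 8, 8, requires_grad=True)
x2 = torch.randn(2, 5, 8, 8, requires_grad=True)

# checkpoint() does not store the activations computed inside
# bn_function; it re-runs the function during the backward pass
out = checkpoint(bn_function, x1, x2)
out.sum().backward()
```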
From a quick look, it is non-trivial to me that both represent the same model. Can you add a test that compares the output of the model using both `memory_efficient=False` and `memory_efficient=True`?
Okay, I will do it.
I re-reviewed it today as well. After the test above, which makes sure the same function is computed, this PR is good to go.
I have added the test but there are conflicts. @fmassa Can you please help me resolve them?
@ekagra-ranjan do you want me to resolve the conflicts?
Yes @fmassa, that would be very helpful.
I've sent a new PR in #1003. All the history of the changes you have made has been kept.
In reference to #691, this PR provides the option for a memory-efficient implementation of the DenseNet models.
I tested the models (the original implementation, and the new implementation with `efficient=False` as well as with `efficient=True`) on the hymenoptera dataset, which gave the following results.

Benchmark results (batch size = 8, image size = 224x224):
There was no significant change in the accuracy of the trained models, i.e. the new implementation does not affect accuracy.
The new implementation with `efficient=True` seems to consume ~1.5x less GPU memory at the cost of ~1.4x more compute time.

cc: @soumith