
Share convolution buffers to reduce memory usage #2016

Open
shelhamer wants to merge 1 commit into master

Conversation

@shelhamer (Member) commented Mar 3, 2015

Share the columnation buffer for im2col / col2im transformations across all Caffe convolution layers. Memory usage then equals the maximum buffer size over all layers instead of their sum. This is particularly useful for many-layered architectures like the 19-layer VGG ILSVRC14 model.
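For illustration only, here is a minimal standalone sketch of the idea (not the actual diff; `SharedColBuffer` and `get()` are made-up names for this sketch): every convolution layer draws its scratch space from one static allocation that only grows, so the total footprint is the largest single layer's requirement.

```cpp
// Minimal sketch of the sharing idea (illustrative names, not Caffe's code):
// all convolution layers take their im2col / col2im scratch space from one
// static allocation that only grows, so total usage is the maximum layer
// requirement rather than the sum over layers.
#include <cstddef>
#include <vector>

class SharedColBuffer {
 public:
  // Return a pointer to at least `count` floats, growing the shared
  // allocation when a layer needs more than any previous layer did.
  static float* get(std::size_t count) {
    if (count > buffer().size()) {
      buffer().resize(count);
    }
    return buffer().data();
  }

 private:
  static std::vector<float>& buffer() {
    static std::vector<float> data;  // one allocation for all layers
    return data;
  }
};
```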

Advice and Cautions:

  • This is worth it for fully convolutional models where Caffe convolution is faster than cuDNN.
  • No N-D convolution. The buffer shape will be overwritten by other layers.
  • No parallelism. Only a single net can do forward / backward at a time. As the buffer is shared by all layers within and across nets, no convolution can be done in parallel. (A fix for parallel nets is to make the buffer a member of net. DAG parallelism is still out in that case, but isn't currently parallelized anyway.)
  • This has no effect on cuDNN convolution, but that consumes less memory anyway.

All credit to @longjon who reshaped our world in #594 and suggested this patch in #520 (comment).

master edition of #1291.

Do not merge.

@sguada (Contributor) commented Mar 3, 2015

Although this is a simple and elegant change, it may be worth considering a factory of temporary Blobs which could be shared and re-used across layers, or even across nets.
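A rough sketch of what such a factory might look like (illustrative only; `TempBufferPool`, `acquire`, and `release` are made-up names, not a proposed Caffe API): layers check out a buffer of the size they need and return it when done, so scratch storage is re-used instead of owned per layer.

```cpp
// Hypothetical sketch of the suggestion above: a pool that hands out
// temporary buffers and takes them back, so scratch storage is re-used
// across layers (or nets) instead of each layer owning its own.
#include <cstddef>
#include <memory>
#include <vector>

class TempBufferPool {
 public:
  typedef std::shared_ptr<std::vector<float> > Buffer;

  // Hand out a buffer with at least `count` elements, re-using a free one
  // when a large enough buffer is available.
  Buffer acquire(std::size_t count) {
    for (std::size_t i = 0; i < free_.size(); ++i) {
      if (free_[i]->size() >= count) {
        Buffer buf = free_[i];
        free_.erase(free_.begin() + i);
        return buf;
      }
    }
    return Buffer(new std::vector<float>(count));
  }

  // Return a buffer to the pool so another layer can re-use it.
  void release(const Buffer& buf) { free_.push_back(buf); }

 private:
  std::vector<Buffer> free_;
};
```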

@shelhamer (Member, Author) commented:

For a more mannered take on sharing buffers see #2009.

shelhamer added a commit to longjon/caffe that referenced this pull request Mar 5, 2015
@shelhamer (Member, Author) commented:

As a reminder, the static buffer causes this crash on exit:

F0305 21:14:53.266198 1810 syncedmem.cpp:16] Check failed: error == cudaSuccess (29 vs. 0) driver shutting down

This is a known side effect.
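The crash comes from destruction order: because the shared buffer is a static object, its memory is released during static destruction, which can run after the CUDA driver has already begun unloading; the free call then returns the "driver shutting down" error (cudaErrorCudartUnloading, reported as 29 above) and CUDA_CHECK aborts. A minimal sketch of one way to tolerate this, not the actual syncedmem.cpp change (`SafeFreeHost` is an illustrative name):

```cpp
// Sketch only: treat "driver shutting down" as benign when freeing pinned
// memory from a static object's destructor at process exit, and abort only
// on genuine failures.
#include <cuda_runtime.h>
#include <glog/logging.h>

inline void SafeFreeHost(void* ptr) {
  cudaError_t err = cudaFreeHost(ptr);
  if (err != cudaSuccess && err != cudaErrorCudartUnloading) {
    LOG(FATAL) << "cudaFreeHost failed: " << cudaGetErrorString(err);
  }
}
```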

weiliu89 added a commit to weiliu89/caffe that referenced this pull request Apr 4, 2015
Share convolution buffers to reduce memory usage
qinhongwei pushed a commit to qinhongwei/caffe that referenced this pull request Apr 14, 2015
Share convolution buffers to reduce memory usage
elleryrussell pushed a commit to elleryrussell/caffe that referenced this pull request May 1, 2015
Share convolution buffers to reduce memory usage
zlmzju pushed a commit to zlmzju/caffe that referenced this pull request Jun 17, 2015
@naibaf7 naibaf7 mentioned this pull request Jun 26, 2015
ybkuang added a commit to ybkuang/caffe that referenced this pull request Jan 19, 2016
Share convolution buffers to reduce memory usage
@cheer37 commented Mar 4, 2016

I ran into this problem: CUDA_CHECK(cudaFreeHost(ptr)); in syncedmem.cpp causes a crash on exit.
What is the reason for this?
How can I solve it?
Thanks.

@naibaf7 (Member) commented Mar 4, 2016

@shelhamer
In my Caffe branch, I have shared convolution buffers for both CUDA and OpenCL without compromising ND convolution. If you are interested in fixing this soon, I could probably make a PR out of the CUDA part.

@shelhamer (Member, Author) commented:

@naibaf7 that sounds promising, but I won't have a chance to review it until after the ECCV deadline 03/14, and we'll have to check it with the NVIDIA/Flickr parallelism too. Feel free to open the PR when ready all the same. Thanks.

share the im2col / col2im buffers among convolution + deconvolution
layers by making the buffer a static member.

@longjon deserves all the credit for the reshaping in BVLC#594 and for this patch.
@shelhamer (Member, Author) commented:

Rebased to master @ a1c81ac for use after the great layer reckoning that split headers.

shelhamer added a commit to shelhamer/caffe that referenced this pull request Mar 4, 2016
Share convolution buffers to reduce memory usage

* shelhamer/share-col-buffer:
  share columnation buffers for convolution to save memory
@cheer37 commented Mar 9, 2016

@shelhamer
How can I solve this side effect?

kashefy added a commit to kashefy/caffe that referenced this pull request Apr 5, 2016
Share convolution buffers to reduce memory usage
zhengmzong referenced this pull request in happynear/MTCNN_face_detection_alignment Jan 9, 2017
frolenkov-nikita added a commit to frolenkov-nikita/caffe that referenced this pull request Jun 14, 2017
jmerkow added a commit to DR08/caffe that referenced this pull request Jun 27, 2017
Share convolution buffers to reduce memory usage
jmerkow added a commit to DR08/caffe that referenced this pull request Jun 27, 2017
Share convolution buffers to reduce memory usage
@naibaf7 (Member) commented Jun 12, 2018

This problem was recently solved for fast inference on the OpenCL branch. No layers are broken.

@aaron-michaux commented:

This bug prevents other "atexit" handlers from running. It's sloppy to have an error and say "Hey, don't worry about it, it doesn't have side effects!"

Is there a function that can be set to safely shut down caffe? That should be all that's required, right?
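For illustration, a sketch of what such a hook could look like (hypothetical; `ReleaseSharedBuffers` is not an existing Caffe function): an explicit teardown call invoked at the end of main(), while the CUDA driver is still alive, so nothing is left to static destructors or later atexit handlers. A plain vector stands in here for the real GPU-backed buffer.

```cpp
// Hypothetical sketch, not an existing Caffe API: an explicit cleanup hook
// that releases the shared scratch storage before process exit.
#include <vector>

namespace caffe {

// Shared scratch storage used only by this sketch (stand-in for the real
// GPU-backed column buffer).
static std::vector<float>& shared_col_buffer() {
  static std::vector<float> data;
  return data;
}

// A caller would invoke this at the end of main(), before atexit handlers
// run, so the buffer's storage is freed up front instead of during static
// destruction.
void ReleaseSharedBuffers() {
  std::vector<float>().swap(shared_col_buffer());
}

}  // namespace caffe
```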
