
[Memory]More memory optimization policy #8690

Merged: 20 commits into PaddlePaddle:develop on Mar 12, 2018

Conversation

QiJune (Member) commented on Mar 1, 2018

After adding a more aggressive optimization level, the memory usage of the image_classification demo dropped from 93024256 to 92807168 bytes, which is only a small benefit.

There are still many dead variables that are not reused; most of them are gradient variables. After the SGD update, these gradients can be released. We may have to delete them with a DeleteOperator.
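As a rough, hypothetical sketch of that idea (plain Python over a made-up op-list structure, not the actual PaddlePaddle transpiler code), a pass could append a delete op for each gradient variable right after its last use:

```python
def insert_delete_ops(ops, grad_var_names):
    """Append a 'delete' op right after the last op that touches each
    gradient variable, so its memory can be released early.

    `ops` is a list of dicts like {"type": ..., "inputs": [...], "outputs": [...]},
    and `grad_var_names` is the set of gradient variable names (hypothetical structures).
    """
    # Find the index of the last op that reads or writes each gradient variable.
    last_use = {}
    for idx, op in enumerate(ops):
        for name in op["inputs"] + op["outputs"]:
            if name in grad_var_names:
                last_use[name] = idx

    # Group the variables by the op index after which they become dead.
    deletions = {}
    for name, idx in last_use.items():
        deletions.setdefault(idx, []).append(name)

    # Rebuild the op list with delete ops inserted at the right positions.
    new_ops = []
    for idx, op in enumerate(ops):
        new_ops.append(op)
        if idx in deletions:
            new_ops.append({"type": "delete", "inputs": deletions[idx], "outputs": []})
    return new_ops
```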

I added another release-memory policy based on a DeleteOp and tested it on the ResNet model:

| Model  | No optimization | Reuse memory              | Release memory            | Forward-only memory |
|--------|-----------------|---------------------------|---------------------------|---------------------|
| ResNet | 170590208       | 92995584 (45.5% reduction) | 78004224 (54.3% reduction) | 77488128            |

The release-memory policy has almost reached the upper limit (the forward-only memory). If we want to reduce memory occupation further, there are two ways:

  • Examine the forward pass carefully, fuse some small operators, and try to eliminate some intermediate results.
  • Use a re-computation policy: discard some results in the forward pass and re-compute them in the backward pass (see the sketch below).
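For reference, a toy sketch of the re-computation idea (plain Python with a hypothetical layer interface, not PaddlePaddle API): keep only each layer's input as a checkpoint, and recompute the discarded activations when the backward pass needs them.

```python
def forward_checkpointed(x, layers):
    """Forward pass that keeps only each layer's input (a checkpoint)
    and discards intermediate activations to lower peak memory."""
    checkpoints = []
    for layer in layers:
        checkpoints.append(x)          # cheap to keep: one input tensor per layer
        x = layer.forward(x)
    return x, checkpoints


def backward_with_recompute(grad_out, layers, checkpoints):
    """Backward pass that recomputes each layer's activation from its
    checkpoint just before it is needed, trading compute for memory."""
    for layer, x in zip(reversed(layers), reversed(checkpoints)):
        y = layer.forward(x)           # recompute the activation that was discarded
        grad_out = layer.backward(x, y, grad_out)
    return grad_out
```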

@QiJune changed the title from "More level memory optimization level" to "More memory optimization level" on Mar 1, 2018
```diff
@@ -118,7 +118,7 @@ def _find_var(self, block_desc, var_name, is_forward):
         else:
             return block_desc.find_var_recursive(str(var_name))

-    def memory_optimize(self):
+    def memory_optimize(self, level=0):
```
A Collaborator commented on this diff:

The code style says that a function name should be a verb-object phrase, like optimize_memory instead of memory_optimize.

Also, it seems that we cannot optimize the memory itself; what we can do is optimize the usage of the memory.

In this case, does it mean reuse_memory, and should we rename level to reuse_tensor_with_the_same_size?

QiJune (Member, Author) replied:

Yes, you are right; it is mainly about reusing memory. Level 0 means we can only reuse a cached tensor of exactly the same size. Level 1 means we can reuse a cached tensor as long as the current tensor's size is the same as, or smaller than, the cached tensor's size.

I will refine the code accordingly. Thanks!
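For illustration, a minimal sketch of how such a level flag could gate the reuse check (the function below is hypothetical, not the actual transpiler code):

```python
import numpy as np

def can_reuse(needed_shape, cached_shape, level=0):
    """Decide whether a dead tensor in the cache pool can back a new tensor.

    level 0: only reuse a cached tensor with exactly the same number of elements.
    level 1: also reuse a larger cached tensor (the new tensor may be smaller).
    """
    needed = int(np.prod(needed_shape))
    cached = int(np.prod(cached_shape))
    if level == 0:
        return needed == cached
    return needed <= cached
```

For example, a (32, 128) tensor can reuse a cached (64, 64) tensor at level 0, since both hold 4096 elements, while a (16, 64) tensor can reuse that cached tensor only at level 1.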

@dzhwinter added this to Doing in Performance Tuning on Mar 6, 2018
@dzhwinter changed the title from "More memory optimization level" to "[Memory]More memory optimization level" on Mar 6, 2018
@QiJune changed the title from "[Memory]More memory optimization level" to "[Memory]More memory optimization policy" on Mar 8, 2018
@dzhwinter (Contributor) left a review comment:

This PR provides an aggressive policy for reusing memory: it performs a stream synchronize after each operator is launched, so that once the op is no longer running we can safely delete all of its dead variables.

@QiJune We will merge this PR since the Image mission deadline is looming. Please provide some experimental details about the effect on speed and complete the issue description. Thanks!
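To make that concrete, here is a hypothetical pseudo-Python sketch of the runtime behaviour described above (op.run, place.synchronize, and scope.erase are assumed names, not the actual executor API):

```python
def run_with_eager_free(ops, scope, last_use, place):
    """Run ops one by one; after each op finishes, release every variable
    whose last use has passed.

    `last_use[name]` is the index of the last op that touches variable `name`
    (hypothetical structures, for illustration only)."""
    for idx, op in enumerate(ops):
        op.run(scope, place)
        place.synchronize()          # wait for the kernel: the op is no longer running
        for name, last in last_use.items():
            if last == idx:
                scope.erase(name)    # safe to free, nothing later reads it
```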

@QiJune merged commit f7e9fe5 into PaddlePaddle:develop on Mar 12, 2018
Performance Tuning automation moved this from Doing to Done on Mar 12, 2018