
support excluded_layers for amp.decorate #52871

Merged
merged 4 commits into PaddlePaddle:develop from amp_decorate on Apr 18, 2023

Conversation

Contributor

@zhangting2020 zhangting2020 commented Apr 13, 2023

PR types

New features

PR changes

APIs

Describe

support excluded_layers for amp.decorate

English docs: http://preview-paddle-pr-52871.paddle-docs-preview.paddlepaddle.org.cn/documentation/docs/en/api/paddle/amp/decorate_en.html
Chinese docs: PaddlePaddle/docs#5792
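
For illustration, a minimal usage sketch of the new parameter (the model below is an assumption for the example, not code from the PR; the call follows the docstring and the unit test discussed in the review):

```python
import paddle
import paddle.nn as nn

# Small placeholder model used only for this example.
class Net(nn.Layer):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2D(3, 8, 3)
        self.norm = nn.BatchNorm2D(8)

    def forward(self, x):
        return self.norm(self.conv(x))

model = Net()
# At level 'O2' the weights are cast to float16, except for the layers
# listed in excluded_layers, which keep float32 weights.
model = paddle.amp.decorate(
    models=model,
    level='O2',
    dtype='float16',
    excluded_layers=[model.conv],
)
```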


paddle-bot bot commented Apr 13, 2023

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

),
):
need_keep_fp32 = True
elif (layer._dtype == 'float16') or isinstance(
Contributor

What does `layer._dtype` represent? Please add a comment to explain it. Is this interface also used for BF16? If so, are the LayerNorm parameters FP32 or BF16 under BF16?

Contributor Author

A comment has been added. This interface is meant to unify the parameter-casting process for FP16 and BF16.
For BF16, the original interface was `pure_bf16_initialize`, which cast the parameters of every layer; in this PR only BN keeps FP32, and all other layers are cast.
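
To illustrate the unified path described above, a sketch assuming the public `paddle.amp.decorate` API (the model is a placeholder, and `dtype='bfloat16'` assumes a device/build with BF16 support):

```python
import paddle
import paddle.nn as nn

def build():
    # Placeholder model: one layer that gets cast, one norm layer that does not.
    return nn.Sequential(nn.Linear(16, 16), nn.BatchNorm1D(16))

# FP16 and BF16 now go through the same decoration path; in both cases the
# BatchNorm parameters are expected to stay float32 at level 'O2'.
fp16_model = paddle.amp.decorate(models=build(), level='O2', dtype='float16')
bf16_model = paddle.amp.decorate(models=build(), level='O2', dtype='bfloat16')
```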

# initialize parameters of the model
for idx in range(len(excluded_layers)):
    for layer in excluded_layers[idx].sublayers(include_self=True):
        layer._cast_to_low_precison_amp = False
Contributor

Is `amp` duplicated in `low_precison_amp`?

Contributor Author

done

),
):
need_keep_fp32 = True
elif not layer._cast_to_low_precison_amp:
Contributor

Please confirm: can LayerNorm be configured in some way so that its parameters use FP16/BF16? Real workloads may need this, and PyTorch's LayerNorm parameters also default to FP16.

Contributor Author

The current implementation does not change the default FP16 behavior; the original handling is kept, i.e. the parameters of BN, LayerNorm, and InstanceNorm all stay FP32.

For now, there is no way to allow LayerNorm parameters to use FP16 under FP16.
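
A small check of the default behavior described here (a sketch using the public API; the model is an assumption for the example):

```python
import paddle
import paddle.nn as nn

model = nn.Sequential(nn.Linear(8, 8), nn.LayerNorm(8))
model = paddle.amp.decorate(models=model, level='O2', dtype='float16')

# Per the discussion above, the Linear weights should come out as float16
# while the LayerNorm weight and bias stay float32.
for name, param in model.named_parameters():
    print(name, param.dtype)
```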

gradients will be FP32 dtype after the backpropagation. Default is False.
master_grad(bool, optional): For level='O2', whether to use float32 weight gradients for calculations such as gradient clipping, weight decay, and weight updates. If master_grad is enabled, the weight
gradients will be float32 dtype after the backpropagation. Default is False.
excluded_layers(Layer|list of Layer, optional): Specifies the layers not to be decorated. The weights of these layers will always keep float32 when level is O2. Default is None, the weights of the whole model will be casted to float16 or bfloat16.
Contributor

Please confirm: is this set with layer types such as `[nn.LayerNorm]`, with layer instances such as `[norm]`, or both?

Contributor Author

Both forms are now supported.
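
For reference, a sketch of the two supported forms, exclusion by instance and exclusion by type (the model is an assumption for the example; it also shows passing several instances at once):

```python
import paddle
import paddle.nn as nn

class Net(nn.Layer):
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2D(3, 8, 3)
        self.conv2 = nn.Conv2D(8, 8, 3)

    def forward(self, x):
        return self.conv2(self.conv1(x))

# Exclude by instance: only the listed conv layers keep float32 weights.
model = Net()
model = paddle.amp.decorate(
    models=model, level='O2', dtype='float16',
    excluded_layers=[model.conv1, model.conv2],
)

# Exclude by type: every nn.Conv2D in the model keeps float32 weights.
model2 = Net()
model2 = paddle.amp.decorate(
    models=model2, level='O2', dtype='float16',
    excluded_layers=[nn.Conv2D],
)
```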

models=model,
level='O2',
dtype='float16',
excluded_layers=[model.conv],
Contributor

Please set multiple instances in this test.

Contributor Author

done

@zhangting2020 zhangting2020 force-pushed the amp_decorate branch 3 times, most recently from 257c7b6 to 49d9b84 on April 17, 2023 11:48
Xreki previously approved these changes Apr 18, 2023
Contributor

@Xreki Xreki left a comment

LGTM

layer,
(
paddle.incubate.nn.FusedFeedForward,
paddle.incubate.nn.FusedMultiHeadAttention,
),
):
layer._amp_decorate(dtype='float16')
layer._amp_decorate(dtype=dtype)
Contributor

I feel this logic can be further optimized and unified later.

master_grad(bool, optional): For level='O2', whether to use float32 weight gradients for calculations such as gradient clipping, weight decay, and weight updates. If master_grad is enabled, the weight
gradients will be float32 dtype after the backpropagation. Default is False.
excluded_layers(Layer|list of Layer, optional): Specifies the layers not to be decorated. The weights of these layers will always keep float32 when level is O2. `excluded_layers` can be specified as
an Layer instance/type or a list of Layer instances/tpyes. Default is None, the weights of the whole model will be casted to float16 or bfloat16.
Contributor

There is a typo in `instances/tpyes`.

Contributor

@sunzhongkai588 sunzhongkai588 left a comment

LGTM

Contributor

@lanxianghit lanxianghit left a comment

LGTM for new args

@zhangting2020 zhangting2020 merged commit 534efcb into PaddlePaddle:develop Apr 18, 2023
23 of 24 checks passed
jjyaoao pushed a commit to jjyaoao/Paddle that referenced this pull request Apr 19, 2023
lijialin03 pushed a commit to lijialin03/Paddle that referenced this pull request Apr 25, 2023
zhangting2020 added a commit to zhangting2020/Paddle that referenced this pull request Apr 28, 2023
Xreki pushed a commit to Xreki/Paddle that referenced this pull request May 2, 2023
zhangting2020 added a commit to zhangting2020/Paddle that referenced this pull request May 22, 2023