update optimizer for 2.0 #26288

Merged
MRXLT merged 35 commits into PaddlePaddle:develop from 2.0-op on Aug 23, 2020

Conversation

@MRXLT (Contributor) commented Aug 14, 2020

PR types

New features

PR changes

OPs

Describe

Improve the Adam, Adamax, Optimizer, and RMSProp ops.
Add a new AdamW op.

Optimizer class
parameter_list renamed to parameters
regularization renamed to weight_decay; when a float is passed, it is used as the L2Decay coefficient
set_dict renamed to set_state_dict
new step() interface in dynamic graph mode, replacing minimize()
current_step_lr renamed to get_lr
clear_gradients renamed to clear_grad; the original interface is kept as an alias of clear_grad (see the usage sketch below)
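
A minimal usage sketch of the renamed dygraph interfaces, assuming the paddle 2.0 API; the toy Linear model, shapes, and hyperparameter values are only for illustration:

```python
import paddle

paddle.disable_static()  # dygraph mode (the default in later 2.0 releases)

linear = paddle.nn.Linear(10, 1)  # hypothetical toy model

adam = paddle.optimizer.Adam(
    learning_rate=0.01,
    parameters=linear.parameters(),  # was parameter_list
    weight_decay=0.01,               # was regularization; a float is the L2Decay coefficient
)

loss = linear(paddle.rand([4, 10])).mean()
loss.backward()

adam.step()         # new dygraph interface, replaces minimize()
lr = adam.get_lr()  # was current_step_lr()
adam.clear_grad()   # was clear_gradients(), which is kept as an alias

adam.set_state_dict(adam.state_dict())  # was set_dict()
```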

AdamOptimizer becomes Adam, AdamaxOptimizer becomes Adamax, and RMSPropOptimizer becomes RMSProp; the remaining changes are the same as in the base Optimizer class.

New AdamW class
Inherits from DecoupledWeightDecay and Adam (see the sketch below)
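
A corresponding sketch for the new AdamW class, assuming the decoupled weight decay behaviour described above; the model and the decay coefficient are illustrative only:

```python
import paddle

linear = paddle.nn.Linear(10, 1)

# AdamW combines Adam updates with decoupled weight decay (DecoupledWeightDecay)
adamw = paddle.optimizer.AdamW(
    learning_rate=0.001,
    parameters=linear.parameters(),
    weight_decay=0.01,  # decay applied directly to the parameters, not added to the loss
)

loss = linear(paddle.rand([4, 10])).mean()
loss.backward()
adamw.step()
adamw.clear_grad()
```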

Chinese documentation link: PaddlePaddle/docs#2424


@paddle-bot-old commented:
Hi, this is a test PR, so it will not trigger CI. If you want to trigger CI, please remove "notest" from your commit message.

beta1=0.9,
beta2=0.999,
epsilon=1e-8,
parameters=None,
Collaborator commented:

Could the parameters argument be moved earlier in the signature? Dynamic graph mode depends heavily on it.

Contributor Author (MRXLT) replied:

To stay consistent with the other optimizers, this argument will not be moved for now.

outputs={"ParamOut": param_and_grad[0]})
return new_param_grads, (table_param, table_grad), sgd_op

def _append_dgc_ops(self, param_and_grad):
@phlrain (Collaborator) commented Aug 19, 2020:

Why is this API needed?

Contributor Author (MRXLT) replied:

It is overridden and used by the DGCMomentum optimizer; it is defined here mainly to prevent errors during backward.
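
A simplified sketch of that pattern (class and method bodies are reduced to placeholders and are not the actual implementation):

```python
class Optimizer:
    def _append_dgc_ops(self, param_and_grads):
        # no-op hook in the base class so backward() can call it
        # unconditionally without raising an error
        pass


class DGCMomentumOptimizer(Optimizer):
    def _append_dgc_ops(self, param_and_grads):
        # the DGC-specific gradient-compression ops are appended here
        ...
```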

XiaoguangHu01 previously approved these changes Aug 21, 2020

@XiaoguangHu01 (Contributor) left a comment:

A few minor issues to address; this can be merged first and the fixes made in a follow-up.

TCChenlong previously approved these changes Aug 21, 2020

@TCChenlong (Contributor) left a comment:

LGTM

@MRXLT dismissed stale reviews from TCChenlong and XiaoguangHu01 via 6cc0fc2 on August 21, 2020 09:42
raindrops2sea previously approved these changes Aug 21, 2020
Related paper: `Adam: A Method for Stochastic Optimization <https://arxiv.org/abs/1412.6980>`_

Args:
learning_rate (float|LearningRateDecay, optional): The learning rate used to update ``Parameter``.
Contributor commented:

The learning_rate type is float|LearningRateDecay in the English docs but float|Variable in the Chinese docs; please keep them consistent, and change Variable to Tensor.

Contributor Author (MRXLT) replied:

The English documentation is authoritative; the Chinese docs will be updated later.

The default value is 0.999.
epsilon (float, optional): A small float value for numerical stability.
The default value is 1e-08.
parameters (list, optional): List of ``Tensor`` names to update to minimize ``loss``. \
Contributor commented:

Please keep the argument order of parameters consistent between the Chinese and English docs.

indicate program pruning. If so, the program will be pruned by ``feed`` and
``fetch_list`` before run, see details in ``Executor``.

Examples:
Contributor commented:

Please implement the example with the 2.0 API.

it is added here for numerical stability to prevent the division by 0 error.

Args:
learning_rate (float|LearningRateDecay, optional): The learning rate used to update ``Parameter``.
Contributor commented:

float|LearningRateDecay or float|Tensor?

Contributor Author (MRXLT) replied:

float|LearningRateDecay; the Chinese documentation will be updated later.

@TCChenlong (Contributor) left a comment:

LGTM

@XiaoguangHu01 (Contributor) left a comment:

LGTM

@raindrops2sea (Collaborator) left a comment:

LGTM

@MRXLT merged commit eeda90d into PaddlePaddle:develop on Aug 23, 2020
@MRXLT changed the title from "[WIP] update optimizer for 2.0" to "update optimizer for 2.0" on Aug 24, 2020
@MRXLT deleted the 2.0-op branch on August 24, 2020 06:13
@jzhang533 (Contributor) commented:

Re: "parameter_list renamed to parameters"
SGD and Momentum were not changed, were they?
