question again on gradient_clipping_threshold #3696

yu239-zz · 2017-08-27T01:37:45Z

In an old Issue #775 , there seemed to be some discussions on how the gradient clipping is triggered. Currently, on a single machine Paddle always uses SgdLocalUpdater. However, gradient clipping is only used in SgdThreadUpdater. Is there any plan to fix this issue? Or can we always use SgdThreadUpdater?

lcy-seso · 2017-08-27T02:17:27Z

I have the same problem. but it hasn't been solved yet ...

lcy-seso · 2017-08-27T03:47:45Z

Previously, I just changed SgdLocalUpdater into SgdThreadUpdater and it worked, but I am not sure whether there may be certain problems.

qingqing01 · 2017-08-27T03:49:10Z

@yu239 The v2 API always use SgdThreadUpdater and the gradient_clipping works well now. If you use binary paddle_trainer to run job, the gradient_clipping doesn't work indeed.

yu239-zz · 2017-08-31T20:37:39Z

@qingqing01 I use the class paddle::Trainer in the C++ code. It seems that SgdLocalUpdater is always used. So I can just change the Paddle source code (TrainerInternal.cpp) to use SgdThreadUpdater? Is there any potential issue by doing so?

yu239-zz · 2017-08-31T20:42:16Z

@qingqing01 I can hack the source code so that SgdThreadUpdater is used in place of SgdLocalUpdater. But this does not seem like a final solution. If I use paddle::Trainer, is there any official way that I can specify SgdThreadUpdater without modifying the Paddle source code?

shanyi15 closed this as completed Aug 15, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

question again on gradient_clipping_threshold #3696

question again on gradient_clipping_threshold #3696

yu239-zz commented Aug 27, 2017

lcy-seso commented Aug 27, 2017

lcy-seso commented Aug 27, 2017

qingqing01 commented Aug 27, 2017 •

edited

Loading

yu239-zz commented Aug 31, 2017

yu239-zz commented Aug 31, 2017

question again on gradient_clipping_threshold #3696

question again on gradient_clipping_threshold #3696

Comments

yu239-zz commented Aug 27, 2017

lcy-seso commented Aug 27, 2017

lcy-seso commented Aug 27, 2017

qingqing01 commented Aug 27, 2017 • edited Loading

yu239-zz commented Aug 31, 2017

yu239-zz commented Aug 31, 2017

qingqing01 commented Aug 27, 2017 •

edited

Loading