
About gradient #3

Open
liuhengli opened this issue Jul 13, 2017 · 2 comments

Comments

@liuhengli

Excuse me, there is something about the gradient I don't understand: why is the gradient's shape the same as the activation output's shape? Isn't the gradient the weight gradient, whose shape is [i, o, 3, 3]?

@jacobgil (Owner)

The gradient is the gradient of the output with respect to each of the activation outputs. Therefore the gradient has the same shape as the activation outputs.
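The distinction can be illustrated with a small PyTorch sketch (this is not the repository's code, just a toy convolution and a dummy scalar loss): the gradient with respect to an activation has the activation's shape, while the gradient with respect to the weights has the weight tensor's shape.

```python
import torch
import torch.nn as nn

# Toy conv layer: 2 input channels, 4 output channels, 3x3 kernel.
conv = nn.Conv2d(2, 4, kernel_size=3, padding=1)
x = torch.randn(1, 2, 8, 8)

activation = conv(x)  # shape [1, 4, 8, 8]
grads = []
# Capture d(loss)/d(activation) as it flows through backward().
activation.register_hook(lambda g: grads.append(g))

loss = activation.sum()  # stand-in for the final loss
loss.backward()

# Gradient w.r.t. the activation matches the activation's shape...
assert grads[0].shape == activation.shape            # [1, 4, 8, 8]
# ...while the gradient w.r.t. the weights matches the weight shape,
# which in PyTorch is [out_channels, in_channels, kH, kW].
assert conv.weight.grad.shape == conv.weight.shape   # [4, 2, 3, 3]
```

The hook captures the gradient discussed in this thread; the weight gradient (the [o, i, 3, 3] tensor) is a separate quantity stored in `conv.weight.grad`.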

@guoxiaolu

@jacobgil, this part is difficult to understand. For example, gradient(final_loss, layer_weight) means the gradient of the loss with respect to the layer weight, so the gradient keeps the same dimensions as the weight. According to your comment, is the final_loss the output (that is, the x = module(x) in your code)? And is the layer_weight each of the activation outputs (what is that, and can I find the corresponding variable in your code)?
Thank you very much
