
How to compute the loss diff in negativemining op #23

Open

dtivger opened this issue Feb 17, 2017 · 3 comments

dtivger commented Feb 17, 2017

@Seanlinx Hi Seanlinx, I have some questions about your negativemining op. Theoretically, the CLS loss can be written as `1(x) * log(prob) * (-1/ohem_keep)`, where `x` is the pair of the cls label and the softmax op's output (`x = (label, prob)`) and `1(x)` is the indicator function `1{.}`, so the bottom diff should be `1(x) * (1/prob) * (-1/ohem_keep)`, but you only compute `1(x) * (-1/ohem_keep)`. Meanwhile, the BBOX loss can be written as `x^2 / valid_num`, so its diff is `2x / valid_num`, but you only compute `1 / valid_num`. Can you explain your reasoning?
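For reference, the losses and full gradients the question describes can be written out explicitly. Notation here is not from the repo: `N_ohem` stands for `ohem_keep`, `N_valid` for `valid_num`, and `t`, `t*` for the predicted and ground-truth regression targets.

```latex
% Classification: cross-entropy averaged over the kept (hard) samples
L_{\text{cls}} = -\frac{1}{N_{\text{ohem}}} \sum_i \mathbb{1}\{i\ \text{kept}\}\, \log p_{i, y_i},
\qquad
\frac{\partial L_{\text{cls}}}{\partial p_{i, y_i}} = -\frac{\mathbb{1}\{i\ \text{kept}\}}{N_{\text{ohem}}\, p_{i, y_i}}

% Regression: squared error averaged over the valid samples
L_{\text{bbox}} = \frac{1}{N_{\text{valid}}} \sum_i \lVert t_i - t_i^{*} \rVert^2,
\qquad
\frac{\partial L_{\text{bbox}}}{\partial t_i} = \frac{2\,(t_i - t_i^{*})}{N_{\text{valid}}}
```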

Seanlinx (Owner) commented

@dtivger The missing part is computed in previous layers, softmax and linear_regression.
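To make that concrete, here is a minimal numpy sketch (a generic illustration, not the repository's code) of the identity being relied on: once softmax and cross-entropy are taken together, the gradient with respect to the logits is `p - y` per kept sample, so the `1/prob` factor from the question never has to appear in the mining op; it only needs the indicator mask and the `1/ohem_keep` scaling.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

rng = np.random.default_rng(0)
z = rng.normal(size=(4, 2))          # logits for 4 samples, 2 classes
label = np.array([0, 1, 1, 0])
keep = np.array([1, 1, 0, 1])        # OHEM mask: sample 2 dropped
n_keep = keep.sum()

p = softmax(z)
y = np.eye(2)[label]                 # one-hot labels

# Analytic gradient of L = -(1/n_keep) * sum_{kept} log p[i, label_i]
# w.r.t. the logits: the 1/p factor cancels against the softmax Jacobian,
# leaving (p - y) * mask / n_keep.
grad_analytic = (p - y) * keep[:, None] / n_keep

# Numerical check by central finite differences
def loss(z_):
    p_ = softmax(z_)
    return -(keep * np.log(p_[np.arange(4), label])).sum() / n_keep

eps = 1e-6
grad_num = np.zeros_like(z)
for i in range(z.shape[0]):
    for j in range(z.shape[1]):
        zp = z.copy(); zp[i, j] += eps
        zm = z.copy(); zm[i, j] -= eps
        grad_num[i, j] = (loss(zp) - loss(zm)) / (2 * eps)

assert np.allclose(grad_analytic, grad_num, atol=1e-5)
```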

dtivger (Author) commented Feb 18, 2017

@Seanlinx I got it, thanks.

geoffzhang commented

@Seanlinx @dtivger Why is the gradient divided by the number of kept samples in `backward`, i.e. `cls_grad /= len(np.where(cls_keep == 1)[0])` and `bbox_grad /= len(np.where(bbox_keep == 1)[0])`?
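One reading, consistent with the averaged losses discussed above (a sketch, not the repository's code): the losses are means over the kept samples, and `len(np.where(mask == 1)[0])` simply counts those samples, so the division applies exactly the `1/ohem_keep` (resp. `1/valid_num`) normalization from the loss definition.

```python
import numpy as np

# Illustration: counting the kept samples and dividing turns the summed
# per-sample gradient into a mean over kept samples.
cls_keep = np.array([1, 0, 1, 1, 0])
n_keep = len(np.where(cls_keep == 1)[0])   # == cls_keep.sum() == 3

cls_grad = np.ones(5) * cls_keep           # per-sample gradient, masked
cls_grad /= n_keep                         # sum -> mean over kept samples
print(cls_grad)                            # [0.33333333 0. 0.33333333 0.33333333 0.]
```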
