
About TV loss #8

Closed
mountains-high opened this issue Jul 14, 2022 · 4 comments

Comments

@mountains-high

Hi~
Thank you for this great work.

My question is about the TV loss. Could you explain why you take the mean when computing the TV loss?
The paper about this work does not mention the 'mean'. Thank you

(diff3.abs() / 255.0).mean() + (diff4.abs() / 255.0).mean()

@VainF
Contributor

VainF commented Jul 14, 2022

Hi @mountains-high, diff1-4 measure the pixel-wise gradient in four directions. So we reduce these gradients to a scalar value for training.
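To make the reduction concrete, here is a minimal NumPy sketch of an anisotropic TV loss with mean reduction, in the spirit of the snippet quoted above (the repo itself uses PyTorch tensors; `tv_loss_mean` and the two-direction simplification are illustrative assumptions, not the repo's actual code):

```python
import numpy as np

def tv_loss_mean(img):
    """Anisotropic total-variation loss with mean reduction.

    A minimal sketch: `img` is assumed to be an (H, W) array with
    values in [0, 255]. The repo computes four directional diffs on
    PyTorch tensors; two forward differences suffice to show the idea.
    """
    # Pixel-wise gradients along the two axes (forward differences).
    diff_h = img[:, 1:] - img[:, :-1]   # horizontal neighbours
    diff_v = img[1:, :] - img[:-1, :]   # vertical neighbours
    # Reduce each gradient map to a scalar with a mean, so the loss
    # value does not grow with image resolution.
    return (np.abs(diff_h) / 255.0).mean() + (np.abs(diff_v) / 255.0).mean()
```

The mean keeps the loss magnitude independent of image size, which makes the TV weight easier to tune across resolutions.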

@mountains-high
Author

Good day ~

Thank you for your reply. I got the point about diff1-4; however, I didn't understand taking the "mean" of them. I found these lines in the paper Data-free Knowledge Distillation for Object Detection

[screenshot of Equation 4 from the paper]

According to Equation 4, which they (the same authors) say was used in [44], shouldn't there be a 1/N factor if we consider the mean?
What do you think about it?

Thank you

@VainF
Contributor

VainF commented Jul 15, 2022

Yes, as you mentioned, the only difference between mean and sum lies in the scaling factor 1/N. You can adjust their weights to get the same loss during training. In other words, if you want to use the summed TV loss, you need to lower the weight of $\mathcal{L}_{TV}$ by N$\times$. However, there is not much difference from the perspective of training.
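A quick sketch of this equivalence (a made-up toy image; the weights `w_mean` and `w_sum` are illustrative, not values from the repo):

```python
import numpy as np

# Toy image whose horizontal neighbours all differ by exactly 1.
img = np.arange(12, dtype=float).reshape(3, 4)
diff = np.abs(img[:, 1:] - img[:, :-1]) / 255.0

n = diff.size                 # N = number of gradient terms (here 9)
mean_loss = diff.mean()
sum_loss = diff.sum()

# sum = N * mean, so lowering the TV weight by N× under `sum`
# reproduces the `mean`-weighted objective exactly.
w_mean = 1.0
w_sum = w_mean / n
assert np.isclose(w_mean * mean_loss, w_sum * sum_loss)
```

So the choice between `mean` and `sum` is absorbed entirely by the loss weight; gradients differ only by the same constant factor.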

@mountains-high
Author

Good day~
Thank you for the answer
