Not prune the bias #2

b03902128 · 2018-04-05T05:43:02Z

Hi Shuan,

Thanks for the great implementation.
I wonder what you mean by 'didn't prune the bias term'.
Do you mean that you only use Wx (instead of Wx+b) to get the predictions and calculate the gradients?

For the pruned models of interest, should I use:

both the new weights and the (original) bias (which does not make sense), or
only the new weights (which may hurt the accuracy of the original models because the bias terms are omitted)?

Thanks!

HolmesShuan · 2018-04-05T14:48:54Z

To be clear:

"Wx" refers to a layer without a bias term; that is different from "didn't prune the bias term".
We update the bias term during fine-tuning, so the original bias should not be used.
DNS (Dynamic Network Surgery) sets some small biases to zero, i.e. it prunes the bias term. Strictly speaking, it is possible for all bias terms to be set to zero, in other words Wx+0, which is equivalent to Wx. But we didn't prune the bias, even when the values of some bias terms are very close to zero.
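
To make the distinction concrete, here is a minimal NumPy sketch, not the code from this repository. It assumes simple magnitude pruning with a fixed mask (not DNS's dynamic splicing); the names `prune_weights`, `forward`, `sgd_step`, the threshold, and the toy values are all hypothetical. The point is only that the weights are masked while the bias stays dense and keeps being updated during fine-tuning, so the layer still computes Wx+b.

```python
import numpy as np

def prune_weights(W, threshold):
    """Zero out weights with |w| <= threshold; return pruned weights and mask."""
    mask = (np.abs(W) > threshold).astype(W.dtype)
    return W * mask, mask

def forward(W, b, x):
    """Affine layer: the bias is always added, pruned weights or not."""
    return W @ x + b

def sgd_step(W, b, mask, grad_W, grad_b, lr):
    """One fine-tuning step under the assumed fixed-mask scheme."""
    W_new = (W - lr * grad_W) * mask  # pruned weights stay at zero
    b_new = b - lr * grad_b           # bias is not masked, so it is fully updated
    return W_new, b_new

# Hypothetical toy layer, values chosen only for illustration.
W = np.array([[0.8, -0.02],
              [0.01, -0.5]])
b = np.array([0.001, 0.3])  # first bias is near zero but is NOT pruned
x = np.array([1.0, 2.0])

W_pruned, mask = prune_weights(W, threshold=0.05)
y = forward(W_pruned, b, x)  # uses the pruned W together with the bias
```

In this sketch the near-zero bias `0.001` survives pruning, and after fine-tuning the bias values differ from the original ones, which is why the fine-tuned bias (rather than the original bias, or no bias at all) belongs with the pruned weights.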

b03902128 closed this as completed Apr 6, 2018
