
regularization on biases #40

Open
GuangQin1995 opened this issue Apr 28, 2017 · 3 comments

Comments

@GuangQin1995

```python
if regularizer is not None:
    regularizers = sum([tf.nn.l2_loss(variable) for variable in self.variables])
    loss += (regularizer * regularizers)
```

It seems that you also apply regularization to the biases, since self.variables includes the biases:

```python
variables = []
for w1, w2 in weights:
    variables.append(w1)
    variables.append(w2)

for b1, b2 in biases:
    variables.append(b1)
    variables.append(b2)
```
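For reference, a minimal sketch of how the penalty could be restricted to the weight variables only, assuming the weights list, regularizer coefficient, and loss from the snippets above; this is just one possible way to do it, not the project's implementation:

```python
import tensorflow as tf  # TensorFlow 1.x API, as in the snippets above

# Hypothetical sketch: build the L2 penalty from the weight variables only,
# leaving the bias variables out of the regularization term.
weight_variables = []
for w1, w2 in weights:
    weight_variables.append(w1)
    weight_variables.append(w2)

if regularizer is not None:
    regularizers = sum(tf.nn.l2_loss(w) for w in weight_variables)
    loss += regularizer * regularizers
```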
@jakeret
Owner

jakeret commented Apr 30, 2017

Yes, you're right. Have you checked whether it makes a big difference?

According to deeplearning.stanford.edu: "Applying weight decay to the bias units usually makes only a small difference to the final network."

But it might be worth investigating.

@meijie0401

I think when the batch size is not 1, we should divide regularizers by 2 * batch_size, like the following. What's your idea?

```python
regularizers = sum([tf.nn.l2_loss(variable) for variable in self.variables]) / (2 * batch_size)
```
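A minimal sketch of how this scaled penalty would be folded into the loss, assuming batch_size is a plain Python integer; note that tf.nn.l2_loss(t) already returns sum(t ** 2) / 2 per variable, so the extra factor of 2 here simply follows the formula as suggested:

```python
# Sketch of the suggestion above: scale the L2 penalty by the batch size.
# tf.nn.l2_loss(v) already includes a factor of 1/2 per variable.
if regularizer is not None:
    regularizers = sum(tf.nn.l2_loss(v) for v in self.variables) / (2 * batch_size)
    loss += regularizer * regularizers
```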

@jakeret
Owner

jakeret commented Sep 8, 2018

Sorry for the very late reply.
The strength of the regularizer is a hyperparameter, just like the batch size. We can cover their relation in the parameter search instead of encoding it implicitly in the code, can't we?
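For illustration, a sketch of what such a parameter search could look like; train_and_evaluate is a hypothetical placeholder for the project's training routine, and the value grids are arbitrary:

```python
# Hypothetical sketch: treat regularization strength and batch size as
# independent hyperparameters and explore their combinations in a small grid.
for batch_size in [1, 4, 16]:
    for reg in [1e-4, 1e-3, 1e-2]:
        # train_and_evaluate stands in for the actual training/evaluation code
        train_and_evaluate(batch_size=batch_size, regularizer=reg)
```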
