Scaling dropout layer by keep probability during test time #106

Chris-Nicholls · 2019-01-26T11:45:19Z

From the dropout paper http://www.cs.toronto.edu/~rsalakhu/papers/srivastava14a.pdf :

"If a unit is retained with probability p during training, the outgoing weights of that unit are multiplied by p at test time"

The test activations should be scaled by the keep probability (1 - drop_prob), not by drop_prob.
For example, if drop_prob is 0, this layer should have no effect, so activations should be scaled by 1.
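
To make the intended behaviour concrete, here is a minimal sketch of standard (non-inverted) dropout in plain Python. It is not this repository's code: the function name `dropout_forward` and its parameters are hypothetical, and only the scaling rule comes from the paper quoted above.

```python
import random


def dropout_forward(activations, drop_prob, training, rng=random):
    """Illustrative non-inverted dropout over a list of floats.

    `dropout_forward`, `drop_prob`, `training` and `rng` are names made up
    for this sketch; they are not taken from the repository.
    """
    if not 0.0 <= drop_prob < 1.0:
        raise ValueError("drop_prob must be in [0, 1)")

    keep_prob = 1.0 - drop_prob

    if training:
        # Keep each unit independently with probability keep_prob,
        # otherwise zero it. No rescaling happens during training here.
        return [a if rng.random() < keep_prob else 0.0 for a in activations]

    # Test time: scale by the keep probability so the output matches the
    # expected training-time activation. Scaling by drop_prob would be wrong.
    return [a * keep_prob for a in activations]
```

With `drop_prob = 0` the keep probability is 1, so the layer is the identity in both modes, which is the sanity check described above. Many frameworks instead use inverted dropout (dividing by the keep probability during training and doing nothing at test time); that is a different but equivalent convention in expectation.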

ratajczak · 2019-01-29T09:48:37Z

Hi Chris, it looks like a duplicate of #61.

Chris-Nicholls · 2019-01-29T09:58:03Z

Yup, you're right. Looks like this isn't being maintained anyway.

Scaling dropout layer by keep probability during test time

dcba8da

Chris-Nicholls closed this Jan 29, 2019
