Add Dropout/DropConnect #413

Closed
zoq opened this issue Mar 1, 2015 · 17 comments

Comments

@zoq
Member

zoq commented Mar 1, 2015

Dropout is a recently introduced algorithm to prevent co-adaptation during training (overfitting). The key idea is to randomly drop units (along with their connections) from a neural network during training. Roughly, each element of a layer's output is kept with probability p, otherwise it is set to 0.

For more information see:

  • Geoffrey E. Hinton, Nitish Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors", 2012
  • Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, Ruslan Salakhutdinov, "Dropout: A Simple Way to Prevent Neural Networks from Overfitting", 2014

A simple way to implement the technique is to introduce a new function which creates a dropOutMask. Afterwards, we can multiply the dropOutMask with the inputActivation in all layers which should support dropout. Something like:

// Dropout: element-wise multiply the activation with the mask (Armadillo's
// operator%) before applying the activation function.
if (dropoutFraction > 0)
{
    ActivationFunction::fn(inputActivation % dropOutMask, outputActivation);
}
else
{
    ActivationFunction::fn(inputActivation, outputActivation);
}
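For illustration, here is a minimal sketch of a mask-creation helper, assuming Armadillo is used; the function name, signature, and the rescaling by 1 / (1 - dropoutFraction) are my own assumptions, not existing code:

#include <armadillo>

// Hypothetical helper: build a dropout mask with the same shape as the layer
// output. Each element survives with probability (1 - dropoutFraction) and is
// scaled by 1 / (1 - dropoutFraction) so the expected activation stays
// unchanged ("inverted dropout"); the remaining elements are 0.
inline arma::mat DropOutMask(const size_t rows,
                             const size_t cols,
                             const double dropoutFraction)
{
    arma::mat mask = arma::conv_to<arma::mat>::from(
        arma::randu<arma::mat>(rows, cols) > dropoutFraction);
    return mask * (1.0 / (1.0 - dropoutFraction));
}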

DropConnect is a generalization of Dropout that takes the idea a step further. Rather than zeroing each unit's activation with probability p, it zeroes individual weights/connections with probability p.

For more information see:

  • Li Wan, Matthew Zeiler, Sixin Zhang, Yann Le Cun, Rob Fergus, "Regularization of Neural Networks using DropConnect", 2013

The idea behind the implementation is similar, except that we need to introduce the feature in all connections which should support DropConnect. The modified code should look something like:

// DropConnect: element-wise multiply the weight matrix with the mask, then
// apply the masked weights to the input.
if (dropConnectFraction > 0)
{
    outputLayer.InputActivation() += (weights % dropOutMask) * input;
}
else
{
    outputLayer.InputActivation() += (weights * input);
}
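The weight mask itself could be generated like this, a sketch assuming Armadillo (weights and dropConnectFraction are taken from the snippet above; the rest is illustrative):

// The DropConnect mask has the same shape as the weight matrix, so every
// individual connection is dropped independently of the others.
arma::mat dropOutMask = arma::conv_to<arma::mat>::from(
    arma::randu<arma::mat>(weights.n_rows, weights.n_cols) > dropConnectFraction);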
@stephentu
Contributor

Did the neural net people really have to assign a new phrase to the idea of overfitting? 😄

@zoq
Member Author

zoq commented Jun 24, 2015

In 7de290f I wrote the Dropout layer, so the ticket should focus on the Dropconnect implementation.

@palashahuja
Contributor

Hello @zoq, I was wondering if this task is still available. If so, I'm willing to work on it.

@zoq
Member Author

zoq commented Mar 1, 2016

I've added the Dropout layer in 7de290f, but I didn't have a chance to implement DropConnect. If you like, you can implement DropConnect.

theaverageguy added a commit to theaverageguy/mlpack that referenced this issue Mar 4, 2016
@theaverageguy

@zoq, I see the difference between the two: one applies the probability mask to the weights (DropConnect), the other to the activations (Dropout). Can you guide me on where to make the changes, though? As I understand it, the input activation should be modified and then assigned to the output activation. Please guide me a bit, because I want to fix this now.

@palashahuja
Contributor

@theaverageguy, I am working on this right now. I will be sending a PR very soon regarding this issue.

@zoq
Member Author

zoq commented Mar 4, 2016

@theaverageguy you are right, I really like the images from the authors: http://cs.nyu.edu/~wanli/dropc/.

So, the implementation of the DropConnectLayer isn't that different from the DropoutLayer, so you can use the DropoutLayer as a basis. Imagine you would like to create a simple feedforward network, something like this:

LinearLayer<> inputLayer(10, 2);
BiasLayer<> inputBiasLayer(2);
ReLULayer<> inputBaseLayer;

LinearLayer<> hiddenLayer(2, 10);
ReLULayer<> outputLayer;

Now we would like to use DropConnect between the input and the first hidden layer, so what we need to do here is randomly set weights of the inputLayer to 0. Let us modify our feedforward network so that it uses this new DropConnectLayer:

LinearLayer<> inputLayer(10, 2);
// Wrap the LinearLayer: drop each of its weights with probability 0.5 (ratio),
// with rescaling enabled (rescale).
DropConnectLayer<> dropConnectLayer(inputLayer, 0.5, true);

BiasLayer<> inputBiasLayer(2);
ReLULayer<> inputBaseLayer;

LinearLayer<> hiddenLayer(2, 10);
ReLULayer<> outputLayer;

As you can see, the constructor of the DropConnectLayer is similar to the DropoutLayer's, but takes an additional parameter:

DropConnectLayer(layer, ratio, rescale)

In this case, we use the layer (LinearLayer) inside the DropConnectLayer for the weight modification. So the Forward(...) function should look like:

template<typename eT>
void Forward(const arma::Mat<eT>& input, arma::Mat<eT>& output)
{
    // Mask out a random subset of the wrapped layer's weights, then delegate
    // to the wrapped layer's own Forward() implementation.
    layer.Weights() %= mask;
    layer.Forward(input, output);
}

We modify the weights of the layer and then use the Forward() function of the layer. The Backward() function is similar.
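For completeness, here is a small standalone sketch (plain Armadillo with my own illustrative names, not the actual mlpack API) that shows how the masked weights are shared between the forward and the backward pass:

#include <armadillo>

// Standalone sketch of the DropConnect idea: a wrapper samples a mask in the
// forward pass, applies it to the weights, and reuses the same masked weights
// in the backward pass, so the gradient only flows through kept connections.
struct DropConnectSketch
{
  arma::mat weights;  // Weights of the wrapped linear transformation.
  arma::mat mask;     // Binary mask with the same shape as the weights.
  double ratio;       // Probability of dropping a single connection.

  DropConnectSketch(const size_t outSize, const size_t inSize,
                    const double ratio) :
      weights(arma::randn<arma::mat>(outSize, inSize)),
      ratio(ratio) { }

  // Forward pass: sample a fresh mask and apply the masked weights.
  arma::vec Forward(const arma::vec& input)
  {
    mask = arma::conv_to<arma::mat>::from(
        arma::randu<arma::mat>(weights.n_rows, weights.n_cols) > ratio);
    return (weights % mask) * input;
  }

  // Backward pass: propagate the error through the same masked weights.
  arma::vec Backward(const arma::vec& gy)
  {
    return (weights % mask).t() * gy;
  }
};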

I hope this is helpful.

@zoq
Member Author

zoq commented Mar 4, 2016

Not sure I get your point, but what if I would like to use DropConnect for the ConvLayer?

@palashahuja
Contributor

Ok, I got it .. thanks ..

abhinavchanda added a commit to abhinavchanda/mlpack that referenced this issue Mar 4, 2016
In drop_connect_layer.hpp, we randomly drop weights instead of
units. This layer is based on linear_layer.hpp, with the exception
that the weights matrix is multiplied by the mask.
@palashahuja
Contributor

@zoq, please have a look at this commit. I have already implemented DropConnect, but I wanted to know what I should do for LayerTraits? It isn't clear for this case.

@zoq
Member Author

zoq commented Mar 6, 2016

template<
    typename InputDataType,
    typename OutputDataType
>
class LayerTraits<DropConnectLayer<InputDataType, OutputDataType> >
{
 public:
  static const bool IsBinary = false;
  static const bool IsOutputLayer = false;
  static const bool IsBiasLayer = false;
  static const bool IsLSTMLayer = false;
  static const bool IsConnection = true;
};

Looks good; the DropConnectLayer is a connection, since it connects two layers. By the way, I really like your commit message.

@chvsp
Contributor

chvsp commented Mar 13, 2016

@zoq, is this issue fixed, or is there something I can work on? I am willing to contribute to it.

@zoq
Member Author

zoq commented Mar 13, 2016

@chvsp Sorry, @palashahuja is working on the issue. The problem is, I can't assign anyone who isn't already part of mlpack.

@chvsp
Contributor

chvsp commented Mar 13, 2016

@zoq I am a GSoC 2016 aspirant, and since the PR hasn't been merged for a long time, I thought there would be something that needs work. I don't understand what you meant by "The problem is, I can't assign anyone who isn't already part of mlpack." Is there something I should do to be eligible to fix issues?

@rcurtin
Member

rcurtin commented Mar 13, 2016

@chvsp: it seems to be a shortcoming of Github; see https://help.github.com/articles/assigning-issues-and-pull-requests-to-other-github-users/ :

You can only create assignments for yourself, collaborators on personal projects, or members of your organization with read permissions on the repository.

@palashahuja
Contributor

@rcurtin, @zoq, you could go ahead and close this issue.

@zoq
Member Author

zoq commented Mar 23, 2016

Merged DropConnect implementation in 63a7f62; take a look at #576.
