
Train with this model for Mnist? #88

Closed
fdncred opened this issue Feb 28, 2018 · 11 comments

fdncred commented Feb 28, 2018

I'm trying to use your software to mimic the results of this Keras model, using the MnistDemo program.

In particular, I'm looking at this section:

# Set the CNN model 
# my CNN architechture is In -> [[Conv2D->relu]*2 -> MaxPool2D -> Dropout]*2 -> Flatten -> Dense -> Dropout -> Out

model = Sequential()

model.add(Conv2D(filters = 32, kernel_size = (5,5),padding = 'Same', 
                 activation ='relu', input_shape = (28,28,1)))
model.add(Conv2D(filters = 32, kernel_size = (5,5),padding = 'Same', 
                 activation ='relu'))
model.add(MaxPool2D(pool_size=(2,2)))
model.add(Dropout(0.25))

model.add(Conv2D(filters = 64, kernel_size = (3,3),padding = 'Same', 
                 activation ='relu'))
model.add(Conv2D(filters = 64, kernel_size = (3,3),padding = 'Same', 
                 activation ='relu'))
model.add(MaxPool2D(pool_size=(2,2), strides=(2,2)))
model.add(Dropout(0.25))

model.add(Flatten())
model.add(Dense(256, activation = "relu"))
model.add(Dropout(0.5))
model.add(Dense(10, activation = "softmax"))
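
For reference, the shapes flowing through this stack can be traced by hand: 'same' padding preserves the spatial size, and each 2x2 max-pool halves it. A minimal sketch (pool_out is a hypothetical helper, not a Keras API):

```python
def pool_out(size: int, pool: int = 2, stride: int = 2) -> int:
    """Output spatial size of a max-pool layer (no padding)."""
    return (size - pool) // stride + 1

h = 28                 # 28x28x1 input
h = pool_out(h)        # Conv(32,5x5,'same') x2, then MaxPool -> 14x14x32
h = pool_out(h)        # Conv(64,3x3,'same') x2, then MaxPool -> 7x7x64
flat = h * h * 64      # Flatten feeds 7*7*64 = 3136 values into Dense(256)
print(h, flat)         # 7 3136
```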

This is what I have so far:

this._net = new Net<double>();
this._net.AddLayer(new InputLayer(28, 28, 1));
this._net.AddLayer(new ConvLayer(5, 5, 32) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new ConvLayer(5, 5, 32) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new PoolLayer(2, 2));
this._net.AddLayer(new DropoutLayer<double>() { DropProbability = 0.25 });

this._net.AddLayer(new ConvLayer(3, 3, 64) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new ConvLayer(3, 3, 64) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new PoolLayer(2, 2) { Stride = 2 });
this._net.AddLayer(new DropoutLayer<double>() { DropProbability = 0.25 });

this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new DropoutLayer<double>() { DropProbability = 0.5 });
this._net.AddLayer(new FullyConnLayer(10));
this._net.AddLayer(new SoftmaxLayer(10));

This seems close, but I'm not exactly sure how to translate the last section: I don't see any Flatten() or Dense() methods, so I'm not sure it will work at all.

Do you have any ideas how to duplicate this model section with your codebase?

Thanks,
Darren
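
Aside on padding: Keras padding='same' with stride 1 and an odd kernel of size k corresponds to an explicit pad of (k - 1) / 2. That gives Pad = 2 for the 5x5 convolutions, but Pad = 1 for the 3x3 ones, so the Pad = 2 on the 3x3 ConvLayers above may not match the Keras model. A quick sketch (same_pad and conv_out are hypothetical helpers, not ConvNetSharp APIs):

```python
def same_pad(kernel_size: int) -> int:
    """Explicit padding that reproduces Keras 'same' for stride-1
    convolutions with an odd kernel: output size equals input size."""
    return (kernel_size - 1) // 2

def conv_out(size: int, kernel: int, pad: int, stride: int = 1) -> int:
    """Standard convolution output-size formula."""
    return (size + 2 * pad - kernel) // stride + 1

# 5x5 kernel: 'same' needs Pad = 2, matching the C# code above
assert same_pad(5) == 2 and conv_out(28, 5, 2) == 28
# 3x3 kernel: 'same' needs Pad = 1; Pad = 2 would grow 14x14 to 16x16
assert same_pad(3) == 1 and conv_out(14, 3, 1) == 14
print(conv_out(14, 3, 2))  # 16
```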

cbovar (Owner) commented Feb 28, 2018

I believe Dense in Keras corresponds to FullyConnLayer + an activation layer (ReLU, sigmoid, etc.) in ConvNetSharp, so you should probably add FullyConnLayer(256) between the DropoutLayer and the ReluLayer.

fdncred (Author) commented Feb 28, 2018

Is this what you mean?

this._net = new Net<double>();
this._net.AddLayer(new InputLayer(28, 28, 1));
this._net.AddLayer(new ConvLayer(5, 5, 32) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new ConvLayer(5, 5, 32) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new PoolLayer(2, 2));
this._net.AddLayer(new DropoutLayer<double>() { DropProbability = 0.25 });

this._net.AddLayer(new ConvLayer(3, 3, 64) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new ConvLayer(3, 3, 64) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new PoolLayer(2, 2) { Stride = 2 });
this._net.AddLayer(new DropoutLayer<double>() { DropProbability = 0.25 });

this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new FullyConnLayer(256));
this._net.AddLayer(new DropoutLayer<double>() { DropProbability = 0.5 });
this._net.AddLayer(new FullyConnLayer(10));
this._net.AddLayer(new SoftmaxLayer(10));

cbovar (Owner) commented Feb 28, 2018

I would put FullyConnLayer(256) before the ReluLayer.

fdncred (Author) commented Feb 28, 2018

OK, thanks. I wasn't sure which DropoutLayer and ReluLayer you were referencing. I'll give this a whirl. It'll take a while to train, I'm sure.

fdncred (Author) commented Mar 6, 2018

I'm getting an exception during training when Backward() is called in the Train method. I was wondering if you could help.

In this exception, reserveSpace is null (cuDNN exception screenshot omitted).
It's called from this code, so really it's _volumeStorage.DropoutStorage that's null.

public override void DoDropoutGradient(Volume<double> input, Volume<double> outputGradient, Volume<double> inputGradient, double dropProbability)
{
    var inputStorage = _volumeStorage;
    var outputGradientStorage = outputGradient.Storage as VolumeStorage;
    var inputGradientStorage = inputGradient.Storage as VolumeStorage;
.....
        _context.CudnnContext.DropoutBackward(dropoutDesc,
            dOutputDesc, outputGradientStorage.DeviceBuffer,
            dDataDesc, inputGradientStorage.DeviceBuffer,
            _volumeStorage.DropoutStorage);
    }
}

I figure it has to do with the way I stacked these layers, but I'm not sure. I'm just trying to emulate the Python code above as closely as possible.

this._net = new Net<double>();
this._net.AddLayer(new InputLayer(28, 28, 1));
this._net.AddLayer(new ConvLayer(5, 5, 32) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new ConvLayer(5, 5, 32) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new PoolLayer(2, 2));
this._net.AddLayer(new DropoutLayer<double>() { DropProbability = 0.25 });

this._net.AddLayer(new ConvLayer(3, 3, 64) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new ConvLayer(3, 3, 64) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new PoolLayer(2, 2) { Stride = 2 });
this._net.AddLayer(new DropoutLayer<double>() { DropProbability = 0.25 });

this._net.AddLayer(new FullyConnLayer(256));
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new DropoutLayer<double>() { DropProbability = 0.5 });
this._net.AddLayer(new FullyConnLayer(10));
this._net.AddLayer(new SoftmaxLayer(10));

Any ideas?

Side Note: It seems to be working in CPU mode but it will take forever. ;)
Thanks,
Darren

cbovar (Owner) commented Mar 7, 2018

DropoutStorage is shared between the forward and backward passes. It is supposed to be allocated in the forward pass.
Dropout was only added recently and may have some bugs, though VolumeTests.Dropout() is passing.

I will try to reproduce and understand this tonight.
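
The coupling described above can be sketched in a few lines (a hypothetical minimal dropout class, not ConvNetSharp's implementation): the mask built in forward() is the shared state that backward() needs, analogous to cuDNN's reserveSpace / DropoutStorage, so running backward without a prior forward hits exactly this kind of null state.

```python
import random

class Dropout:
    """Minimal inverted-dropout sketch: the mask created in forward()
    is the shared state reused in backward()."""
    def __init__(self, drop_probability: float):
        self.p = drop_probability
        self.mask = None  # allocated in forward, required by backward

    def forward(self, x):
        keep = 1.0 - self.p
        # scale kept units by 1/keep so expected activation is unchanged
        self.mask = [1.0 / keep if random.random() < keep else 0.0
                     for _ in x]
        return [xi * mi for xi, mi in zip(x, self.mask)]

    def backward(self, grad):
        if self.mask is None:
            # the situation in the crash: backward ran with no stored state
            raise RuntimeError("backward() called before forward()")
        return [gi * mi for gi, mi in zip(grad, self.mask)]
```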

fdncred (Author) commented Mar 7, 2018

OK, good. Hopefully you can find something, because my estimation code says that training this model on the CPU will finish on 6/12/2018 at 3:45pm. LOL. So I won't be training this without GPU support.

I forgot to mention that I let this training run all night. It didn't get very far, but I had it save out the model when I stopped the training. The model was a 35 MB single-line JSON file. Wow! Not sure what makes it so big, but I can't imagine how big it'll be when I finish training this way.

cbovar (Owner) commented Mar 7, 2018

No more crash after PR #91

cbovar (Owner) commented Mar 9, 2018

The JSON file contains all parameters and their gradients. Over time their values will change, but there won't be more parameters, so the size of the file should remain close to 35 MB in your case.
Gradients are not needed for inference, so the file could be half its size.

fdncred (Author) commented Mar 9, 2018

Are you saying I can remove the FilterGradient and BiasGradient sections of the JSON entirely, to make the file smaller, and the model will still work for me?
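
If gradients can indeed be dropped for inference, a post-processing sketch might look like this. The FilterGradient / BiasGradient key names come from the question above; whether ConvNetSharp will still deserialize the stripped file is an assumption to verify.

```python
import json

GRADIENT_KEYS = {"FilterGradient", "BiasGradient"}  # names from the question above

def strip_gradients(node):
    """Recursively remove gradient entries from a parsed model JSON tree."""
    if isinstance(node, dict):
        return {k: strip_gradients(v) for k, v in node.items()
                if k not in GRADIENT_KEYS}
    if isinstance(node, list):
        return [strip_gradients(v) for v in node]
    return node

# Usage (hypothetical file names):
# model = json.load(open("model.json"))
# json.dump(strip_gradients(model), open("model.small.json", "w"))
```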

NickStrupat commented Sep 14, 2018

With this model I am getting the following exception with 0.4.11-alpha:

Exception has occurred: CLR/System.ArgumentException
An unhandled exception of type 'System.ArgumentException' occurred in ConvNetSharp.Volume.dll: 'Volume should have a Shape [1] to be converter to a System.Double'
   at ConvNetSharp.Volume.Volume`1.op_Implicit(Volume`1 v)
   at ConvNetSharp.Core.Layers.DropoutLayer`1.Backward(Volume`1 outputGradient)
   at ConvNetSharp.Core.Net`1.Backward(Volume`1 y)
   at ConvNetSharp.Core.Training.TrainerBase`1.Backward(Volume`1 y)
   at ConvNetSharp.Core.Training.TrainerBase`1.Train(Volume`1 x, Volume`1 y)
   at MnistDemo.Program.Train(Volume`1 x, Volume`1 y, Int32[] labels) in /Users/nick/Dev/MnistDemo/Program.cs:line 138
   at MnistDemo.Program.MnistDemo() in /Users/nick/Dev/MnistDemo/Program.cs:line 87
   at MnistDemo.Program.Main() in /Users/nick/Dev/MnistDemo/Program.cs:line 28
this._net = new Net<double>();
this._net.AddLayer(new InputLayer(28, 28, 1));
this._net.AddLayer(new ConvLayer(5, 5, 32) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new ConvLayer(5, 5, 32) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new PoolLayer(2, 2));
this._net.AddLayer(new DropoutLayer<double>(0.25));

this._net.AddLayer(new ConvLayer(3, 3, 64) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new ConvLayer(3, 3, 64) { Stride = 1, Pad = 2 });
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new PoolLayer(2, 2) { Stride = 2 });
this._net.AddLayer(new DropoutLayer<double>(0.25));

this._net.AddLayer(new FullyConnLayer(256));
this._net.AddLayer(new ReluLayer());
this._net.AddLayer(new DropoutLayer<double>(0.5));
this._net.AddLayer(new FullyConnLayer(10));
this._net.AddLayer(new SoftmaxLayer(10));
