
Benchmark: 2 conv avg pool + 1 fc #47

Closed
rfratila opened this issue Aug 31, 2017 · 7 comments
rfratila commented Aug 31, 2017

No preprocessing. See source code for exact network config.

Fashion-MNIST test accuracy: 97.39 %
Digit-MNIST test accuracy: 99.13 %

Source code: https://github.com/rfratila/Vulcan/blob/master/train_mnist_conv.py

Built with Lasagne and Theano
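
For orientation, a minimal Lasagne sketch of the "2 conv + avg pool + 1 fc" layout in the title might look like the following. The filter counts (16 and 32), the 5x5 kernels, and the 512-unit dense layer are taken from the Keras reconstruction later in this thread, not verified against train_mnist_conv.py:

from lasagne.layers import (InputLayer, Conv2DLayer, Pool2DLayer,
                            DenseLayer, DropoutLayer)
from lasagne.nonlinearities import rectify, softmax

net = InputLayer(shape=(None, 1, 28, 28))
net = Conv2DLayer(net, num_filters=16, filter_size=(5, 5),
                  nonlinearity=rectify)   # pad=0 ('valid') by default
net = Pool2DLayer(net, pool_size=(2, 2),
                  mode='average_exc_pad')  # average excludes padding, per the note below
net = Conv2DLayer(net, num_filters=32, filter_size=(5, 5),
                  nonlinearity=rectify)
net = Pool2DLayer(net, pool_size=(2, 2), mode='average_exc_pad')
net = DenseLayer(net, num_units=512, nonlinearity=rectify)
net = DropoutLayer(net, p=0.3)
net = DenseLayer(net, num_units=10, nonlinearity=softmax)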

kashif (Collaborator) commented Sep 2, 2017

Thanks @rfratila, I will confirm this and get back to you.

kashif (Collaborator) commented Sep 3, 2017

Is the network something like this:

from keras.models import Sequential
from keras.layers import Conv2D, AveragePooling2D, Dense, Dropout, Flatten

# input_shape (e.g. (28, 28, 1)) and num_classes (10) defined elsewhere
model = Sequential()
model.add(Conv2D(16, kernel_size=(5, 5),
                 activation='relu',
                 input_shape=input_shape))
model.add(Conv2D(32, (5, 5), activation='relu'))
model.add(AveragePooling2D(pool_size=(2, 2)))
model.add(Flatten())
model.add(Dense(512, activation='relu'))
model.add(Dropout(0.3))
model.add(Dense(num_classes, activation='softmax'))

because over 200 epochs I can only manage a test accuracy of 0.9156 on fashion-mnist.
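
For context, the training setup behind that number might look something like the sketch below; the optimizer, batch size, and data loading are assumptions, not taken from the thread:

from keras.utils import to_categorical

# x_train, y_train, x_test, y_test assumed already loaded (e.g. via this
# repo's own reader), reshaped to (n, 28, 28, 1) and scaled to [0, 1]
model.compile(loss='categorical_crossentropy',
              optimizer='adam',
              metrics=['accuracy'])
model.fit(x_train, to_categorical(y_train, num_classes),
          batch_size=128, epochs=200)
score = model.evaluate(x_test, to_categorical(y_test, num_classes))
print('Test accuracy:', score[1])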

rfratila (Author) commented Sep 3, 2017

There should be another average pool between the two conv2d layers. Also, I think Keras defaults to no padding, which is what I use for both conv2d layers, but I'm not sure whether the Keras AveragePooling layer includes the extra padding in its average (I exclude it). Finally, I trained with cuDNN on a GPU but ran the tests on my computer, which only has CPUs, and I'm not sure whether Theano gives identical results on GPU and CPU.
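
To make the padding distinction concrete, here is a small NumPy sketch (not from the thread) of the two average-pooling conventions. On an odd-sized feature map the border windows are partial, and the two modes disagree exactly there:

import numpy as np

x = np.arange(25, dtype=float).reshape(5, 5)  # odd-sized feature map

def avg_pool_2x2(x, include_pad):
    # 2x2 average pooling, stride 2; border windows may be smaller than 2x2
    out = np.zeros(((x.shape[0] + 1) // 2, (x.shape[1] + 1) // 2))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            win = x[2*i:2*i+2, 2*j:2*j+2]
            denom = 4 if include_pad else win.size  # count padded zeros or not
            out[i, j] = win.sum() / denom
    return out

print(avg_pool_2x2(x, include_pad=True)[-1, -1])   # 6.0  -> 24 / 4, zero-padding counted
print(avg_pool_2x2(x, include_pad=False)[-1, -1])  # 24.0 -> 24 / 1, padding excluded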

kashif (Collaborator) commented Sep 3, 2017

model = Sequential()
model.add(Conv2D(16, kernel_size=(5, 5),
                 activation='relu',
                 input_shape=input_shape))
model.add(AveragePooling2D(pool_size=(2,2)))
model.add(Conv2D(32, (5, 5), activation='relu'))
model.add(AveragePooling2D(pool_size=(2, 2)))
model.add(Flatten())
model.add(Dense(512, activation='relu'))
model.add(Dropout(0.3))
model.add(Dense(num_classes, activation='softmax'))

With this model I get, after 200 epochs, a test accuracy of 0.9144 on fashion-mnist.

I think your 97.39% test accuracy is a bit fishy.

rfratila (Author) commented Sep 3, 2017

Interesting. I have the trained model saved; when I run tests on it on the CPU I get ~97%, and when I run the exact same thing on the GPU I get ~92% (in both cases on the test set). Any idea why that may be?

kashif (Collaborator) commented Sep 4, 2017

Hard to say why... I would remove one layer at a time and compare the CPU and GPU versions to see whether a particular layer is responsible. Start with the Dropout layer, then perhaps the average-pooling layers, etc.
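
One way to do that layer-by-layer comparison (a sketch, assuming the Keras model above and a loaded x_test; run it once per device and diff the saved arrays):

import numpy as np
from keras.models import Model

# Run twice - e.g. THEANO_FLAGS=device=cpu and THEANO_FLAGS=device=cuda -
# then compare the saved files with np.allclose
for i, layer in enumerate(model.layers):
    probe = Model(inputs=model.input, outputs=layer.output)
    np.save('layer_%02d.npy' % i, probe.predict(x_test[:64]))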

After that, I would try a simpler SGD optimiser to see whether the CPU and GPU results become similar...

Also have a look at your .theanorc file to check whether Theano is defaulting to, say, float64 on the CPU... good luck!
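
For that last check, a minimal .theanorc that pins the dtype (so the CPU run cannot silently fall back to float64) looks like this:

[global]
floatX = float32
device = cpu

You can confirm what is actually in effect with python -c "import theano; print(theano.config.floatX)".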

hanxiao (Collaborator) commented Sep 4, 2017

Closing, as this is not a valid benchmark.
