simpleCNN

simpleCNN provides a convolution layer, a pooling layer, a fully connected (FC) layer, and a softmax layer. The convolution, pooling, and FC layers can be stacked to build deeper neural networks.

A convolution layer and a max pooling layer are added to my vanilla neural network framework.

With these two additional layers, a CNN can be built with this simple framework. The main.py file demonstrates how to use the framework to build a CNN and how to train it on the MNIST dataset.

Construct the CNN

This neural network contains 1 convolution layer, 1 max pooling layer, 3 fully connected hidden layers, and a softmax output layer.

The convolution layer has 16 kernels: 8 are 3x3 and 8 are 5x5. With zero-padding, each kernel preserves the dimensions of the input data. The max pooling layer does 2x2 non-overlapping max pooling, so each output dimension is half of the input dimension (or half+1 when the input dimension is odd).
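To make the two dimension rules concrete, here is a minimal NumPy sketch of a zero-padded convolution and a 2x2 non-overlapping max pooling on a single 2-D image. It is not the code in conv_layer or pooling_layer: the function names conv2d_same and max_pool are made up for illustration, the convolution is written as a plain cross-correlation without an activation function, and keeping the partial window at the edge is an assumption chosen to match the "half+1" behaviour described above.

```python
import numpy as np


def conv2d_same(image, kernel):
    """Cross-correlate a 2-D image with an odd-sized square kernel.

    The image is zero-padded by kernel_size // 2 on every side, so the
    output has the same height and width as the input.
    """
    k = kernel.shape[0]
    pad = k // 2  # 1 for a 3x3 kernel, 2 for a 5x5 kernel
    padded = np.pad(image, pad, mode="constant")
    out = np.zeros(image.shape, dtype=float)
    for r in range(image.shape[0]):
        for c in range(image.shape[1]):
            out[r, c] = np.sum(padded[r:r + k, c:c + k] * kernel)
    return out


def max_pool(feature_map, size=2):
    """Non-overlapping max pooling.

    A partial window at the bottom/right edge is kept (assumption), so an
    odd dimension n yields n // 2 + 1 outputs.
    """
    rows, cols = feature_map.shape
    out_rows = -(-rows // size)  # ceiling division
    out_cols = -(-cols // size)
    out = np.zeros((out_rows, out_cols), dtype=float)
    for r in range(out_rows):
        for c in range(out_cols):
            window = feature_map[r * size:(r + 1) * size,
                                 c * size:(c + 1) * size]
            out[r, c] = window.max()
    return out


if __name__ == "__main__":
    image = np.arange(28 * 28, dtype=float).reshape(28, 28)
    for k in (3, 5):
        fmap = conv2d_same(image, np.ones((k, k)))
        print(k, fmap.shape, max_pool(fmap).shape)
    # An odd-sized input keeps the leftover row and column.
    print(max_pool(np.ones((5, 5))).shape)
```

Both kernel sizes leave a 28x28 MNIST image at 28x28, and the pooling step reduces it to 14x14.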

def get_kernels():
    result = []
    uuid = 1

    # 1. 3x3 kernels
    for i in range(8):
        func = activation.reluFunc
        if i % 3 == 0:
            func = activation.tanhFunc
        kernel = conv_layer.Kernel(3, func, uuid)
        result.append(kernel)
        uuid += 1

    # 2. 5x5 kernels
    for i in range(8):
        func = activation.reluFunc
        if i % 3 == 0:
            func = activation.tanhFunc
        kernel = conv_layer.Kernel(5, func, uuid)
        result.append(kernel)
        uuid += 1
    return result


def construct_cnn(l2=0.0):
    img_input = nn_layer.InputLayer("mnist_input", 784)
    output_layer = nn_layer.SoftmaxOutputLayer("mnist_output", 10)

    # 1. set input and output layers
    nn = simple_nn.NNetwork()
    nn.set_input(img_input)
    nn.set_output(output_layer)

    # 2. add Conv-Pooling layers
    c1 = conv_layer.ConvLayer("conv1")
    c1.set_kernels(get_kernels())
    nn.add_hidden_layer(c1)

    # 2x2 non-overlapping max-pooling
    p1 = pooling_layer.MaxPoolingLayer("pool1", 2, 2)
    nn.add_hidden_layer(p1)

    # 3. add some fully connected hidden layers
    h1 = nn_layer.HiddenLayer("h1", 512, activation.tanhFunc)
    h1.set_lambda2(l2)
    nn.add_hidden_layer(h1)

    h2 = nn_layer.HiddenLayer("h2", 128, activation.tanhFunc)
    h2.set_lambda2(l2)
    nn.add_hidden_layer(h2)

    h3 = nn_layer.HiddenLayer("h3", 10, activation.reluFunc)
    h3.set_lambda2(l2)
    nn.add_hidden_layer(h3)

    # 4. complete nn construction
    # print("%s" % (nn))
    # feed a dummy 28x28 image before connecting the layers
    fake_img = np.zeros((1, 28, 28))
    img_input.feed(fake_img)
    nn.connect_layers()
    print(nn.get_detail())
    return nn
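The layer sizes that result from this construction can be estimated by hand. The sketch below is only arithmetic, not output from nn.get_detail(). It assumes that each of the 16 kernels produces one 28x28 feature map, that the pooled maps are flattened before h1, and that the softmax output layer has its own 10x10 weight matrix; biases are left out because the source does not say how they are stored.

```python
def pooled(n, size=2):
    """Output length of non-overlapping pooling along one dimension."""
    return -(-n // size)  # ceiling division: half, or half+1 for odd n


height, width = 28, 28   # MNIST image size (784 pixels)
num_kernels = 16         # 8 kernels of 3x3 plus 8 kernels of 5x5

# Zero-padding keeps the spatial size; one feature map per kernel (assumed).
conv_shape = (num_kernels, height, width)
pool_shape = (num_kernels, pooled(height), pooled(width))

# Flattened pooling output feeding the first hidden layer (assumed).
flat_size = pool_shape[0] * pool_shape[1] * pool_shape[2]

# h1, h2, h3, then the 10-way softmax output layer.
layer_sizes = [flat_size, 512, 128, 10, 10]
fc_weights = [a * b for a, b in zip(layer_sizes, layer_sizes[1:])]
kernel_weights = 8 * 3 * 3 + 8 * 5 * 5

print("conv output:", conv_shape)
print("pool output:", pool_shape)
print("inputs to h1:", flat_size)
print("FC weights per layer:", fc_weights)
print("kernel weights:", kernel_weights)
```

Under these assumptions almost all of the weights sit between the pooling layer and h1, while the convolution kernels themselves hold only a few hundred.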

Run it

1. get data

cd simpleCNN
cd data
sh get.sh

2. train the model

cd simpleCNN
python main.py

Because of the convolution layer, training is very slow: it takes around 4 hours to finish one epoch. But the result is promising: after the second epoch, the model reaches 98.54% accuracy on the test set.

[2017-09-30 14:43:45.445192][test] accuracy=0.9741, avg_cost=0.0796
[2017-09-30 16:17:21.345727][train] accuracy=0.9792, avg_cost=0.0673
[2017-09-30 20:55:07.247804][test] accuracy=0.9854, avg_cost=0.0480
[2017-09-30 22:28:31.628410][train] accuracy=0.9890, avg_cost=0.0345
[2017-10-01 11:13:07.159749][test] accuracy=0.9861, avg_cost=0.0426
[2017-10-01 12:41:43.706586][train] accuracy=0.9936, avg_cost=0.0216
[2017-10-01 16:44:47.117386][test] accuracy=0.9862, avg_cost=0.0412
[2017-10-01 18:19:10.587616][train] accuracy=0.9951, avg_cost=0.0163
[2017-10-01 22:32:35.422284][test] accuracy=0.9882, avg_cost=0.0379
[2017-10-02 00:05:24.414455][train] accuracy=0.9973, avg_cost=0.0108
[2017-10-02 12:37:03.867947][test] accuracy=0.9894, avg_cost=0.0351
[2017-10-02 14:11:40.970246][train] accuracy=0.9984, avg_cost=0.0073
[2017-10-02 19:02:01.563857][test] accuracy=0.9896, avg_cost=0.0348
[2017-10-02 20:37:20.065452][train] accuracy=0.9990, avg_cost=0.0060
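The log lines above follow a regular pattern, so they can be summarized with a few lines of Python. This is a small sketch that is not part of the repository; the names LINE_RE and parse_log are made up for illustration, and the sample text is copied from the first four lines of the log above.

```python
import re
from datetime import datetime

# [timestamp][train|test] accuracy=..., avg_cost=...
LINE_RE = re.compile(
    r"\[(?P<ts>[^\]]+)\]\[(?P<split>train|test)\]\s+"
    r"accuracy=(?P<acc>[\d.]+),\s*avg_cost=(?P<cost>[\d.]+)"
)


def parse_log(text):
    """Return one dict per log line that matches the pattern."""
    records = []
    for line in text.splitlines():
        match = LINE_RE.search(line)
        if match is None:
            continue  # skip lines in another format
        records.append({
            "time": datetime.strptime(match.group("ts"),
                                      "%Y-%m-%d %H:%M:%S.%f"),
            "split": match.group("split"),
            "accuracy": float(match.group("acc")),
            "avg_cost": float(match.group("cost")),
        })
    return records


sample = """\
[2017-09-30 14:43:45.445192][test] accuracy=0.9741, avg_cost=0.0796
[2017-09-30 16:17:21.345727][train] accuracy=0.9792, avg_cost=0.0673
[2017-09-30 20:55:07.247804][test] accuracy=0.9854, avg_cost=0.0480
[2017-09-30 22:28:31.628410][train] accuracy=0.9890, avg_cost=0.0345
"""

records = parse_log(sample)
test_records = [r for r in records if r["split"] == "test"]
best = max(test_records, key=lambda r: r["accuracy"])
print("best test accuracy:", best["accuracy"], "at", best["time"])
```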

For comparison, a similar simple NN model (without the ConvLayer and MaxPoolingLayer) reaches 94.74% accuracy on the test set after the first epoch, and only 98.11% at best (after 11 epochs).

[2017-09-24 23:12:26.834471][train] accuracy=0.9526, avg_cost=0.1555
[2017-09-24 23:12:27.730683][test] accuracy=0.9474, avg_cost=0.1725
