tiny-cnn: A C++11 implementation of deep learning (convolutional neural networks)

tiny-cnn is a C++11 implementation of deep learning (convolutional neural networks).

designing principles
comparison with other libraries
supported networks
dependencies
building sample project
examples
references
license

designing principles

fast, without GPU
- with TBB threading and SSE/AVX vectorization
- 98.8% accuracy on MNIST in 13 minutes training (@Core i7-3520M)
header only
- Just include tiny_cnn.h and write your model in c++. There is nothing to install.
policy-based design
small dependency & simple implementation

comparison with other libraries

	Language	Lines Of Code	License	Prerequisites	Platforms	Modeling By	GPU Support	Installing	Pre-Trained model
tiny-cnn	C++	3.1K	BSD(3-clause)	Boost,TBB	Linux/OS-X/Windows	C++ code	No	Unnecessary	No
caffe	C++(Python/Matlab interfaces available)	58.7K	BSD(2-clause)	CUDA,BLAS,Boost,OpenCV,protobuf,etc	Linux/OS-X	Config File	Yes	Necessary	Yes
Theano	Python	134K	BSD(3-clause)	Numpy,Scipy,BLAS,(optional:nose,Sphinx,CUDA etc)	Linux/OS-X/Windows	Python Code	Yes	Necessary	No

supported networks

layer-types

fully-connected layer
fully-connected layer with dropout
convolutional layer
average pooling layer
max-pooling layer

activation functions

tanh
sigmoid
rectified linear
identity

loss functions

cross-entropy
mean-squared-error

optimization algorithm

stochastic gradient descent (with/without L2 normalization and momentum)
stochastic gradient levenberg marquardt
adagrad
rmsprop

dependencies

building sample project

gcc(4.7~)

without tbb

./waf configure --BOOST_ROOT=your-boost-root
./waf build

with tbb

./waf configure --TBB --TBB_ROOT=your-tbb-root --BOOST_ROOT=your-boost-root
./waf build

with tbb and SSE/AVX

./waf configure --AVX --TBB --TBB_ROOT=your-tbb-root --BOOST_ROOT=your-boost-root
./waf build


./waf configure --SSE --TBB --TBB_ROOT=your-tbb-root --BOOST_ROOT=your-boost-root
./waf build

vc(2012~)

open vc/tiny_cnn.sln and build in release mode.

You can edit include/config.h to customize default behavior.

examples

construct convolutional neural networks

#include "tiny_cnn.h"
using namespace tiny_cnn;
using namespace tiny_cnn::activation;

void construct_cnn() {
    using namespace tiny_cnn;

    // specify loss-function and optimization-algorithm
    network<mse, adagrad> net;
    //network<cross_entropy, RMSprop> net;

    // add layers
    net << convolutional_layer<tan_h>(32, 32, 5, 1, 6) // 32x32in, conv5x5, 1-6 f-maps
        << average_pooling_layer<tan_h>(28, 28, 6, 2) // 28x28in, 6 f-maps, pool2x2
        << fully_connected_layer<tan_h>(14 * 14 * 6, 120)
        << fully_connected_layer<identity>(120, 10);

    assert(net.in_dim() == 32 * 32);
    assert(net.out_dim() == 10);
    
    // load MNIST dataset
    std::vector<label_t> train_labels;
    std::vector<vec_t> train_images;
    
    parse_mnist_labels("train-labels.idx1-ubyte", &train_labels);
    parse_mnist_images("train-images.idx3-ubyte", &train_images);
    
    // train (50-epoch, 30-minibatch)
    net.train(train_images, train_labels, 30, 50);
    
    // save
    std::ofstream ofs("weights");
    ofs << net;
    
    // load
    // std::ifstream ifs("weights");
    // ifs >> net;
}

construct multi-layer perceptron(mlp)

#include "tiny_cnn.h"
using namespace tiny_cnn;
using namespace tiny_cnn::activation;

void construct_mlp() {
    network<mse, gradient_descent> net;

    net << fully_connected_layer<sigmoid>(32 * 32, 300);
        << fully_connected_layer<identity>(300, 10);

    assert(net.in_dim() == 32 * 32);
    assert(net.out_dim() == 10);
}

another way to construct mlp

#include "tiny_cnn.h"
using namespace tiny_cnn;
using namespace tiny_cnn::activation;

void construct_mlp() {
    auto mynet = make_mlp<mse, gradient_descent, tan_h>({ 32 * 32, 300, 10 });

    assert(mynet.in_dim() == 32 * 32);
    assert(mynet.out_dim() == 10);
}

more sample, read main.cpp

references

[1] Y. Bengio, Practical Recommendations for Gradient-Based Training of Deep Architectures. arXiv:1206.5533v2, 2012

[2] Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86, 2278-2324.

other useful reference lists:

license

The BSD 3-Clause License

Name		Name	Last commit message	Last commit date
Latest commit History 120 Commits
data		data
include		include
src		src
test		test
vc		vc
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
waf		waf
wscript		wscript

varunbezzam/tiny-cnn

Folders and files

Latest commit

History

Repository files navigation