Binarized convolutional neural network for vehicle classification, implemented in CUDA and timed against cuDNN + cuBLAS (7.5x speedup over cuDNN with GEMM convolution). Paper accepted.
For more information about binarized neural networks, see:
http://papers.nips.cc/paper/6573-binarized-neural-networks.pdf
http://papers.nips.cc/paper/5647-binaryconnect-training-deep-neural-networks-with-binary-weights-during-propagations.pdf
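
As a rough illustration of why binarized layers can run faster than floating-point GEMM, the sketch below shows the XNOR/popcount-style dot product described in the binarized neural networks paper linked above. It is not taken from this repository's kernels: the function names `pack_signs` and `binary_dot`, the 32-bit packing, and the bit encoding (1 for +1, 0 for -1) are all assumptions made for the example, and it is written as plain host-side C++ rather than CUDA.

```cpp
#include <bitset>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: pack a vector of +1/-1 values into 32-bit words.
// Bit = 1 encodes +1, bit = 0 encodes -1. Unused padding bits in the
// last word stay 0, so they match in both operands and never count as
// differing bits.
std::vector<std::uint32_t> pack_signs(const std::vector<int>& v) {
    std::vector<std::uint32_t> words((v.size() + 31) / 32, 0u);
    for (std::size_t i = 0; i < v.size(); ++i) {
        if (v[i] > 0) {
            words[i / 32] |= (1u << (i % 32));
        }
    }
    return words;
}

// Hypothetical helper: dot product of two packed +1/-1 vectors of length n.
// Matching bits contribute +1 and differing bits contribute -1, so
// dot = n - 2 * popcount(a XOR b).
int binary_dot(const std::vector<std::uint32_t>& a,
               const std::vector<std::uint32_t>& b,
               int n) {
    int differing = 0;
    for (std::size_t w = 0; w < a.size(); ++w) {
        differing += static_cast<int>(std::bitset<32>(a[w] ^ b[w]).count());
    }
    return n - 2 * differing;
}
```

Each 32-bit word replaces 32 multiply-accumulate operations with one XOR and one population count. In a CUDA kernel the bit count would typically use the `__popc` intrinsic instead of `std::bitset`; how this project actually packs and reduces its operands may differ.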