Skip to content

thomalm/svhn-multi-digit

Repository files navigation

Learning to Recognise Multiple Digits in Real-World Images with Convolutional Neural Networks

This project explores how Convolutional Neural Networks (ConvNets) can be used to identify series of digits in natural images taken from The Street View House Numbers (SVHN) dataset.

All the code in this project has been written using Python 2.7 and the Convolutional Neural Networks have been implemented using TensorFlow.

What is contained in this repository?

  • 01-svhn-single-preprocessing.ipynb: contains code for pre-processing the SVHN 32-by-32 images
  • 02-svhn-single-model.ipynb: contains code for implementing a simple ConvNet for recognising single digits in the SVHN 32-by-32 images using TensorFlow
  • 03-mnist-synthetic-dataset.ipynb: contains code for creating a simplistic synthetic dataset of images with multiple digits created from the MNIST dataset
  • 04-mnist-synthetic-model.ipynb: contains code for implementing a ConvNet for recognising multiple digits in the synthetic MNIST dataset using TensorFlow
  • 05-svhn-multi-preprocessing.ipynb: contains code for pre-processing the original SVHN images
  • 06-svhn-multi-model.ipynb: contains code for implementing a ConvNet for recognising multiple digits from the original SVHN dataset using TensorFlow and TensorBoard.

Tensorboard

To start Tensorboard run tensorboard --logdir=logs/<log dir>/ in the project folder.

Resources