img2txt

A series of experiments using convolutional neural nets and LSTMs to generate text from images.

As an introduction to the image-to-text problem space, I start with Stanford's Street View House Numbers (SVHN) dataset. The naive implementation in simple_svhn.ipynb is both a warm-up exercise and my final project for the Udacity Deep Learning course, which I completed in September 2017.

Setup

Make sure you have Docker installed!
Run ./setup.sh to create the Docker container
Run ./start.sh to start the Docker container
Copy the localhost URL printed in the terminal and paste it into your browser
Navigate to the /src folder to see the source code

Note that with this Docker configuration, datasets are stored only inside the container, whereas source files are accessible from both the container and the host.

Files

simple_svhn.ipynb - naive approach exhibiting some convergence on the SVHN dataset
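
For readers new to SVHN, here is a minimal preprocessing sketch. It assumes the cropped-digit release of the dataset (files such as train_32x32.mat), where images are stored as a (32, 32, 3, N) array and the digit 0 is labeled 10; whether simple_svhn.ipynb uses this release or these exact steps is an assumption. The function name prepare_svhn is hypothetical, and the arrays below are synthetic stand-ins rather than real data.

```python
import numpy as np


def prepare_svhn(X, y):
    """Return (N, 32, 32, 3) float images in [0, 1] and integer labels 0-9.

    Hypothetical helper for illustration; not taken from the notebook.
    """
    # The cropped-digit .mat files store images as (height, width, channels, N),
    # so move the sample axis to the front and scale pixels to [0, 1].
    images = np.transpose(X, (3, 0, 1, 2)).astype(np.float32) / 255.0
    # Labels arrive as an (N, 1) column in which the digit 0 is encoded as 10.
    # astype() returns a copy, so the caller's array is left untouched.
    labels = y.reshape(-1).astype(np.int64)
    labels[labels == 10] = 0
    return images, labels


# Synthetic stand-in with the same layout as the real arrays (assumption).
rng = np.random.default_rng(0)
X_fake = rng.integers(0, 256, size=(32, 32, 3, 4), dtype=np.uint8)
y_fake = np.array([[1], [10], [7], [10]], dtype=np.uint8)

images, labels = prepare_svhn(X_fake, y_fake)
print(images.shape, labels.tolist())  # (4, 32, 32, 3) [1, 0, 7, 0]
```

With the real files, X and y would typically come from the "X" and "y" entries of the dictionary returned by scipy.io.loadmat.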
