Final Project
Python Shell
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
alignment_model
bash_scripts
cnn_code
confusionMatrix
convnet_test
mnist_grid
6_867_Project_Writeup.pdf
README.md

README.md

6.867-Final-Project

In the Fall 2016 semester, Sitara Persad, Andrew Xia, and Karan Kashyap worked on constructing models for the direct bi-directional classification of speech and images. For our final project, we trained two Convolutional Neural Networks to map image representations of digits to their spoken equivalent, achieving an image annotation accuracy of 88.5% and an image retrieval accuracy of 87.6%.

Our paper can be viewed here.