Skip to content

jhasegaw/image2speech

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

image2speech

This repo is my attempt to reconstruct all of the stages in the paper http://www.isle.illinois.edu/sst/pubs/2018/hasegawajohnson_isga18.pdf

It is not yet complete. Currently it downloads the image set, and the captions, and the speech files, and their forced alignments, and generates cnnfeats from the images, and then runs XNMT to train the image-to-phone transducer. But the phone-to-speech transducer isn't there yet.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published