Web Application for Captioning Image using Neural Network
Jupyter Notebook Lua Python Other
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
coco-caption
coco
cv
misc
vis
.DS_Store
.gitignore
301270756.gif
LICENSE
README.md
Screen Shot 2016-05-31 at 7.19.45 AM.png
convert_checkpoint_gpu_to_cpu.lua
eval.lua
execute.py
prepro.py
test_language_model.lua
train.lua
videocaptioning.lua

README.md

ImageSub.

ImageSub is web app project using Recurrent Neural Network. This project based on NeuralTalk2 by [Andrej Karpathy] (https://github.com/karpathy). I modify the project in order to interact with the user via a web application. So Users can upload pictures or photographs that they want to identify its caption. If you prefer using Docker and Restful version, you can using neuraltalk2-web : https://github.com/jacopofar/neuraltalk2-web

This is an early code release that works great but is slightly hastily released and probably requires some code reading of inline comments (which I tried to be quite good with in general). I will be improving it over time but wanted to push the code out there because I promised it to too many people.

teaser results

Requirements

Now you need to clone or download this project into your machine and open open vis/app.py, and edit line :

subprocess.call('th eval.lua -model ../Public/model_id1-501-1448236541_cpu.t7 -image_folder '+folder_path+' -num_images 1 -result_folder vis/'+dir, shell=True, cwd="../") 

-model is where your model located. Run python server

$ cd vis
$ python app.py

Now visit localhost:8000 in your browser.

teaser results

"I only have CPU". Okay, in that case download the cpu model checkpoint. Make sure you add -gpuid -1 in vis/app.py to tell the script to run on CPU.

License

BSD License.