
Image Captioning Using Neural Networks -- Keras

Challenges!

Baseline:
  • KNN model for image captioning: extract image features using the SURF or GIST algorithm and feed them into a KNN model.
  • For prediction, find the closest training images based on these features.
  • Use the BLEU score to choose the best caption from the captions of the closest images (see the sketch after this list).
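
A minimal sketch of the baseline idea, assuming precomputed feature vectors and tokenized candidate captions; variable names such as train_features and train_captions are hypothetical, not the repo's:

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors
from nltk.translate.bleu_score import sentence_bleu

def knn_caption(query_feature, train_features, train_captions, k=5):
    """Caption a query image using its k nearest training images.

    train_features: (N, D) array of SURF/GIST feature vectors, one row per image.
    train_captions: list of N tokenized captions, e.g. ['a', 'dog', 'runs'].
    """
    knn = NearestNeighbors(n_neighbors=k).fit(train_features)
    _, indices = knn.kneighbors(query_feature.reshape(1, -1))
    candidates = [train_captions[i] for i in indices[0]]

    # Pick the candidate whose BLEU score against the other neighbours'
    # captions is highest (a simple "consensus" caption).
    best = max(candidates,
               key=lambda c: sentence_bleu([o for o in candidates if o is not c], c))
    return ' '.join(best)
```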
Final Model:
  • Use a VGG16 or VGG19 CNN to extract features from the images.
  • Use an LSTM model to generate the captions (a rough sketch follows).
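
A rough Keras sketch of this kind of CNN+LSTM captioning model; the vocabulary size, caption length, and layer sizes are assumptions and may differ from the architecture in image_rnn.py:

```python
from keras.layers import Input, Dense, Embedding, LSTM, Dropout, add
from keras.models import Model

vocab_size = 5000   # assumed vocabulary size
max_length = 30     # assumed maximum caption length

# Image branch: 4096-d VGG16/VGG19 fc-layer features -> dense projection
image_input = Input(shape=(4096,))
image_dense = Dense(256, activation='relu')(Dropout(0.5)(image_input))

# Caption branch: word indices -> embedding -> LSTM
caption_input = Input(shape=(max_length,))
caption_embed = Embedding(vocab_size, 256, mask_zero=True)(caption_input)
caption_lstm = LSTM(256)(Dropout(0.5)(caption_embed))

# Merge both branches and predict the next word of the caption
merged = add([image_dense, caption_lstm])
output = Dense(vocab_size, activation='softmax')(Dense(256, activation='relu')(merged))

model = Model(inputs=[image_input, caption_input], outputs=output)
model.compile(loss='categorical_crossentropy', optimizer='adam')
model.summary()
```

At prediction time, a caption is generated word by word: feed the image features plus the words produced so far, take the most likely next word, and repeat until an end token or the maximum length is reached.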

Dataset:

Run the baseline

  • Run image_knn.py

Train the LSTM model

  • Run image_rnn.py

LSTM prediction

  • Run image_rnn_predict.py

VGG Feature Extractor (CNN)

  • Use the 16-layer version of the VGG CNN to extract features (a sketch follows).
  • Pre-trained model: put the pre-trained model file in the model folder.
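
A minimal sketch of VGG16 feature extraction using the weights bundled with keras.applications; the repo instead loads its own pre-trained model file from the model folder:

```python
import numpy as np
from keras.applications.vgg16 import VGG16, preprocess_input
from keras.preprocessing import image
from keras.models import Model

base = VGG16(weights='imagenet')
# Drop the final classification layer and keep the 4096-d fc2 activations.
extractor = Model(inputs=base.input, outputs=base.get_layer('fc2').output)

def extract_features(img_path):
    img = image.load_img(img_path, target_size=(224, 224))
    x = preprocess_input(np.expand_dims(image.img_to_array(img), axis=0))
    return extractor.predict(x)[0]   # shape (4096,)
```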

Present Results

  • To present the results, we use Flask and display them as web pages.
  • Run python app.py and open 127.0.0.1:4555 in a browser to see the results (a minimal sketch follows).
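
A minimal Flask sketch serving results on port 4555; the route and template are placeholders, not necessarily those in app.py:

```python
from flask import Flask, render_template_string

app = Flask(__name__)

@app.route('/', methods=['GET'])
def index():
    # In the real app, this would render a template listing images
    # and their generated captions.
    return render_template_string('<h1>Image Captioning Demo</h1>')

if __name__ == '__main__':
    app.run(host='127.0.0.1', port=4555)
```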

Dependencies:

  • OpenCV: for feature extraction (SIFT, SURF, ORB); run install-opencv.sh (see the sketch after this list).
  • GIST: a wrapper for Lear's GIST implementation written in C; follow the instructions here.
  • Tensorflow
  • Keras
  • Flask
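
A small sketch of local-feature extraction with OpenCV; ORB is used here because SIFT/SURF availability depends on the OpenCV build (SURF requires the non-free opencv-contrib modules):

```python
import cv2

def orb_descriptors(img_path, n_features=500):
    # Detect keypoints and compute binary descriptors on the grayscale image.
    img = cv2.imread(img_path, cv2.IMREAD_GRAYSCALE)
    orb = cv2.ORB_create(nfeatures=n_features)
    keypoints, descriptors = orb.detectAndCompute(img, None)
    return descriptors   # (num_keypoints, 32) uint8 array, or None if no keypoints
```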