SimpsonRecognition

Training a Convolutional Neural Network to recognize The Simpson TV Show characters using Keras (TensorFlow backend).
To have more detailed explanations, see the blog article on Medium. If you like it, don't hesitate to recommend it (as for the dataset on Kaggle)

First part : Collecting data

The first part is collecting and labeling Simpson pictures.
Most of the pictures are from Simpson video, analyzed frame by frame.

Run python3 label_data.py into a folder with Simpson episodes (.avi format) to analyze them and label frames.
You crop each frame (left part, right part, full-frame, nothing) and then label it.

You can find the dataset on Kaggle

Second part : Training with Keras

The second part is training the model. My goal is to have 20 classes. I aim to have 1000 pictures per class, unfortunately some characters are not often on screen so I have fewer pictures for those characters. As you can see on the Jupyter notebook, I benchmark two models : 4 and 6 convolutional layers neural networks. Because of the small number of pictures (approx. 1k pictures per class), I use data augmentation.
Currently, I have 96% of accuracy (F1-Score) for 18 classes.

Third part : Faster R-CNN

The third part is upgrade the model to detect and recognize characters. I have to annotate data to get bounding boxes for characters for each picture in order to train a new model : Faster R-CNN (which is based on a Region Proposal Network). As usual, the annotation text file with the bounding boxes coordinates will be released soon.
The implementation on this network on Keras is from here by Yann Henon. I slightly edited the code : remove parts which are useless for my purpose).

Files description

label_data.py : tools functions for notebooks + script to name characters from frames from .avi videos
label_pointer.py : point with mouse clicks to save bounding box coordinates on annotations text file (from already labeled pictures)
train.py : training simple convnet
train_frcnn.py -p annotation.txt : training Faster R-CNN with data from the annotation text file
test_frcnn.py -p path/test_data/ : testing Faster R-CNN

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
faster_rcnn		faster_rcnn
pics		pics
Create DataSet.ipynb		Create DataSet.ipynb
Data Processing and Learning.ipynb		Data Processing and Learning.ipynb
README.md		README.md
label_data.py		label_data.py
label_pointer.py		label_pointer.py
test_frcnn.py		test_frcnn.py
train.py		train.py
train_frcnn.py		train_frcnn.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

faster_rcnn

faster_rcnn

pics

pics

Create DataSet.ipynb

Create DataSet.ipynb

Data Processing and Learning.ipynb

Data Processing and Learning.ipynb

README.md

README.md

label_data.py

label_data.py

label_pointer.py

label_pointer.py

test_frcnn.py

test_frcnn.py

train.py

train.py

train_frcnn.py

train_frcnn.py

Repository files navigation

SimpsonRecognition

First part : Collecting data

Second part : Training with Keras

Third part : Faster R-CNN

Files description

About

Releases

Packages

Languages

ichejun/SimpsonRecognition

Folders and files

Latest commit

History

Repository files navigation

SimpsonRecognition

First part : Collecting data

Second part : Training with Keras

Third part : Faster R-CNN

Files description

About

Resources

Stars

Watchers

Forks

Languages