Simple convolutional neural network to detect cat bounding boxes in images. The system is restricted to one bounding box per image, which is localized using regression (i.e. directly predicting the bounding box coordinates). The model consists of 7 convolutional layers and 2 fully connected layers (including output layer).
- python 2.7 (only tested with that version)
- keras (tested in v1.06)
- scipy
- numpy
- scikit-image
- Download the 10k cats dataset and extract it, e.g. into directory
/foo/bar/10k-cats
. That directory should contain the subdirectoriesCAT_00
,CAT_01
, etc. - Train the model using
train_convnet.py --dataset="/foo/bar/10k-cats"
. - Apply the model using
train_convnet.py --dataset="/foo/bar/directory-with-cat-images"
.
Example results: