Dog&Cat image classifier for course: Introduction to Artificial Intelligence

Requirement

Python 2.7
Tensorflow 1.4.1

Requirements can be installed by

pip install -r requirements.txt

Use virtualenv venv to manage python virtual environment is a better choice

Dataset

We use dataset from here, you may also want to use some external data like The Oxford-IIIT Pet Dataset

If you want to use your own dataset, make sure you have a data list containing the image path and label as follows:

dataset/train/cat.1.jpg 0
dataset/train/dog.2.jpg 1

The script ./dataset/create_dataset.py can help you create such list and split data into train and val by randomly sampling or K-fold

./create_dataset.py \
--data_split_type k-fold \
--fold_num 10 \
--labelmap ./label_map.txt \
--data_dir ./dataset/train

Pretrained Model

We use ImageNet pretrained Inception-ResNet-v2 from Tensorflow official repository, you need to download the checkpoint file as long as the code. You need to put the checkpoint file under ./pretrained_model like:

./pretrained_model/inception_resnet_v2.ckpt

If you want to use other pretrained backbone models, you need to put the network defination code under ./nets and prepare the checkpoint

Train

After datasets and pretrained model are prepared, you can train the model just run

./train.py \
--train_dataset dataset/train.txt \
--val_dataset dataset/val.txt \
--train_dir experiments/expr1 \
--learning_rate 1e-4 \
--epoch 100 \
--batch_size 32 \
--image_size 224 \
--pretrained_model ./pretrained_model/inception_resnet_v2.ckpt

If you want to resume training, just add --resume to the command above

Evaluation

There are several tools provided to evaluate the model.

Evaluation using accuracy, precision and recall

You can use eval.py to evaluate your model using val data and calculate the accuracy, precision and recall, the command is:

./eval.py \
--val_dataset dataset/val.txt \
--train_dir experiments/expr1 \
--checkpoint model-10000 \
--batch_size 128

Prediction

If you want to generate submission for the Dogs vs Cats Competition, you can use test.py :

./test.py \
--test_dataset dataset/test.txt \
--train_dir experiments/expr1 \
--checkpoint model-10000 \
--batch_size 128

The items in dataset/test.txt should arrange as follows:

dataset/1.jpg
dataset/2.jpg

If you have several models, you can use simple ensembling technique to improve your performance:

# Assume your submissions are placed as: 
# experiments/expr1-1/submission.csv
# experiments/expr1-2/submission.csv
# experiments/expr1-3/submission.csv
# ...

./ensemble.py \
--fold_num 10 \
--ensembled_root experiments/expr1 \
--submission_name submission.csv

Tools

There are also some tools for K-fold cross validation and ensembling, which will be completed soon.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
dataset		dataset
experiments		experiments
nets		nets
pretrained_model		pretrained_model
.gitignore		.gitignore
README.md		README.md
common.py		common.py
draw_roc.py		draw_roc.py
ensemble.py		ensemble.py
eval.py		eval.py
feature_extractor.py		feature_extractor.py
kford.sh		kford.sh
post_process.ipynb		post_process.ipynb
requirements.txt		requirements.txt
test.py		test.py
test.sh		test.sh
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dog&Cat image classifier for course: Introduction to Artificial Intelligence

Requirement

Dataset

Pretrained Model

Train

Evaluation

Evaluation using accuracy, precision and recall

Prediction

Tools

About

Releases

Packages

Languages

knwng/DogvsCat

Folders and files

Latest commit

History

Repository files navigation

Dog&Cat image classifier for course: Introduction to Artificial Intelligence

Requirement

Dataset

Pretrained Model

Train

Evaluation

Evaluation using accuracy, precision and recall

Prediction

Tools

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages