Implementation of the models for the thesis "Investigating automated bird detection from webcams using machine learning"

Introduction

This repository hosts all the scripts and procedures used in the implementation of bird detection models. We use the Faster R-CNN and Single Shot Detector (SSD) convolutional neural network (CNN) meta-architectures together with the MobileNet-v2, ResNet50, ResNet101, ResNet152, and Inception ResNet-v2 feature extraction networks (backbones).

All models were initialized with weights pre-trained on the MS COCO dataset, as provided by the TensorFlow Object Detection API, and then fine-tuned on our dataset. The data used in this study were collected from the live FeederWatch cams of the Cornell Lab Bird Cams, sited in different parts of the United States. We used auto-screen software that captured approximately 1-megapixel JPEG colour images at a resolution of 1366x768 pixels from the feeds. At one of the stations, the Treman Bird Feeding Garden in Ithaca, New York, an Axis P1448-LE camera is used for the recordings. A total of 10,592 images of varying quality were collected and labelled manually using the LabelImg image annotation tool (https://github.com/tzutalin/labelImg).

The repository provides all the files used to train and evaluate the models. However, only a sample of our data is attached here, since the full 10 GB dataset exceeds the space allowed in a free GitHub repository. The full dataset has been published on Zenodo for public access: https://zenodo.org/record/5172214#.YSO8YI4zZhH

Table of Contents

Introduction
System Requirements
Procedure

System Requirements

We ran the experiments on an MSI GL75 Leopard 10SFR laptop with:

  1. CUDA 11.0
  2. cuDNN SDK 8.0.4
  3. Windows 10 x64
  4. 10th Gen Intel Core i7-10750H CPU
  5. GeForce RTX 2070 8GB GDDR6 graphics processing unit (GPU)
  6. 32GB DDR4 RAM

CUDA and cuDNN were downloaded and installed following the instructions on the official NVIDIA website.

Procedure

Preparing the environment

Setting up a virtual environment

We opened the Anaconda Prompt as an administrator and created a virtual environment called "thesis_models" using the following command:

C:\> conda create -n thesis_models pip python=3.8

Afterwards, we activated the virtual environment using:

C:\> activate thesis_models

Then, we installed the following packages.

(thesis_models) C:\> pip install tensorflow==2.5.0 tensorflow-gpu==2.5.0
(thesis_models) C:\> conda install -c anaconda protobuf
(thesis_models) C:\> pip install pillow
(thesis_models) C:\> pip install lxml
(thesis_models) C:\> pip install Cython
(thesis_models) C:\> pip install contextlib2
(thesis_models) C:\> pip install jupyter
(thesis_models) C:\> pip install matplotlib

We then cloned the TensorFlow models repository, which contains the Object Detection API, into the "thesis" folder using:

(thesis_models) C:\thesis> git clone https://github.com/tensorflow/models.git

At this stage, the cloned models repository sat inside the thesis folder.

Installing Protobuf and Object Detection API

Protobufs are used by the TensorFlow Object Detection API to configure model and training parameters.

This was achieved using the following command.

(thesis_models) C:\thesis\mobilenet\models\research> protoc --python_out=. .\object_detection\protos\anchor_generator.proto .\object_detection\protos\argmax_matcher.proto .\object_detection\protos\bipartite_matcher.proto .\object_detection\protos\box_coder.proto .\object_detection\protos\box_predictor.proto .\object_detection\protos\eval.proto .\object_detection\protos\faster_rcnn.proto .\object_detection\protos\faster_rcnn_box_coder.proto .\object_detection\protos\grid_anchor_generator.proto .\object_detection\protos\hyperparams.proto .\object_detection\protos\image_resizer.proto .\object_detection\protos\input_reader.proto .\object_detection\protos\losses.proto .\object_detection\protos\matcher.proto .\object_detection\protos\mean_stddev_box_coder.proto .\object_detection\protos\model.proto .\object_detection\protos\optimizer.proto .\object_detection\protos\pipeline.proto .\object_detection\protos\post_processing.proto .\object_detection\protos\preprocessor.proto .\object_detection\protos\region_similarity_calculator.proto .\object_detection\protos\square_box_coder.proto .\object_detection\protos\ssd.proto .\object_detection\protos\ssd_anchor_generator.proto .\object_detection\protos\string_int_label_map.proto .\object_detection\protos\train.proto .\object_detection\protos\keypoint_box_coder.proto .\object_detection\protos\multiscale_anchor_generator.proto .\object_detection\protos\graph_rewriter.proto .\object_detection\protos\calibration.proto .\object_detection\protos\flexible_grid_anchor_generator.proto

Running the command above creates a name_pb2.py file for every .proto file in the protos folder.

The Object Detection API is then installed from its object_detection package using the two commands below.

(thesis_models) C:\thesis\mobilenet\models\research> python setup.py build
(thesis_models) C:\thesis\mobilenet\models\research> python setup.py install

Verifying our Installation

To verify the installation, we used the following command:

(thesis_models) C:\thesis\mobilenet\models\research> python object_detection/builders/model_builder_tf2_test.py

All the tests passed, confirming that the installation was successful.

After successfully installing the Object Detection API, we moved on to preparing the data and training our models.

Data Pre-processing

Dataset Partitioning

Using the data_partitioning.py script in the pre-processing folder, we split the dataset into training, validation and testing sets.

We first split the dataset into training and testing sets in a ratio of 80:20. Before splitting, copy all training images, together with their corresponding *.xml annotation files, into a single folder.

Afterwards, 20% of the training data was set aside as a validation set.

Usage:

data_partitioning.py -x -i [PATH_TO_IMAGES_FOLDER] -r [x]

where;

  • PATH_TO_IMAGES_FOLDER -- the directory path where the images and their XML annotation files are stored. The training and testing sets are created in this same directory.
  • x -- the splitting ratio; for example, if x = 0.2, the training set is 80% of the whole dataset and the testing set is 20%.

For more info: https://github.com/sglvladi/TensorFlowObjectDetectionTutorial
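
For orientation, the sketch below shows the kind of split such a script performs: shuffle the images, set a fraction aside, and copy each image with its matching *.xml file. The folder names, extensions, and helper function here are illustrative assumptions, not the repository's exact code.

# Illustrative sketch of an 80:20 image/annotation split
# (not the exact contents of data_partitioning.py).
import os
import random
import shutil

def partition(images_dir, ratio=0.2, seed=42):
    images = [f for f in os.listdir(images_dir)
              if f.lower().endswith(('.jpg', '.jpeg', '.png'))]
    random.Random(seed).shuffle(images)
    n_test = int(len(images) * ratio)
    splits = {'test': images[:n_test], 'train': images[n_test:]}
    for split, files in splits.items():
        out_dir = os.path.join(images_dir, split)
        os.makedirs(out_dir, exist_ok=True)
        for img in files:
            shutil.copy(os.path.join(images_dir, img), out_dir)
            xml = os.path.splitext(img)[0] + '.xml'  # matching annotation
            if os.path.exists(os.path.join(images_dir, xml)):
                shutil.copy(os.path.join(images_dir, xml), out_dir)

partition('images', ratio=0.2)  # 80:20 train/test split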

Convert *.xml to *.csv

Script: xml_to_csv.py

The script generates CSV files from the XML annotation files.

Usage: place the training, validation, and testing dataset folders into a single folder named "image". A different folder name can be used if you update lines 30 and 32 of the script accordingly.

Then run

python xml_to_csv.py
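
As a rough illustration of what the conversion does, the sketch below walks the Pascal VOC XML files produced by LabelImg and flattens each bounding box into one CSV row. The column layout follows the common tutorial format and may differ in detail from the repository script.

# Illustrative sketch of the XML-to-CSV conversion
# (column layout is an assumption based on the usual Pascal VOC format).
import csv
import glob
import os
import xml.etree.ElementTree as ET

def xml_to_csv(folder, out_csv):
    fields = ['filename', 'width', 'height', 'class',
              'xmin', 'ymin', 'xmax', 'ymax']
    with open(out_csv, 'w', newline='') as f:
        writer = csv.DictWriter(f, fieldnames=fields)
        writer.writeheader()
        for xml_file in glob.glob(os.path.join(folder, '*.xml')):
            root = ET.parse(xml_file).getroot()
            size = root.find('size')
            for obj in root.findall('object'):  # one row per bounding box
                box = obj.find('bndbox')
                writer.writerow({
                    'filename': root.find('filename').text,
                    'width': size.find('width').text,
                    'height': size.find('height').text,
                    'class': obj.find('name').text,
                    'xmin': box.find('xmin').text,
                    'ymin': box.find('ymin').text,
                    'xmax': box.find('xmax').text,
                    'ymax': box.find('ymax').text,
                })

for split in ('train', 'validation', 'test'):
    xml_to_csv(os.path.join('image', split), f'{split}_labels.csv')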

Convert *.xml to *.record

Script: generate_tfrecord.py

This script converts the XML annotation files to the TFRecord format.

Usage:

Creating the training-set *.record file:

python generate_tfrecord.py -x [PATH_TO_TRAIN-SET_FOLDER]/train_set -l [PATH_TO_label_map_FOLDER]/label_map.pbtxt -o [PATH_TO_TF]/train_set.record

Creating the validation-set *.record file:

python generate_tfrecord.py -x [PATH_TO_VALIDATION-SET_FOLDER]/validation_set -l [PATH_TO_label_map_FOLDER]/label_map.pbtxt -o [PATH_TO_TF]/validation_set.record

where;

  • PATH_TO_TRAIN-SET_FOLDER -- the directory where the training set is located.
  • PATH_TO_label_map_FOLDER -- the directory containing the label_map.pbtxt file.
  • PATH_TO_TF -- the directory where the generated *.record files should be stored.
  • PATH_TO_VALIDATION-SET_FOLDER -- the directory where the validation set is located.

This generates train_set.record and validation_set.record files.
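
For context, each *.record file stores one tf.train.Example per image. The sketch below shows the standard Object Detection API feature keys for one image with a single bird box; the image path and box coordinates are made up, and generate_tfrecord.py performs the same conversion from the annotation data.

# Illustrative sketch of one detection tf.train.Example
# (standard Object Detection API feature keys; values are made up).
import tensorflow as tf

def make_example(image_path, width, height, boxes):
    # boxes: list of (xmin, ymin, xmax, ymax) in absolute pixels
    with tf.io.gfile.GFile(image_path, 'rb') as f:
        encoded_jpg = f.read()
    feature = {
        'image/encoded': tf.train.Feature(bytes_list=tf.train.BytesList(value=[encoded_jpg])),
        'image/format': tf.train.Feature(bytes_list=tf.train.BytesList(value=[b'jpeg'])),
        'image/width': tf.train.Feature(int64_list=tf.train.Int64List(value=[width])),
        'image/height': tf.train.Feature(int64_list=tf.train.Int64List(value=[height])),
        # box coordinates are stored normalized to [0, 1]
        'image/object/bbox/xmin': tf.train.Feature(float_list=tf.train.FloatList(value=[b[0] / width for b in boxes])),
        'image/object/bbox/ymin': tf.train.Feature(float_list=tf.train.FloatList(value=[b[1] / height for b in boxes])),
        'image/object/bbox/xmax': tf.train.Feature(float_list=tf.train.FloatList(value=[b[2] / width for b in boxes])),
        'image/object/bbox/ymax': tf.train.Feature(float_list=tf.train.FloatList(value=[b[3] / height for b in boxes])),
        'image/object/class/text': tf.train.Feature(bytes_list=tf.train.BytesList(value=[b'bird'] * len(boxes))),
        'image/object/class/label': tf.train.Feature(int64_list=tf.train.Int64List(value=[1] * len(boxes))),
    }
    return tf.train.Example(features=tf.train.Features(feature=feature))

with tf.io.TFRecordWriter('train_set.record') as writer:
    example = make_example('images/train/bird_0001.jpg', 1366, 768, [(100, 200, 180, 260)])
    writer.write(example.SerializeToString())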

After the partitioning, CSV conversion, and TFRecord steps above, we copied the training and validation sets to the images folder found in C:\thesis\ssd_mobilenet\models\research\object_detection\images, and the TFRecord files were copied to the object_detection folder C:\thesis\ssd_mobilenet\models\research\object_detection.

Creating Label Map

A label map is required by TensorFlow for both the training and detection processes. Since our dataset has only one class, "bird", we created the label map below.

item {
  id: 1
  name: 'bird'
}

Model Training

Downloading the pre-trained model

We downloaded the SSD MobileNet-v2 model from the TensorFlow 2 Object Detection Model Zoo into the object_detection folder, after which our directory looked like this:

model/
|-- ...
|-- research/
|   |-- ....
|   |-- object_detection/
|   |   |-- ...
|   |   |-- ssd_mobilenet_v1_fpn_640x640_coco17_tpu-8/
|   |   |   |-- checkpoint/
|   |   |   |-- saved_model/
|   |   |   |-- pipeline.config
|   |   |
|   |   |-- .....
|   |-- ...
|-- ...

Configuring the Training Pipeline

The pipeline.config file is copied from the ssd_mobilenet_v1_fpn_640x640_coco17_tpu-8 folder and pasted into the training folder together with the label map. Our directory then looked like this:

model/
|-- ...
|-- research/
|   |-- ....
|   |-- object_detection/
|   |   |-- ...
|   |   |-- training/
|   |   |   |-- labelmap.pbtxt
|   |   |   |-- pipeline.config
|   |   |
|   |   |-- .....
|   |-- ...
|-- ...

Hyperparameters such as the number of epochs, batch size, step size, image dimensions, learning rate, weight decay, and IoU threshold were fine-tuned through several training and validation sessions until the best-performing parameters were obtained.
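
For illustration, the tuned settings live mainly in the train_config and train_input_reader blocks of pipeline.config. The abridged excerpt below shows where those fields sit; every value is a placeholder, not one of our final hyperparameters.

train_config {
  batch_size: 8                       # placeholder value
  fine_tune_checkpoint: "ssd_mobilenet_v1_fpn_640x640_coco17_tpu-8/checkpoint/ckpt-0"
  fine_tune_checkpoint_type: "detection"
  num_steps: 25000                    # placeholder value
  optimizer {
    momentum_optimizer {
      learning_rate {
        cosine_decay_learning_rate {
          learning_rate_base: 0.04    # placeholder value
          total_steps: 25000
        }
      }
    }
  }
}
train_input_reader {
  label_map_path: "training/labelmap.pbtxt"
  tf_record_input_reader {
    input_path: "train_set.record"
  }
}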

Training

To train our model, we used the model_main_tf2.py script provided by the Object Detection API, found in the object_detection folder. Training the best-performing model took 4 hours to complete. The operation was executed using the following command:

(thesis_models) C:\thesis\mobilenet\models\research\object_detection> python model_main_tf2.py --model_dir=training --pipeline_config_path=training/pipeline.config --num_train_steps=<NUMBER_OF_STEPS>
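
Training progress (loss curves, learning rate) can optionally be monitored with TensorBoard pointed at the model directory; this is a standard TensorFlow utility rather than a step from our original write-up.

(thesis_models) C:\thesis\mobilenet\models\research\object_detection> tensorboard --logdir=training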

Validation

For validation, we used the following command.

(thesis_models) C:\thesis\mobilenet\models\research\object_detection> python model_main_tf2.py --model_dir=training --pipeline_config_path=training/pipeline.config --checkpoint_dir=training

We used the MS COCO evaluation metrics to assess the validation performance.

Exporting Inference Graph

After training and validating our model, we exported the inference graph into a folder named "inference_graph", created inside the object_detection folder, using the exporter_main_v2.py script, again provided by the Object Detection API. We performed this using the command below.

(thesis_models) C:\thesis\mobilenet\models\research\object_detection> python exporter_main_v2.py --input_type image_tensor --pipeline_config_path training/pipeline.config --trained_checkpoint_dir training --output_directory inference_graph

After exporting the inference graph, we copied the label map into the saved_model folder to help us during model testing, and we had the directory layout below.

model/
|-- ...
|-- research/
|   |-- ....
|   |-- object_detection/
|   |   |-- ...
|   |   |-- inference_graph/
|   |   |   |-- checkpoint
|   |   |   |-- saved_model/
|   |   |   |    |--assets
|   |   |   |    |-- variables/
|   |   |   |    |-- labelmap.pbtxt
|   |   |   |    |-- saved_model.pb
|   |   |   |-- pipeline.config
|   |   |
|   |   |-- .....
|   |-- ...
|-- ...

Testing the Model

Using the bird_detection.py script in the post-processing folder of the repository, we tested our model on the images in the test set. The script was copied into the object_detection folder, and the following command was used to execute the operation.

(thesis_models) C:\thesis\mobilenet\models\research\object_detection> python bird_detection.py 
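
For readers who want to adapt this step, the sketch below shows the core of such a test script: loading the exported SavedModel and filtering detections by confidence. The paths and the 0.5 score threshold are illustrative; bird_detection.py is the script we actually used.

# Illustrative sketch of running the exported model on one test image
# (paths and the 0.5 threshold are assumptions, not values from bird_detection.py).
import numpy as np
import tensorflow as tf
from PIL import Image

detect_fn = tf.saved_model.load('inference_graph/saved_model')

image = np.array(Image.open('images/test/bird_0001.jpg'))
input_tensor = tf.convert_to_tensor(image)[tf.newaxis, ...]  # add batch dimension

detections = detect_fn(input_tensor)
scores = detections['detection_scores'][0].numpy()
boxes = detections['detection_boxes'][0].numpy()  # normalized [ymin, xmin, ymax, xmax]

for box, score in zip(boxes, scores):
    if score >= 0.5:  # keep confident detections only
        print(f'bird at {box} with confidence {score:.2f}')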

Below are some of the images with detected birds.

Note: All the other seven models followed the same procedure.

References

  1. TensorFlow-Object-Detection-API-Tutorial
  2. TFODCourse
  3. towardsdatascience
