Object Detection using Single Shot MultiBox Detector (SSD)

This is a Keras port of the SSD model architecture introduced by Wei Liu et al. in the paper "SSD: Single Shot MultiBox Detector". This implementation focuses on two points that were missing from the original implementation:

Training and inference can run on a modest discrete local GPU (for example, I used an Nvidia GTX 1660 on my local setup).
A universal notebook that performs all tasks: training, training on a custom dataset, inference on videos or images, etc.

APIs and Usage

Training API

These APIs are needed to train on any custom dataset. (Some are shared between training and inference.)

limit_gpu: bool, by default True

Enables TensorFlow's GPU memory limiting. This is needed when the job runs on a local setup. True: enable; False: disable.
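As a rough illustration of what limit_gpu might switch on, the sketch below sets the TF_FORCE_GPU_ALLOW_GROWTH environment variable, which TensorFlow reads to allocate GPU memory on demand instead of reserving it all at start-up. The helper name configure_gpu_memory is hypothetical, and using this variable is my assumption about the mechanism, not necessarily what this project does internally. It has to be set before TensorFlow initializes the GPU.

```python
import os


def configure_gpu_memory(limit_gpu: bool = True) -> None:
    """Hypothetical helper: request on-demand GPU memory allocation.

    Assumption: the installed TensorFlow honours TF_FORCE_GPU_ALLOW_GROWTH.
    Call this before TensorFlow touches the GPU.
    """
    if limit_gpu:
        os.environ["TF_FORCE_GPU_ALLOW_GROWTH"] = "true"
    else:
        # Remove the override so TensorFlow uses its default allocation.
        os.environ.pop("TF_FORCE_GPU_ALLOW_GROWTH", None)


configure_gpu_memory(limit_gpu=True)
```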

mode: str

What job to perform, either 'training' or 'inference'
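Since mode only accepts two values, a small guard can catch typos early. This is a minimal sketch; check_mode and VALID_MODES are hypothetical names, not part of this project.

```python
VALID_MODES = ("training", "inference")


def check_mode(mode: str) -> str:
    """Hypothetical validator: return the mode unchanged or raise ValueError."""
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {VALID_MODES}, got {mode!r}")
    return mode


job = check_mode("training")
```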

img_height: int, by default 300

Height of the input image

img_width: int, by default 300

Width of the input image

img_channels: int, by default 3

Number of color channels; 3 for RGB

Loading annotation

annotation_type: str, by default 'csv'

Format of annotation, either 'csv' or 'xml'
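To give an idea of what 'csv' annotations can look like, here is a minimal parsing sketch. The column layout (image_name, xmin, xmax, ymin, ymax, class_id) and the sample rows are assumptions made for illustration; check your own annotation file for the actual column order. The helper load_csv_annotations is hypothetical.

```python
import csv
import io
from collections import defaultdict

# Made-up sample data in an assumed column layout.
SAMPLE_CSV = """image_name,xmin,xmax,ymin,ymax,class_id
frame_0001.jpg,10,120,30,200,1
frame_0001.jpg,150,220,40,90,2
frame_0002.jpg,5,60,15,80,1
"""


def load_csv_annotations(handle):
    """Group boxes by image; each box is (class_id, xmin, ymin, xmax, ymax)."""
    boxes = defaultdict(list)
    for row in csv.DictReader(handle):
        box = (int(row["class_id"]), int(row["xmin"]), int(row["ymin"]),
               int(row["xmax"]), int(row["ymax"]))
        boxes[row["image_name"]].append(box)
    return dict(boxes)


annotations = load_csv_annotations(io.StringIO(SAMPLE_CSV))
print(annotations["frame_0001.jpg"])
```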

train_load_images_into_memory: bool, by default True

True: loads all images into memory. False: keeps them on disk, which is much slower.

validation_load_images_into_memory: bool, by default True

Same as above, for the validation images
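Before setting the two load-into-memory flags to True, it can help to estimate how much RAM the images would need. The sketch below assumes every image is held as an uncompressed uint8 array at the configured input size; images kept at their original resolution would need more. dataset_memory_bytes is a hypothetical helper.

```python
def dataset_memory_bytes(n_images: int, img_height: int = 300,
                         img_width: int = 300, img_channels: int = 3,
                         bytes_per_value: int = 1) -> int:
    """Rough RAM estimate for uncompressed images (uint8 by default)."""
    return n_images * img_height * img_width * img_channels * bytes_per_value


# 10,000 images at the default 300x300x3 size
print(dataset_memory_bytes(10_000) / 1e9)  # 2.7 (GB)
```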

Dataset location

train_img_dir: str

CSV file or path containing the training data

train_image_set_filename: str

Used only with the COCO dataset.

val_img_dir: str

Path to the validation data

val_annotation_dir: str

Path to the validation data annotations

val_image_set_filename: str

Used only with the COCO dataset.

classes: list

List of all classes

n_classes: int

Number of classes
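classes and n_classes have to agree with each other, so it is safer to derive one from the other. The sketch below uses made-up class names and assumes that class ID 0 is reserved for the background, which is a common convention in SSD implementations but should be verified against your annotations.

```python
classes = ["car", "truck", "pedestrian"]  # hypothetical class names
n_classes = len(classes)

# Assumption: ID 0 is the background class, so real classes start at 1.
id_to_name = {class_id: name for class_id, name in enumerate(classes, start=1)}

print(n_classes, id_to_name)
```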

Training Hyperparameters

l2_regularization: float, by default 0.5

L2 regularization penalty factor
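For intuition, an L2 penalty adds the factor times the sum of squared weights to the loss. The following is a plain-Python sketch of that formula, not the project's training code; l2_penalty is a hypothetical name and the weights are made up.

```python
def l2_penalty(weights, l2_regularization=0.5):
    """Return l2_regularization * sum of squared weights."""
    return l2_regularization * sum(w * w for w in weights)


print(l2_penalty([0.5, -1.0, 2.0]))  # 0.5 * (0.25 + 1.0 + 4.0) = 2.625
```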

pos_iou_threshold: float, by default 0.5

IoU (intersection over union) threshold used for localization
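IoU is the overlap area of two boxes divided by the area of their union. In SSD, a threshold like this typically decides whether a default (anchor) box counts as a positive match for a ground-truth box; I am assuming pos_iou_threshold plays that role here. The sketch below is illustrative, with hypothetical function names and boxes given as (xmin, ymin, xmax, ymax).

```python
def iou(box_a, box_b):
    """Intersection over union of two (xmin, ymin, xmax, ymax) boxes."""
    ix_min = max(box_a[0], box_b[0])
    iy_min = max(box_a[1], box_b[1])
    ix_max = min(box_a[2], box_b[2])
    iy_max = min(box_a[3], box_b[3])
    # Width and height are clamped at zero when the boxes do not overlap.
    inter = max(0, ix_max - ix_min) * max(0, iy_max - iy_min)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_positive_match(anchor, ground_truth, pos_iou_threshold=0.5):
    """Assumed rule: a match is positive when IoU reaches the threshold."""
    return iou(anchor, ground_truth) >= pos_iou_threshold


print(is_positive_match((0, 0, 10, 10), (2, 0, 12, 10)))  # True  (IoU = 80/120)
print(is_positive_match((0, 0, 10, 10), (5, 0, 15, 10)))  # False (IoU = 50/150)
```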

learning_rate: float, by default 0.001

Learning rate used for training

steps_per_epoch: int

Number of steps to take per epoch

batch_size: int

Number of samples in each training batch
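steps_per_epoch and batch_size are usually chosen together so that one epoch covers the whole training set once. A minimal sketch of that relationship follows; the helper name and the sample count are hypothetical.

```python
import math


def compute_steps_per_epoch(n_train_samples: int, batch_size: int) -> int:
    """Number of batches needed to see every training sample once."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    return math.ceil(n_train_samples / batch_size)


print(compute_steps_per_epoch(1000, 32))  # 32 (31 full batches plus one partial)
```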

epochs: int

Number of epochs to train for

Saving training assets

weight_save_path: str

Path where the trained weights are saved

csv_log_save_path: str

Path where the training-monitor CSV log is saved
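To show the kind of file csv_log_save_path could end up holding, here is a stdlib-only sketch of a per-epoch CSV log, similar in spirit to what a Keras CSV logging callback produces. The column names and numbers are made up, and write_training_log is a hypothetical helper; an in-memory buffer stands in for the real file.

```python
import csv
import io


def write_training_log(handle, history):
    """Write one CSV row per epoch; history is a list of same-keyed dicts."""
    writer = csv.DictWriter(handle, fieldnames=list(history[0].keys()))
    writer.writeheader()
    writer.writerows(history)


# Made-up metrics for two epochs.
history = [
    {"epoch": 0, "loss": 4.2, "val_loss": 4.5},
    {"epoch": 1, "loss": 3.1, "val_loss": 3.6},
]
buffer = io.StringIO()
write_training_log(buffer, history)
print(buffer.getvalue())
```

With a real path, open the file with `open(csv_log_save_path, "w", newline="")` and pass the handle in the same way.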

Inference API

These APIs are used for inference jobs.

limit_gpu: bool, by default True

Enables TensorFlow's GPU memory limiting. This is needed when the job runs on a local setup. True: enable; False: disable.

mode: str

What job to perform, either 'training' or 'inference'

img_height: int, by default 300

Height of the input image

img_width: int, by default 300

Width of the input image

img_channels: int, by default 3

Number of color channels; 3 for RGB

weights_path: str

Path to the trained weights to load

confidence_threshold: float, by default 0.5

Confidence threshold for selecting predictions
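The effect of confidence_threshold can be pictured as a simple filter over the decoded detections. In this sketch each prediction is assumed to be a tuple (class_id, confidence, xmin, ymin, xmax, ymax), and detections at or above the threshold are kept; both the layout and the inclusive comparison are assumptions, and filter_predictions is a hypothetical name.

```python
def filter_predictions(predictions, confidence_threshold=0.5):
    """Keep predictions whose confidence (index 1) reaches the threshold."""
    return [p for p in predictions if p[1] >= confidence_threshold]


# Made-up detections: (class_id, confidence, xmin, ymin, xmax, ymax)
predictions = [
    (1, 0.92, 10, 20, 110, 220),
    (2, 0.31, 50, 60, 90, 100),
    (1, 0.50, 5, 5, 40, 40),
]
kept = filter_predictions(predictions, confidence_threshold=0.5)
print(len(kept))  # 2
```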

Saving Inference Assets

predicted_frames_export_path: str

Frames with predicted bounding boxes are saved here

video_output_path: str

The video with predicted bounding boxes is saved here
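One way the exported frames could be stitched into the output video is with FFmpeg. The sketch below only builds the command line and does not run it. The frame naming pattern frame_%06d.jpg, the 30 fps default, and the helper name are assumptions, and the project may assemble the video differently (for example with OpenCV). The libx264 encoder must be available in your FFmpeg build.

```python
from pathlib import Path


def build_ffmpeg_command(predicted_frames_export_path, video_output_path, fps=30):
    """Build (but do not run) an FFmpeg command that joins frames into a video."""
    # Assumed naming scheme: frame_000001.jpg, frame_000002.jpg, ...
    frame_pattern = str(Path(predicted_frames_export_path) / "frame_%06d.jpg")
    return [
        "ffmpeg", "-y",              # overwrite the output file if it exists
        "-framerate", str(fps),      # input frame rate of the image sequence
        "-i", frame_pattern,
        "-c:v", "libx264",           # H.264 video encoder
        "-pix_fmt", "yuv420p",       # pixel format most players can decode
        str(video_output_path),
    ]


command = build_ffmpeg_command("frames", "result.mp4")
print(" ".join(command))
```

The resulting list can be passed to `subprocess.run(command, check=True)` once the frames exist.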

Technology Used

Core Technology:

Python, Keras (TensorFlow 1.14), OpenCV, FFmpeg, Nvidia CUDA

Tools:

MLflow (experiment tracking), DVC (data version control), Git (project version control)


Akashdesarda/Object-Detection-using-SSD
