Weapons Detection in Real Time Surveillance Videos

This project aims to minimize police response time by detecting weapons in a live CCTV camera feed and alerting the police as soon as any weapon is detected. The project focuses primarily on guns.

Code and Resources Used

Language: Python 3.8

Libraries: pandas, numpy, csv, re, cv2, os, glob, io, tensorflow, PIL, shutil, urllib, tarfile, files (google.colab), OrderedDict (collections), ElementTree (xml)

Dataset: Pistol Dataset by University of Granada

Step 1: Gathering Images and Labels

  1. Download the images from the above link. Collect at least 50 images per class; the more classes you have, the more images you will need.
  2. Include images with random objects in the background.
  3. Cover various background conditions such as dark, light, indoor, outdoor, etc.
  4. Save all the images in a folder called images; all images should be in .jpg format.

Step 2: Labelling Images

  1. Using LabelImg, draw bounding boxes around the guns in the images.
  2. The labels will be in PascalVOC format. Each image will have one .xml file that holds its labels. If an image contains more than one class or more than one label, that .xml file will include them all. A sketch of reading one of these files follows this list.
  3. Once every image is annotated, the labelling is done.
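
To check what LabelImg produced, you can parse an annotation with ElementTree (one of the libraries listed above). This is a minimal sketch; the file name image_1.xml is illustrative.

import xml.etree.ElementTree as ET

tree = ET.parse('annotations/image_1.xml')
root = tree.getroot()
for member in root.findall('object'):
    label = member.find('name').text  # e.g. 'pistol'
    box = member.find('bndbox')
    xmin = int(box.find('xmin').text)
    ymin = int(box.find('ymin').text)
    xmax = int(box.find('xmax').text)
    ymax = int(box.find('ymax').text)
    print(label, (xmin, ymin, xmax, ymax))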

Step 3: Setting the Data Systematically

  1. Mount your Google Drive in Google Colab (a sketch follows the directory tree below).
  2. The directory should be as follows:

object_detection
└── data
    ├── images
    │   ├── image_1.jpg
    │   ├── image_2.jpg
    │   └── ...
    └── annotations
        ├── image_1.xml
        ├── image_2.xml
        └── ...
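
Mounting Drive uses the google.colab helper from the library list; the mount point /content/gdrive is a common choice, not a requirement.

from google.colab import drive

# prompts for an authorization code on first run
drive.mount('/content/gdrive')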

  3. Split the labels (i.e. only the .xml files) into train_labels and test_labels folders:

object_detection
└── data
    ├── images
    │   ├── image_1.jpg
    │   └── ...
    ├── annotations
    │   ├── image_1.xml
    │   └── ...
    ├── train_labels   // contains the labels only
    │   ├── image_1.xml
    │   └── ...
    └── test_labels    // contains the labels only
        ├── image_50.xml
        └── ...

Step 4: Import and Install the Required Packages

  1. Install PIL and Cython, as they are not pre-installed in Google Colab (see the sketch below).
  2. Import all the packages that were listed above.
  3. The TensorFlow version should be 1.15.0 for this project.
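
In a Colab cell this might look as follows; pinning the runtime with the %tensorflow_version magic is an assumption based on the 1.15.0 requirement above.

!pip install Cython pillow
%tensorflow_version 1.x   # select the TF 1.x runtime in Colab

import tensorflow as tf
print(tf.__version__)     # expect 1.15.x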

Step 5: Preprocessing the Images and Labels

  1. Create two CSV files from the .xml files, one each for the train_labels and test_labels folders (a conversion sketch follows the tree below).
  2. Create a .pbtxt file that will contain the label map for each class.
  3. The working directory should be as follows:

object_detection/
└── data/
    ├── images/
    │   └── ...
    ├── annotations/
    │   └── ...
    ├── train_labels/
    │   └── ...
    ├── test_labels/
    │   └── ...
    ├── label_map.pbtxt
    ├── test_labels.csv
    └── train_labels.csv
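
One way to build the CSVs and the label map, sketched with glob, ElementTree and pandas from the library list. The column names follow the usual TFRecord-generation convention, and the single 'pistol' class matches this project; adapt both if your data differs.

import glob
import pandas as pd
import xml.etree.ElementTree as ET

def xml_to_csv(folder, out_csv):
    rows = []
    for xml_file in glob.glob(folder + '/*.xml'):
        root = ET.parse(xml_file).getroot()
        size = root.find('size')
        for member in root.findall('object'):
            box = member.find('bndbox')
            rows.append((root.find('filename').text,
                         int(size.find('width').text),
                         int(size.find('height').text),
                         member.find('name').text,
                         int(box.find('xmin').text), int(box.find('ymin').text),
                         int(box.find('xmax').text), int(box.find('ymax').text)))
    cols = ['filename', 'width', 'height', 'class',
            'xmin', 'ymin', 'xmax', 'ymax']
    pd.DataFrame(rows, columns=cols).to_csv(out_csv, index=None)

xml_to_csv('data/train_labels', 'data/train_labels.csv')
xml_to_csv('data/test_labels', 'data/test_labels.csv')

# label map: one item per class, ids start at 1
with open('data/label_map.pbtxt', 'w') as f:
    f.write("item {\n  id: 1\n  name: 'pistol'\n}\n")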

Step 6: Downloading the TensorFlow Model

  1. The TensorFlow models repository contains the Object Detection API, so simply clone it from the official repository.
  2. Compile the Protocol Buffers, and add the directories models/research/ and models/research/slim to the PYTHONPATH variable.
  3. Run a quick test to confirm that the model builder is working properly (see the sketch below).
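
In Colab these steps might look as follows; the /content paths are an assumption about where the clone lands.

!git clone https://github.com/tensorflow/models.git
%cd models/research

# compile the protocol buffers used by the API
!protoc object_detection/protos/*.proto --python_out=.

# make the API importable
import os
os.environ['PYTHONPATH'] += ':/content/models/research:/content/models/research/slim'

# quick sanity test of the model builder
!python object_detection/builders/model_builder_test.py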

Step 7: Generating TFRecords

  1. The CSV file names must match: train_labels.csv and test_labels.csv.
  2. The current directory should be object_detection/models/research.
  3. Add your custom object text in the class_text_to_int function below by changing the row_label variable (this is the text that will appear on the detected object). Add more labels if you have more than one object.
  4. Check that the path to the data/ directory is the same as data_base_url below.
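
A sketch of the class_text_to_int helper referred to in point 3; the 'pistol' label matches this project's single class, and the commented lines show where extra classes would go.

def class_text_to_int(row_label):
    # map each label string to the integer id used in label_map.pbtxt
    if row_label == 'pistol':
        return 1
    # elif row_label == 'knife':
    #     return 2
    else:
        return None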

Step 8: Selecting and Downloading a Pre-Trained Model

  1. A pre-trained model is simply a model that has already been trained on another dataset; it has seen thousands or millions of images and objects.
  2. COCO stands for Common Objects in Context; it is a large-scale object detection dataset.
  3. Choose a model that has a low inference time (ms) with a relatively high mAP on COCO. The one we are using is ssd_mobilenet_v2_coco. Check the other models from here. You could use any pre-trained model you prefer, but I would suggest experimenting with SSD (Single Shot Detector) models first, as they perform faster than any type of RCNN on real-time video.
  4. Download the pretrained model (a download sketch follows the tree below). While training, the model gets autosaved every 600 seconds by default. The logs and graphs, such as mAP, loss and AR, also get saved constantly.
  5. The working directory at this point:

object_detection/
├── data/
│   ├── images/
│   │   └── ...
│   ├── annotations/
│   │   └── ...
│   ├── train_labels/
│   │   └── ...
│   ├── test_labels/
│   │   └── ...
│   ├── label_map.pbtxt
│   ├── test_labels.csv
│   ├── train_labels.csv
│   ├── test_labels.records
│   └── train_labels.records
└── models/
    └── research/
        ├── training/
        │   └── ...
        ├── pretrained_model/
        │   ├── frozen_inference_graph.pb
        │   └── ...
        └── ...
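
Downloading and unpacking the pretrained model with urllib and tarfile from the library list; the model zoo URL and the 2018_03_29 snapshot name are assumptions based on the TF1 detection model zoo.

import urllib.request
import tarfile
import shutil

MODEL = 'ssd_mobilenet_v2_coco_2018_03_29'
URL = 'http://download.tensorflow.org/models/object_detection/' + MODEL + '.tar.gz'

urllib.request.urlretrieve(URL, MODEL + '.tar.gz')
with tarfile.open(MODEL + '.tar.gz') as tar:
    tar.extractall()

# rename to match the directory tree above
shutil.move(MODEL, 'pretrained_model')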

Step 9: Configuring the Training Pipeline

  1. ssd_mobilenet_v2_coco.config is the config file for the pretrained model we are using.
  2. View the content of the sample config file.
  3. Copy the content of the config file.
  4. Edit the following (a patching sketch follows the directory tree below):
  • In model {} > ssd {}: change num_classes to the number of classes you have.
  • In train_config {}: change fine_tune_checkpoint to the checkpoint file path.
  • In train_input_reader {}: set the paths to the train_labels.record and the label map pbtxt file.
  • In eval_input_reader {}: set the paths to the test_labels.record and the label map pbtxt file.
  • In model {} > ssd {} > box_predictor {}: set use_dropout to true. This helps to counter overfitting.
  • In eval_config {}: set num_examples to the number of testing images you have, and remove max_evals to evaluate indefinitely.
  5. Final full working directory:

object_detection/
├── data/
│   ├── images/
│   │   └── ...
│   ├── annotations/
│   │   └── ...
│   ├── train_labels/
│   │   └── ...
│   ├── test_labels/
│   │   └── ...
│   ├── label_map.pbtxt
│   ├── test_labels.csv
│   ├── train_labels.csv
│   ├── test_labels.records
│   └── train_labels.records
└── models/
    └── research/
        ├── fine_tuned_model/
        │   ├── frozen_inference_graph.pb
        │   └── ...
        ├── pretrained_model/
        │   ├── frozen_inference_graph.pb
        │   └── ...
        ├── object_detection/
        │   ├── utils/
        │   ├── samples/
        │   │   ├── configs/
        │   │   │   ├── ssd_mobilenet_v2_coco.config
        │   │   │   ├── rfcn_resnet101_pets.config
        │   │   │   └── ...
        │   │   └── ...
        │   ├── export_inference_graph.py
        │   ├── model_main.py
        │   └── ...
        ├── training/
        │   ├── events.out.tfevents.xxxxx
        │   └── ...
        └── ...
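
A sketch of applying those edits programmatically with re from the library list; the file locations are assumptions for this layout, and you could equally edit the copied config by hand.

import re

with open('object_detection/samples/configs/ssd_mobilenet_v2_coco.config') as f:
    cfg = f.read()

cfg = re.sub(r'num_classes: \d+', 'num_classes: 1', cfg)   # one class: pistol
cfg = re.sub(r'fine_tune_checkpoint: ".*?"',
             'fine_tune_checkpoint: "pretrained_model/model.ckpt"', cfg)
cfg = cfg.replace('use_dropout: false', 'use_dropout: true')  # counter overfitting

with open('training/model.config', 'w') as f:
    f.write(cfg)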

Step 10: TensorBoard (optional)

  1. TensorBoard is where we can visualize everything that happens during training: you can monitor the loss, mAP and AR.
  2. We use ngrok to access TensorBoard from Colab (a sketch follows).
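
A common Colab recipe for this, sketched below; the ngrok download URL, the training/ log directory and the ports are assumptions.

# download and unpack the ngrok client
!wget -q https://bin.equinox.io/c/4VmDzA7iaHb/ngrok-stable-linux-amd64.zip
!unzip -o ngrok-stable-linux-amd64.zip

LOG_DIR = 'training/'

# start TensorBoard and the tunnel in the background
get_ipython().system_raw(
    'tensorboard --logdir {} --host 0.0.0.0 --port 6006 &'.format(LOG_DIR))
get_ipython().system_raw('./ngrok http 6006 &')

# print the public URL to open in a browser
!curl -s http://localhost:4040/api/tunnels | python3 -c \
    "import sys, json; print(json.load(sys.stdin)['tunnels'][0]['public_url'])"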

Step 11: Training the Model

  1. model_main.py runs the training process.
  2. pipeline_config_path=Path/to/config/file/model.config
  3. model_dir=Path/to/training/
  4. If the kernel dies, training will resume from the last checkpoint, as long as you saved the training/ directory somewhere persistent, e.g. Google Drive.
  5. If you change these paths, make sure there is no space between the equal sign = and the path. A sketch of the invocation follows.
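
The invocation might look like this; the training/model.config path is an assumption carried over from Step 9.

!python object_detection/model_main.py \
    --pipeline_config_path=training/model.config \
    --model_dir=training/ \
    --alsologtostderr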

Step 12: Export the Trained Model

  1. The model saves a checkpoint every 600 seconds while training, keeping up to 5 checkpoints; as new files are created, older files are deleted.
  2. Execute export_inference_graph.py to convert the model to a frozen graph, frozen_inference_graph.pb, that we can use for inference (see the sketch below).
  3. This frozen model can't be used to resume training. However, saved_model.pb gets exported as well, and it can be used to resume training as it has all the weights.
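
A sketch of the export step; XXXX stands for the number of the latest checkpoint in training/ and must be filled in.

!python object_detection/export_inference_graph.py \
    --input_type=image_tensor \
    --pipeline_config_path=training/model.config \
    --trained_checkpoint_prefix=training/model.ckpt-XXXX \
    --output_directory=fine_tuned_model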

Step 13: Webcam Inference

  1. To run inference with the model against your local machine's webcam, use TensorFlow and cv2.
  2. You can run the sketch below from a Jupyter notebook or by creating a .py file. However, change PATH_TO_FROZEN_GRAPH, PATH_TO_LABEL_MAP and NUM_CLASSES.
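
A minimal webcam-inference sketch for the exported TF1 frozen graph. The three ALL_CAPS values are placeholders you must adapt; the label map and visualization helpers come from the Object Detection API utils.

import cv2
import numpy as np
import tensorflow as tf
from object_detection.utils import label_map_util
from object_detection.utils import visualization_utils as vis_util

PATH_TO_FROZEN_GRAPH = 'fine_tuned_model/frozen_inference_graph.pb'
PATH_TO_LABEL_MAP = 'data/label_map.pbtxt'
NUM_CLASSES = 1

# load the frozen graph
detection_graph = tf.Graph()
with detection_graph.as_default():
    graph_def = tf.GraphDef()
    with tf.gfile.GFile(PATH_TO_FROZEN_GRAPH, 'rb') as f:
        graph_def.ParseFromString(f.read())
    tf.import_graph_def(graph_def, name='')

# load the label map
label_map = label_map_util.load_labelmap(PATH_TO_LABEL_MAP)
categories = label_map_util.convert_label_map_to_categories(
    label_map, max_num_classes=NUM_CLASSES, use_display_name=True)
category_index = label_map_util.create_category_index(categories)

cap = cv2.VideoCapture(0)  # default webcam
with detection_graph.as_default(), tf.Session(graph=detection_graph) as sess:
    image_tensor = detection_graph.get_tensor_by_name('image_tensor:0')
    boxes = detection_graph.get_tensor_by_name('detection_boxes:0')
    scores = detection_graph.get_tensor_by_name('detection_scores:0')
    classes = detection_graph.get_tensor_by_name('detection_classes:0')
    while True:
        ret, frame = cap.read()
        if not ret:
            break
        # the graph expects RGB; OpenCV captures BGR
        rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
        (b, s, c) = sess.run([boxes, scores, classes],
                             feed_dict={image_tensor: np.expand_dims(rgb, 0)})
        vis_util.visualize_boxes_and_labels_on_image_array(
            frame, np.squeeze(b), np.squeeze(c).astype(np.int32),
            np.squeeze(s), category_index,
            use_normalized_coordinates=True, line_thickness=4)
        cv2.imshow('Weapon detection', frame)
        if cv2.waitKey(1) & 0xFF == ord('q'):
            break
cap.release()
cv2.destroyAllWindows()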

SSD MobileNet V2 (Single Shot MultiBox Detector)

  • This model is a single-stage object detection model that goes straight from image pixels to bounding box coordinates and class probabilities. The model architecture is based on inverted residual structure where the input and output of the residual block are thin bottleneck layers as opposed to traditional residual models. Moreover, nonlinearities are removed from intermediate layers and lightweight depthwise convolution is used. This model is part of the Tensorflow object detection API.
  • SSD is a popular algorithm in object detection. It’s generally faster than Faster RCNN. In this post, I will give you a brief about what is object detection, what is tenforflow API, what is the idea behind neural networks and specifically how SSD architecture works.
  • The SSD architecture is a single convolution network that learns to predict bounding box locations and classify these locations in one pass. Hence, SSD can be trained end-to-end. The SSD network consists of base architecture (MobileNet in this case) followed by several convolution layers: ssd
  • By using SSD, we only need to take one single shot to detect multiple objects within the image, while regional proposal network (RPN) based approaches such as R-CNN series that need two shots, one for generating region proposals, one for detecting the object of each proposal. Thus, SSD is much faster compared with two-shot RPN-based approaches.

Results

[Result images 1-5]

References

  1. https://stats.stackexchange.com/questions/205150/how-do-bottleneck-architectures-work-in-neural-networks
  2. https://medium.com/@techmayank2000/object-detection-using-ssd-mobilenetv2-using-tensorflow-api-can-detect-any-single-class-from-31a31bbd0691
  3. https://resources.wolframcloud.com/NeuralNetRepository/resources/SSD-MobileNet-V2-Trained-on-MS-COCO-Data
  4. https://heartbeat.fritz.ai/real-time-object-detection-using-ssd-mobilenet-v2-on-video-streams-3bfc1577399c
  5. https://towardsdatascience.com/detailed-tutorial-build-your-custom-real-time-object-detector-5ade1017fd2d
