Find more of my deep learning models on the following links:
- Emoji Generator
- Face Recognition
- Machine Translation
- Neural Style Transfer
- Trigger Word Detection
- Word Vector Representation
More ML/DL models in the following links:
A simple autonomous driving implementation using python and jupyter notebook. In this notebook, you will learn about object detection using the very powerful YOLO model. Many of the ideas in this notebook are described in the two YOLO papers:
Redmon et al., 2016 and Redmon and Farhadi, 2016.
Please download the dataset from the following link, and put it in the model_data
folder.
Download Dataset
- Use object detection on a car detection dataset using YOLO algorithm.
- Deal with bounding boxes.
- YOLO is a state-of-the-art object detection model that is fast and accurate.
- It runs an input image through a CNN which outputs a 19x19x5x85 dimensional volume.
- The encoding can be seen as a grid where each of the 19x19 cells contains information about 5 boxes.
- You filter through all the boxes using non-max suppression. Specifically:
- Score thresholding on the probability of detecting a class to keep only accurate (high probability) boxes
- Intersection over Union (IoU) thresholding to eliminate overlapping boxes.
- Because training a YOLO model from randomly initialized weights is non-trivial and requires a large dataset as well as lot of computation, we used previously trained model parameters in this notebook.
The ideas presented in this notebook came primarily from the two YOLO papers. The implementation here also took significant inspiration and used many components from Allan Zelener's GitHub repository. The pre-trained weights used in this exercise came from the official YOLO website.
- Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi - You Only Look Once: Unified, Real-Time Object Detection (2015)
- Joseph Redmon, Ali Farhadi - YOLO9000: Better, Faster, Stronger (2016)
- Allan Zelener - YAD2K: Yet Another Darknet 2 Keras
- The official YOLO website https://pjreddie.com/darknet/yolo/
The Drive.ai Sample Dataset (provided by drive.ai) is licensed under a Creative Commons Attribution 4.0 International License. We are grateful to Brody Huval, Chih Hu and Rahul Patel for providing this data.