Post-Processing Model of Video Object Recognition using IMU

a post-processing model using Inertial Measurement Unit (IMU) to enhance accuracy of object recognition

uses Kalman Filter and Intersection over Union model to update the recognition results with IMU information, compatible with any existing object recognition algorithms such as YOLO or Detectron

used on Vision Prosthetic in Paradiso Lab, Brown University

Read about the details of this work in this paper here.

Installation

Clone this repo to your local machine using https://github.com/paulzhou69/object-recognition-imu.git
install the dependencies using requirements.txt

$ git clone https://github.com/paulzhou69/object-recognition-imu.git
$ pip install -r requirements.txt

If you are using YOLO as your object detection algorithm, please follow the README here to set up YOLO. Please note that the YOLO in this repo is a bit different from the official release , as the YOLO in this repo is better integrated with the post-processing model.

If you are nto using YOLO as your object detection algorithm, please build the algorithm of your choice under this folder, and change the follwing scripts:

object_dict.py. change the object_to_index dictionary to fit the classes of your detection algorithm
iou_update.py & kf_update.py. Change the process_img() function to use your algorithm's detection function
observation.py. Please parse the detection result of a single frame into the form

    a list of objects: [obj1, obj2, obj3, ..., obj N], where N is the total number of bounding boxes in the frame
    an object is a list of full probability distribution over all classes: obj = [class1, class2, ..., class M], where M is the total number of classes your algorithm can recognize
    a class is a tuple: class = (confidence, (x, y, w, h)), 
    where confidence is the confidence of the object being that class, x, y is the x, y coordinates of the center of the bounding box of the object, w, h are the width and heighht of the bounding box

If you are not using a conventional IMU/camera coordinate system orientation, please adjust the function compute_displacement_pr() in imu/displacement.py

Features

updates the object recognition result (class confidences & bounding box locations) using either Kalman Filter model or a (generalized) Intersection over Union model
save the updated results in a new file
included functions to draw plots to compare the confidences/bounding box location before and after the post-processing model is applied

Usage

first install your own object recognition algorithm

then, specify your camera, IMU, and other parameters in config.py

make sure to put your images and IMU raw data in the input/image and input/imu folder. The default file format for IMU data is csv. If not, please change the script imu/raw_data.py to take in the file format of your choosing. Then run python main.py csv to process the raw data

the bulk of the model can be access by running main.py

to see what arguments main.py takes in, run python main.py --help

###Example 1:

running the Kalman Filter model and process input images and IMU info

$ python main.py kf

you should see the terminal loading the images and the processing steps if you set debug = True in config.py

the results of the updated observation will be saved in files in kf_output/, where the read.csv files are for human reading the outputs, and the store.csv files are stored in json format and available for further processing

###Example 2:

running the IoU model and process input images and IMU info

$ python main.py iou

to run the IoU with generalized IoU, use the tag --giou or -g

$ python main.py iou --giou

###Example 3: to compare the updated results with original results output by your object detection algorithm, use the --compare or -c tag, but remember to specify the model you used

$ python main.py iou --compare
$ python main.py kf --compare

you should see an image pop up:

to customized your own compare settings, please edit iou_output/iou_compare.py, iou_output/giou_compare.py, and kf_output/kf_compare.py

Contributing

To get started...

Step 1

Option 1
- 🍴 Fork this repo!
Option 2
- 👯 Clone this repo to your local machine using https://github.com/paulzhou69/object-recognition-imu.git

Step 2

HACK AWAY! 🔨🔨🔨

Step 3

🔃 Create a new pull request using https://github.com/paulzhou69/object-recognition-imu/compare .

Team (Paradiso Lab, Brown University)

Contributors/People

FAQ

Feel free to open issues :))

Support

Reach out to me at one of the following places!

Leave comments on my website: https://paulzhou69.github.io/

License

MIT license

Name		Name	Last commit message	Last commit date
Latest commit History 96 Commits
assets		assets
darknet		darknet
imu		imu
input		input
iou		iou
iou_output		iou_output
kf		kf
kf_output		kf_output
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
backtrace_update.py		backtrace_update.py
config.py		config.py
iou_update.py		iou_update.py
kf_update.py		kf_update.py
main.py		main.py
object_dict.py		object_dict.py
observation_parser.py		observation_parser.py
requirements.txt		requirements.txt
update_functions.py		update_functions.py

License

zhangfaquan/object-recognition-imu

Folders and files

Latest commit

History

Repository files navigation

Post-Processing Model of Video Object Recognition using IMU

Table of Contents

Installation

Features

Usage

Contributing

Step 1

Step 2

Step 3

Team (Paradiso Lab, Brown University)

FAQ

Support

License

About

Resources

License

Stars

Watchers

Forks

Languages