Skip to content

Mask RCNN for Toyota Smarthome dataset people detection

Notifications You must be signed in to change notification settings

DAIGroup/mask_rcnn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Mask RCNN detections for Toyota Smarthome dataset

This project uses Matterport's implementation of Mask RCNN to retrieve bounding boxes for detected humans in the Toyota Smarthome dataset, as described in our paper (Climent-Pérez et al. 2021, https://doi.org/10.3390/s21031005).

These calculated bounding boxes are then used in the DAIGroup/i3d project to extract crops of the images around the detections.

Download and reproducibility

You have two options, to clone this project and run it (you will need a copy of the dataset), or to download the detections that this network produced.

Pre-calculated bounding boxes

You can dowload them from this Google Drive link. [download].

There are two directories within the downloaded .tgz file.

  • mp4_mrcnn_bbox contains the bounding boxes calculated with Mask RCNN as is, that is, the raw version.
  • mp4_mrcnn_bbox_nogaps contains versions of some videos that had gaps in detection, of <60 frames, that have been filled-in with the preprocessing scripts found in the companion DAIGroup/i3d project (here).

The preprocessing scripts in the DAIGroup/i3d project will take the best available file: that is, if the detection file is only present in the mp4_mrcnn_bbox directory it will take that, but if a corrected version is available in the mp4_mrcnn_bbox_nogaps directory it will take that instead.

NOTE: If using them in your research, please cite (Climent-Pérez et al. 2021) below.

References

  • (Das et al. 2019) Das, S., Dai, R., Koperski, M., Minciullo, L., Garattoni, L., Bremond, F., & Francesca, G. (2019). Toyota smarthome: Real-world activities of daily living. In Proceedings of the IEEE International Conference on Computer Vision (pp. 833-842).
  • (Climent-Pérez et al. 2021) Climent-Pérez, P., Florez-Revuelta, F. (2021). Improved action recognition with Separable spatio-temporalattention using alternative Skeletal and Video pre-processing, Sensors 21(3), 1005. DOI: https://doi.org/10.3390/s21031005.

Copyright of Mask RCNN implementation

Copyright (c) 2017 Matterport, Inc.

Licensed under the MIT License (see LICENSE for details) Written by Waleed Abdulla

About

Mask RCNN for Toyota Smarthome dataset people detection

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages