Skip to content

The official codebase for the research "Densely Populated Regions Face Masks Localization and Classification Using Deep Learning Models"

Notifications You must be signed in to change notification settings

barbatoz0220/Densely-populated-FMD

Repository files navigation

Densely Populated Regions Face Masks Localization and Classification Using Deep Learning Models

Anh Pham H.N., Linh Phung K., Vi Le T.T. and Nga Ly T.

Introduction

Welcome to the official codebase for the research "Densely Populated Regions Face Masks Localization and Classification Using Deep Learning Models" by our team from International University - VNU-HCM with:

  • @barbatoz0220 - Anh Pham H.N. as 1st author
  • @indigoYoshimaru - Linh Phung K. as 2nd author
  • @lttvi - Vi Le T.T. as 3rd author
  • and Dr. Nga Ly T. as supervisor

Overview

Over the last two years, the correct wearing of facial masks in public is still a relevant matter in the fight against the COVID-19 pandemic. A popular approach that helps regulate this by global researchers is the application of Deep Learning. This paper will contribute to the literature in two main aspects:

  1. We first compare and contrast the application of four pre-trained models in classification of the specific face mask dataset.
  2. We follow with the proposal of a face mask detector model which combines the state-of-the-art RetinaFace to the localization of faces in densely populated regions and the most prominent classifier obtained from the previous step to divide the faces into three categories: correctly-worn, incorrectly-worn and no-masks-worn.

Results

Performance metrics from the test phase have shown that our detector were one to achieve the best accuracy among all works compared with 94,59% on one test dataset and a less satisfactory 69.6% on another due to certain characteristics of the set. A worth-noting feature is how it can perform rather well in densely populated detection of masks wearing following the validity of how the masks are worn: correctly, incorrectly and without. Although there are still certain limitations to some of the class variance, the performance metrics have proved our effectiveness with the combination of ResNet50V1 and RetinaFace. It is certainly possible for our model in the future to be better optimized, evaluated and ultimately deployed into public usage through real-world applications.

A few images showing the model's ability to detect and classify in crowded areas:

  • Densely_populated1
  • Densely_populated2
  • Densely_populated3

Faces classified as Incorrectly-worn by the proposed model:

  • incorrect1
  • incorrect2
  • incorrect3
  • incorrect4

More results are available at:

Contents

In this repository, you will be able to find:

  1. A comparison between 4 backbone models for the classification stage (MobileNetV1, MobileNetV2, ResNet50V1, ResNet50V2) using the Kaggle12K dataset in the folder exported_model Results on the accuracy and loss of each are expressed through the following figures:
  • MobileNetV1 and MobileNetV2
  • ResNetV1 and ResNetV2
  1. A comparison between our proposed model (with ResNet50V2 implemented) and another two (AIZOO and Neuralet) in the file model_comparison.ipynb. The following figure illustrates each model's performance on the two test sets. model_comparison
  2. The Google Colab file for our proposed model is available at: Open In Colab

More documentation and updates will be coming soon

About

The official codebase for the research "Densely Populated Regions Face Masks Localization and Classification Using Deep Learning Models"

Topics

Resources

Stars

Watchers

Forks