Video face recognition based on RetinaFace and FaceNet

Catalog

Notes
Environment
Experiment_Results
How2predict

Notes

The library contains two networks, retinaface and facenet, they use different weights.
When using the networks, we need pay attention to the choice of weights and check the paths.

Environment

python==3.9
pytorch==1.11.0
opencv==4.5.5.64

Experiment_Results

RetinaFace

Dataset	Input Image Size	Easy	Medium	Hard
Widerface-Train	1280x1280	89.76%	86.96%	74.69%

FacaNet

Dataset	Input Image Size	Accuracy
CASIA-WebFace	160x160	98.23%

How2predict

The project comes with its own backbone for the retinaface model and facenet model of mobilenet. 2.. In the retinaface.py file, modify the model_path and backbone in the following section to make them correspond to the trained files.

_defaults = {
    "retinaface_model_path" : 'model_data/Retinaface_mobilenet0.25.pth',
    #-----------------------------------#
    #   Select RetinaFace backone as MobileNet
    #-----------------------------------#
    "retinaface_backbone"   : "mobilenet",
    "confidence"            : 0.5,
    "iou"                   : 0.3,
    #----------------------------------------------------------------------#
    #  If or not the image size limit is needed.
    #  The input image size will affect the FPS significantly, you can reduce the input_shape if you want to speed up the detection speed.
    #  When enabled, it will limit the input image size to input_shape. otherwise, use the original image for prediction.
    #  The input_shape can be adjusted according to the size of the input image, note that it is a multiple of 32, e.g. [640, 640, 3]
    #----------------------------------------------------------------------#
    "retinaface_input_shape": [640, 640, 3],
    #-----------------------------------#
    #   Whether the image size limit is required.
    #-----------------------------------#
    "letterbox_image"       : True,
    
    "facenet_model_path"    : 'facenet_mobilenet0.25.pth',
    #-----------------------------------#
    #   Select FaceNet backbone for MobileNet.
    #-----------------------------------#
    "facenet_backbone"      : "mobilenet",
    "facenet_input_shape"   : [160,160,3],
    "facenet_threhold"      : 0.9,

    "cuda"                  : True
}

Upload an image in the face_dataset folder. The naming rules of face_dataset are XXX_1.jpg, XXX_2.jpg.
Run encoding.py to encode the images inside the face_dataset. The model will generate the corresponding database face encoding data files in the model_data folder.
Upload a photo or video that needs to be recognized in the img folder and change the path of the image/video in the predict.py function.
Run predict.py.

Experiment results

https://drive.google.com/drive/folders/1twdFGTBXaLMNAzMBl7wrfn4Ves_tjiwb?usp=share_link

Let's see the results on the image side.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.idea		.idea
__pycache__		__pycache__
face_dataset		face_dataset
img		img
model_data		model_data
nets		nets
nets_retinaface		nets_retinaface
utils		utils
LICENSE		LICENSE
README.md		README.md
encoding.py		encoding.py
image_fliplr.py		image_fliplr.py
predict.py		predict.py
project_report_Qimeng_Tao_qt2139.ipynb		project_report_Qimeng_Tao_qt2139.ipynb
project_report_Qimeng_Tao_qt2139.pdf		project_report_Qimeng_Tao_qt2139.pdf
requirements.txt		requirements.txt
retinaface.py		retinaface.py

License

qt2139/COMS4995DLCV

Folders and files

Latest commit

History

Repository files navigation

Video face recognition based on RetinaFace and FaceNet

Catalog

Notes

Environment

Experiment_Results

How2predict

Experiment results

About

Resources

License

Stars

Watchers

Forks

Languages