GitHub

Knowing When to Quit: Selective Cascaded Regression withPatch Attention for Real-Time Face Alignment

Introduction

This is an implementation of the fast and accurate face alignment algorithm presented in the paper: Knowing When to Quit: Selective Cascaded Regression with Patch Attention for Real-Time Face Alignment

Abstract

Facial landmarks (FLM) estimation is a critical component in many face-related applications. In this work, we aim to optimize for both accuracy and speed and explore the trade-off between them. Our key observation is that not all faces are created equal. Frontal faces with neutral expressions converge faster than faces with extreme poses or expressions. To differentiate among samples, we train our model to predict the regression error after each iteration. If the current iteration is accurate enough, we stop iterating, saving redundant iterations while keeping the accuracy in check. We also observe that as neighboring patches overlap, we can infer all facial landmarks (FLMs) with only a small number of patches without a major accuracy sacrifice. Architecturally, we offer a multi-scale, patch-based, lightweight feature extractor with a fine-grained local patch attention module, which computes a patch weighting according to the information in the patch itself and enhances the expressive power of the patch features. We analyze the patch attention data to infer where the model is attending when regressing facial landmarks and compare it to face attention in humans. Our model runs in real-time on a mobile device GPU, with 95 Mega Multiply-Add (MMA) operations, outperforming all state-of-the-art methods under 1000 MMA, with a normalized mean error of 8.16 on the 300W challenging dataset

Installation

The codebases are built on top of MDM

Steps

Run docker:

Download the docker image from here
Load the image: nvidia-docker load < kwtc_docker_image.tar.gz
Run the image: nvidia-docker run -v your_download_dir:dest_dir -it kwtc:new /bin/bash (The -v is needed to copy files to your container)

git clone:

Inside the container: cd /opt/kwtc/
git clone https://github.com/ligaripash/MuSiCa.git

WFLW:

Download the WFLW dataset from here.
copy WFLW.tar.gz to /opt/kwtc/
gunzip WFLW.tar.gz
tar xvf WFLW.tar

Models:

Download the model from here.
Copy models.tar.gz to /opt/kwtc/
gunzip models.tar.gz
tar xvf models.tar

Run inference on a pretrained model with 49 patches:

cd MuSica
python inference.py ( inference.json contains the inference paramters). The output is written to /opt/kwtc/output/
Render the calculated landmarks on image: python show_flm_on_image.py ( the output images are written to /tmp/ )

Evaluate the inference against WFLW ground-truth (expression subset)

python evaluate.py (evaluate.json contains the evluation parameters). You should get 0.088 average normalized error.

To train the model:

python train.py (train_params.py contain the trainig parameters)

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
slim		slim
Produce 300W results.py		Produce 300W results.py
README.md		README.md
arch-coarse-no-pga.png		arch-coarse-no-pga.png
arrange_dataset.py		arrange_dataset.py
build_reference_shapes.py		build_reference_shapes.py
common_params.py		common_params.py
contour.py		contour.py
create_aggregate_db.py		create_aggregate_db.py
create_face_rect.py		create_face_rect.py
data_provider.py		data_provider.py
detect.py		detect.py
evaluation.json		evaluation.json
evaluation.py		evaluation.py
image_preprocessing.py		image_preprocessing.py
inference.py		inference.py
inference_params.py		inference_params.py
mdm_eval_utils.py		mdm_eval_utils.py
mdm_model.py		mdm_model.py
mdm_train.py		mdm_train.py
show_flm_on_image.py		show_flm_on_image.py
train.py		train.py
train_params.py		train_params.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Knowing When to Quit: Selective Cascaded Regression withPatch Attention for Real-Time Face Alignment

Introduction

Abstract

Installation

Steps

Run docker:

git clone:

WFLW:

Models:

Run inference on a pretrained model with 49 patches:

Evaluate the inference against WFLW ground-truth (expression subset)

To train the model:

About

Releases

Packages

Languages

ligaripash/MuSiCa

Folders and files

Latest commit

History

Repository files navigation

Knowing When to Quit: Selective Cascaded Regression withPatch Attention for Real-Time Face Alignment

Introduction

Abstract

Installation

Steps

Run docker:

git clone:

WFLW:

Models:

Run inference on a pretrained model with 49 patches:

Evaluate the inference against WFLW ground-truth (expression subset)

To train the model:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages