Skip to content

mapleandfire/300VW-Mask

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

300VW-Mask Dataset

The repository provides the download of the 300VW-Mask dataset utilised in the following papers:

@article{wang2019face,
  title={Face mask extraction in video sequence},
  author={Wang, Yujiang and Luo, Bingnan and Shen, Jie and Pantic, Maja},
  journal={International Journal of Computer Vision},
  volume={127},
  number={6},
  pages={625--641},
  year={2019},
  publisher={Springer}
}
@inproceedings{wang2020dynamic,
  title={Dynamic face video segmentation via reinforcement learning},
  author={Wang, Yujiang and Dong, Mingzhi and Shen, Jie and Wu, Yang and Cheng, Shiyang and Pantic, Maja},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={6959--6969},
  year={2020}
}

The 300VW-Mask dataset is an extension of the 300 Videos in the Wild (300VW Dataset) [1] where pixel-wise annotations (face mask) are provided for each cropped facial image. See [2] for how this is done. See examples of cropped faces and mask labels below.

plot

This dataset only contains the pixel-wise mask labels for pre-cropped facial images of 300VW. Please visit the original 300VW Dataset if you need other information like landmark annotations or original frame images.

Download Link

The 300VW-Mask dataset can be downloaded here (Google Drive) or here (Dropbox).

File Structure

Unzip the downloaded file and you will see the following file structure:

..
├── 300VW-Mask-Dataset
│   ├── images_orig
│       └── 001_000001.jpg
│           ...
│   ├── labels_orig
│       └── 001_000001.png
│           ...
│   ├── data_splits
│       ├── split_1
│       └── split_2
│   └── README.md

The cropped facial images and the corresponding mask labels reside under images_orig and labels_orig folders, respectively.

The cropped facial image named "xxx_yyyyyy.jpg" stands for the yyyyyy-th frame of the "xxx" video in the original 300VW dataset. Please be aware that "000001" is the starting frame of the video, not "000000".

The mask labels are named identically except that it ends with "png".

Data Splits

We provide the data splits used in [2] under data_splits/split_1, where train_per10.txt/val_per10.txt/test_per10.txt corresponds to the train/validation/test sets of the 10% 1-s sequences (see [2] for more details).

The data splits used in [3] are provided in data_splits/split_2 folder, where A.txt/B.txt/C.txt corresponds to the A/B/C sets used in [3].

Each line in the data split file, e.g. "/images_orig/xxx_yyyyyy.jpg /labels_orig/xxx_yyyyyy.png" indicates the paths of the cropped facial image and the mask label for the yyyyyy-th frame of the "xxx" video in 300VW.

Note that we have excluded certain frames with inaccurate landmark annotations from the data splits. See ReadMe.txt and extra.zip of the original 300VW Dataset for details.

Generate 300VW-Mask

If you would like to generate 300VW-Mask from the original 300VW using different settings, e.g. with larger or smaller cropping margins, you can use the Matlab scripts under ./processing_scripts to do so.

The script ./processing_scripts/do_segmentation.m will generate the face mask for a frame with 68 landmark annotations.

The script ./processing_scripts/crop_face_and_seg.m will crop the frame image and the generated mask label with a certain margin.

Please consider cite our works [2][3] if you find this dataset useful.

Reference

  1. The first facial landmark tracking in-the-wild challenge: Benchmark and results
    Jie Shen, Stefanos Zafeiriou, Grigoris G. Chrysos, Jean Kossaifi, Georgios Tzimiropoulos, Maja Pantic.
    [link]. Proceedings of the IEEE international conference on computer vision workshops. 2015.

  2. Face mask extraction in video sequence
    Yujiang Wang, Bingnan Luo, Jie Shen, Maja Pantic.
    [link]. International Journal of Computer Vision. 2019.

  3. Dynamic face video segmentation via reinforcement learning
    Yujiang Wang, Mingzhi Dong, Jie Shen, Yang Wu, Shiyang Cheng, Maja Pantic.
    [link]. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020.

About

300VW-Mask dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages