In this work, we present the Pose-native Neural Architecture Search (PoseNAS) to simultaneously design a pose encoder and a pose decoder for pose estimation. Specifically, we directly search a data-oriented pose encoder with stacked searchable cells, which provides an optimal feature extractor for the pose-specific task. In the pose decoder, we exploit scale-adaptive fusion cells to promote rich information exchange across the multi-scale feature maps. Meanwhile, the pose decoder adopts a Fusion-and-Enhancement manner to progressively boost the high-resolution representations that are non-trivial for the precise prediction of hard keypoints. With the carefully designed search space and search strategy, PoseNAS searches all modules in an end-to-end manner. Our best model obtains 76.7% mAP and 75.9% mAP on the COCO validation set and test set, respectively, with only 33.6M parameters.
In this project, we support two different structures, PoseNAS-L18-C48 and PoseNAS-L18-C64. 'L' and 'C' stand for the number of cells in the pose encoder and the number of initial channels of the pose encoder, respectively; you can change them in the configuration files (config.TRAIN.LAYERS and config.TRAIN.INIT_CHANNELS).
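The searched cell structures themselves are defined by the released architecture files; purely as orientation, a minimal sketch of how LAYERS and INIT_CHANNELS shape the network might look as follows (the class names, the cell body, and the downsampling schedule are hypothetical, not the repo's actual code):

```python
import torch.nn as nn

class SearchedCell(nn.Module):
    """Stand-in for a searched encoder cell; the actual searched operations
    are defined by the released architecture files (this is NOT the repo's API)."""
    def __init__(self, in_ch, out_ch, stride=1):
        super().__init__()
        self.op = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, 3, stride=stride, padding=1, bias=False),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        return self.op(x)

class PoseNASSketch(nn.Module):
    """Rough skeleton: config.TRAIN.LAYERS stacked cells in the pose encoder,
    starting from config.TRAIN.INIT_CHANNELS channels. The real pose decoder
    fuses multi-scale features; a single 1x1 head stands in for it here."""
    def __init__(self, layers=18, init_channels=64, num_joints=17):
        super().__init__()
        cells, ch = [SearchedCell(3, init_channels, stride=2)], init_channels
        for i in range(1, layers):
            stride = 2 if i % 6 == 0 else 1          # illustrative downsampling schedule
            out_ch = ch * 2 if stride == 2 else ch
            cells.append(SearchedCell(ch, out_ch, stride))
            ch = out_ch
        self.encoder = nn.Sequential(*cells)
        self.head = nn.Conv2d(ch, num_joints, kernel_size=1)

    def forward(self, x):
        return self.head(self.encoder(x))
```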
Results on the COCO val2017 set:

Arch | Input size | #Params | GFLOPs | AP | AP .5 | AP .75 | AP (M) | AP (L) | AR | AR .5 | AR .75 | AR (M) | AR (L) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
PoseNAS-L18-C48 | 384x288 | 21.1M | 9.1 | 0.762 | 0.910 | 0.830 | 0.723 | 0.828 | 0.810 | 0.944 | 0.871 | 0.768 | 0.871 |
PoseNAS-L18-C64 | 384x288 | 33.6M | 14.8 | 0.767 | 0.915 | 0.837 | 0.725 | 0.829 | 0.812 | 0.947 | 0.873 | 0.771 | 0.871 |
- Flip test is used.
- Person detection results are available at detected_bbox.
- GFLOPs is for convolution and linear layers only.
Results on the COCO test-dev2017 set:

Arch | Input size | #Params | GFLOPs | AP | AP .5 | AP .75 | AP (M) | AP (L) | AR | AR .5 | AR .75 | AR (M) | AR (L) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
PoseNAS-L18-C48 | 384x288 | 21.1M | 9.1 | 0.753 | 0.927 | 0.832 | 0.717 | 0.810 | 0.802 | 0.956 | 0.871 | 0.762 | 0.857 |
PoseNAS-L18-C64 | 384x288 | 33.6M | 14.8 | 0.759 | 0.930 | 0.838 | 0.722 | 0.814 | 0.807 | 0.958 | 0.876 | 0.767 | 0.861 |
- Flip test is used.
- Person detection results are available at detected_bbox.
- GFLOPs is for convolution and linear layers only.
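As noted, the GFLOPs above count only convolution and linear layers. A minimal sketch of how such a count can be reproduced with PyTorch forward hooks (the function below is illustrative and not part of this repo; it counts one multiply-add as a single FLOP, and the convention behind the table may differ):

```python
import torch
import torch.nn as nn

def count_conv_linear_gflops(model, input_size=(1, 3, 384, 288)):
    """Count GFLOPs of Conv2d and Linear layers only, via forward hooks.
    One multiply-add is counted as a single FLOP here."""
    flops = []

    def conv_hook(module, inputs, output):
        # Each output element needs (in_channels / groups) * kH * kW multiply-adds.
        kh, kw = module.kernel_size
        flops.append(output.numel() * (module.in_channels // module.groups) * kh * kw)

    def linear_hook(module, inputs, output):
        flops.append(output.numel() * module.in_features)

    hooks = []
    for m in model.modules():
        if isinstance(m, nn.Conv2d):
            hooks.append(m.register_forward_hook(conv_hook))
        elif isinstance(m, nn.Linear):
            hooks.append(m.register_forward_hook(linear_hook))

    model.eval()
    with torch.no_grad():
        model(torch.randn(*input_size))
    for h in hooks:
        h.remove()
    return sum(flops) / 1e9
```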
Results on the MPII validation set (PCKh@0.5):

Arch | Head | Shoulder | Elbow | Wrist | Hip | Knee | Ankle | Mean |
---|---|---|---|---|---|---|---|---|
PoseNAS-L18-C64 | 97.2 | 96.3 | 90.6 | 86.0 | 90.0 | 86.5 | 83.0 | 90.4 |
- Flip test is used.
- Input size is 256x256.
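All of the results above use flip testing: the input is also fed through the network horizontally flipped, the flipped heatmaps are mirrored back with left/right joint channels swapped, and the two predictions are averaged. A minimal sketch of this procedure, assuming a model that outputs per-joint heatmaps and a list of left/right joint index pairs (both names here are hypothetical, not this repo's API):

```python
import torch

def flip_test(model, image, flip_pairs):
    """Average heatmaps predicted from the original and the horizontally
    flipped input.

    image:      tensor of shape (N, 3, H, W)
    flip_pairs: list of (left_idx, right_idx) joint index pairs to swap
    """
    with torch.no_grad():
        heatmaps = model(image)                        # (N, J, h, w)
        flipped = model(torch.flip(image, dims=[3]))   # predict on mirrored input
        flipped = torch.flip(flipped, dims=[3])        # mirror heatmaps back
        for left, right in flip_pairs:                 # restore left/right semantics
            flipped[:, [left, right]] = flipped[:, [right, left]]
    return (heatmaps + flipped) / 2.0
```

Some implementations additionally shift the flipped heatmaps by one pixel before averaging; see the released test code for the exact variant used here.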
- Currently we release our searched network architectures and the training code.
- The search code and all the pretrained models will be released after the paper is accepted.
- Install PyTorch >= v1.0.0 following the official instructions.
- Clone this repo, and we'll call the directory that you cloned ${POSE_ROOT}.
- Install dependencies:
  pip install -r requirements.txt
- Make libs:
  cd ${POSE_ROOT}
  make
- Install COCOAPI:
  # COCOAPI=/path/to/clone/cocoapi
  git clone https://github.com/cocodataset/cocoapi.git $COCOAPI
  cd $COCOAPI/PythonAPI
  # Install
  python3 setup.py install --user
  # Alternatively, install into global site-packages
  make install
- Init output (training model output directory) and log (tensorboard log directory) directories:
  mkdir output
  mkdir log
Your directory tree should look like this:
${POSE_ROOT}
├── data
├── core
├── dataset
├── experiments
├── log
├── models
├── nms
├── output
├── utils
├── train.py
├── test.py
├── Makefile
├── README.md
└── requirements.txt
For MPII data, please download from the MPII Human Pose Dataset. The original annotation files are in MATLAB format. We use the converted JSON format provided by SimpleBaseline. Extract them under {POSE_ROOT}/data, and make them look like this:
${POSE_ROOT}
|-- data
`-- |-- mpii
`-- |-- annot
| |-- gt_valid.mat
| |-- test.json
| |-- train.json
| |-- trainval.json
| `-- valid.json
`-- images
|-- 000001163.jpg
|-- 000003072.jpg
For COCO data, please download from the COCO download page. The person detection results of COCO val2017 and test-dev2017 are available at detected_bbox. Download and extract them under {POSE_ROOT}/data, and make them look like this:
${POSE_ROOT}
|-- data
`-- |-- coco
`-- |-- annotations
| |-- person_keypoints_train2017.json
| `-- person_keypoints_val2017.json
|-- person_detection_results
| |-- COCO_val2017_detections_bbox.json
| |-- COCO_test-dev2017_detections_bbox.json
`-- images
|-- train2017
| |-- 000000000009.jpg
| |-- 000000000025.jpg
| |-- ...
`-- val2017
|-- 000000000139.jpg
|-- 000000000285.jpg
|-- ...
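The detection files follow the standard COCO results format: a JSON list of detections, each with image_id, category_id, bbox in [x, y, w, h], and score. A minimal sketch of loading them and grouping person boxes per image (the helper name and score threshold are only examples, not the repo's pipeline):

```python
import json

def load_person_boxes(path, score_thresh=0.0):
    """Load COCO-format detection results and group person boxes by image id."""
    with open(path) as f:
        detections = json.load(f)

    boxes_per_image = {}
    for det in detections:
        if det["category_id"] != 1:          # category 1 is "person" in COCO
            continue
        if det["score"] < score_thresh:
            continue
        boxes_per_image.setdefault(det["image_id"], []).append(
            (det["bbox"], det["score"])      # bbox is [x, y, w, h]
        )
    return boxes_per_image

# Example (path relative to ${POSE_ROOT}):
# boxes = load_person_boxes(
#     "data/coco/person_detection_results/COCO_val2017_detections_bbox.json")
```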
Testing on the COCO dataset using our pretrained model:
python test.py --cfg experiments/coco/256x192_18l_64c.yaml --test_weight "path/to/your/weight"

Training on the COCO dataset:
python train.py --cfg experiments/coco/256x192_18l_64c.yaml
Testing on the MPII dataset using our pretrained model:
python test.py --cfg experiments/mpii/256x256_18l_64c.yaml --test_weight "path/to/your/weight"

Training on the MPII dataset:
python train.py --cfg experiments/mpii/256x256_18l_64c.yaml
This repo is largely modified from DARTS, HRNet, and SimpleBaseline.
If you find this repo useful, please cite:
@InProceedings{PoseNAS,
author = {Qian Bao and Wu Liu and Jun Hong and Lingyu Duan and Tao Mei},
title = {Pose-native Neural Architecture Search for Multi-person Pose Estimation},
booktitle = {ACM International Conference on Multimedia (ACM MM)},
year = {2020}
}