Human Pose Estimation with Deeply Learned Multi-Scale Compositional Models

This implementation is based on the code and data from [1-7]. We thank all the authors for kindly sharing these valuable resources.

Setting

Install Torch

Option (a): Follow http://torch.ch/docs/getting-started.html
Option (b): Use the Docker image kaixhin/cuda-torch:8.0 (https://hub.docker.com/r/kaixhin/cuda-torch/)

Install dependences

apt-get install libhdf5-serial-dev
luarocks install hdf5

Prepare datasets Download MPII [3] and LSP [4] datasets and create symbolic links so that their respective JPEG images can be found in:

data/mpii/images
data/lsp_dataset/images
data/lspet_dataset/images

Training

Train a model with 3 semantic levels on 1 GPUs(16G).

./experiments/PLACEHOLDER/train_dlmscm_l3.sh

where PLACEHOLDER can be:

mpii: Train with MPII training data excluding the 3K validation samples.
mpii-include-val: Train with MPII training data including the 3K validation samples.
mpii-lsp: Train with MPII training data and corrected LSP training data.

Testing

Download trained models from our project website and put them in checkpoints/saved.
Get human pose predictions using a trained model with 3 semantic levels

./experiments/PLACEHOLDER/predict_dlmscm_l3.sh

where PLACEHOLDER can be:

mpii: Predict on MPII 3K validation samples.
mpii-include-val: Predict on MPII testing data.
mpii-lsp: Predict on LSP testing data.

Evaluate the predictions by comparing them against the corresponding ground truth.

Check http://human-pose.mpi-inf.mpg.de/#evaluation for evaluation on MPII data.
You may evaluate the PCK@0.2 scores of your model on the LSP test set. To get start, download our prediction pred_multiscale_1_best.h5 and eval code from github, and run the MATLAB script transfer_pre_test.m.

References

[1] Wei Tang, Pei Yu, and Ying Wu. "Deeply Learned Compositional Models for Human Pose Estimation." in Proceedings of European Conference on Computer Vision (ECCV'18), Munich, Germany, Sept. 2018.

[2] Wei Yang, Shuang Li, Wanli Ouyang, Hongsheng Li, and Xiaogang Wang. "Learning feature pyramids for human pose estimation." In ICCV 2017.

[3] Alejandro Newell, Kaiyu Yang, and Jia Deng. "Stacked hourglass networks for human pose estimation." In ECCV 2016.

[4] https://github.com/facebook/fb.resnet.torch

[5] Mykhaylo Andriluka, Leonid Pishchulin, Peter Gehler, and Bernt Schiele. "2d human pose estimation: New benchmark and state of the art analysis." In CVPR 2014.

[6] Sam Johnson and Mark Everingham. "Clustered pose and nonlinear appearance models for human pose estimation." In BMVC 2010.

[7] Ben Sapp and Ben Taskar. "Modec: Multimodal decomposable models for human pose estimation." In CVPR 2013.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
checkpoints		checkpoints
data		data
experiments		experiments
gen		gen
src		src
.gitattributes		.gitattributes
README.md		README.md
main.lua		main.lua
multiscale_predict.lua		multiscale_predict.lua

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Human Pose Estimation with Deeply Learned Multi-Scale Compositional Models

Setting

Training

Testing

References

About

Releases

Packages

Languages

idotc/DLMSCM

Folders and files

Latest commit

History

Repository files navigation

Human Pose Estimation with Deeply Learned Multi-Scale Compositional Models

Setting

Training

Testing

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages