This is the official implementation of the approach described in the paper:
GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation,
Wenhao Li, Mengyuan Liu, Hong Liu, Tianyu Guo, Ti Wang, Hao Tang, Nicu Sebe
Pattern Recognition, 2024
GraphMLP is tested on Ubuntu 18 with Pytorch 1.7.1 and Python 3.9.
- Create a conda environment:
conda create -n graphmlp python=3.9
- Install PyTorch 1.7.1 and Torchvision 0.8.2 following the official instructions
pip3 install -r requirements.txt
Please download the dataset from Human3.6M website and MPI-INF-3DHP website, and refer to VideoPose3D to set up the Human3.6M dataset ('./dataset' directory). Or you can download the processed data from here.
${POSE_ROOT}/
|-- dataset
| |-- data_3d_h36m.npz
| |-- data_3d_3dhp.npz
| |-- data_2d_h36m_gt.npz
| |-- data_2d_h36m_cpn_ft_h36m_dbb.npz
| |-- data_2d_3dhp.npz
The pretrained model can be found in here, please download it and put it in the './checkpoint/pretrained' directory.
To test a 1-frame GraphMLP model:
# Human3.6M
python main.py --test --previous_dir 'checkpoint/pretrained/1' --frames 1
# MPI-INF-3DHP
python main.py --test --previous_dir 'checkpoint/pretrained/1' --frames 1 --dataset '3dhp'
To test a 1-frame GraphMLP model with refine module on Human3.6M:
python main.py --test --previous_dir 'checkpoint/pretrained/1/refine' --frames 1 --refine --refine_reload
To test a 243-frames GraphMLP model on Human3.6M:
python main.py --test --previous_dir 'checkpoint/pretrained/243' --frames 243
Here, we report the parameters, FLOPs, and MPJPE of GraphMLP with different input frame numbers on Human3.6M dataset.
1 | 27 | 81 | 243 | |
---|---|---|---|---|
Param (M) | 9.49 | 9.51 | 9.57 | 9.73 |
FLOPs (M) | 348 | 349 | 351 | 356 |
MPJPE (mm) | 49.2 | 45.5 | 44.5 | 43.8 |
To train a 1-frame GraphMLP model on Human3.6M:
# Train from scratch
python main.py --frames 1 --batch_size 256
# After training for 20 epochs, add refine module
python main.py --frames 1 --batch_size 256 --refine --lr 1e-5 --previous_dir [your best model saved path]
To train a 243-frames GraphMLP model on Human3.6M:
python main.py --frames 243 --batch_size 64
First, you need to download YOLOv3 and HRNet pretrained models here and put it in the './demo/lib/checkpoint' directory. Then, you need to put your in-the-wild videos in the './demo/video' directory.
Run the command below:
# Run the command below:
python demo/vis.py --video sample_video.mp4
# Or run the command with the fixed z-axis:
python demo/vis.py --video sample_video.mp4 --fix_z
Sample demo output:
If you find our work useful in your research, please consider citing:
@article{li2024graphmlp,
title={GraphMLP: A graph MLP-like architecture for 3D human pose estimation},
author={Li, Wenhao and Liu, Mengyuan and Liu, Hong and Guo, Tianyu and Wang, Ti and Tang, Hao and Sebe, Nicu},
journal={Pattern Recognition},
pages={110925},
year={2024},
}
Our code is extended from the following repositories. We thank the authors for releasing the codes.
This project is licensed under the terms of the MIT license.