WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

copied from: https://github.com/Ascend-Research/HeadPoseEstimation-WHENet

yolo weights: https://drive.google.com/uc?id=1wGrwu_5etcpuu_sLIXl9Nu0dwNc8YXIH
gdown --id 1wGrwu_5etcpuu_sLIXl9Nu0dwNc8YXIH
save yolo weights to: WHENet/yolo_v3/data

testing: python demo_video.py --video video.mp4 --output video_head_pose.avi

WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

Yijun Zhou and James Gregson - BMVC2020

Abstract: We present an end-to-end head-pose estimation network designed to predict Euler angles through the full range head yaws from a single RGB image. Existing methods perform well for frontal views but few target head pose from all viewpoints. This has applications in autonomous driving and retail. Our network builds on multi-loss approaches with changes to loss functions and training strategies adapted to wide range estimation. Additionally, we extract ground truth labelings of anterior views from a current panoptic dataset for the first time. The resulting Wide Headpose Estimation Network (WHENet) is the first fine-grained modern method applicable to the full-range of head yaws (hence wide) yet also meets or beats state-of-the-art methods for frontal head pose estimation. Our network is compact and efficient for mobile devices and applications. ArXiv

Demo

We provided two use case of the WHENet, image input and video input in this repo. Please make sure you installed all the requirments before running the demo code by pip install -r requirements.txt. Additionally, please download the YOLOv3 model for head detection and put it under yolo_v3/data.

Image demo

To run WHENet with image input, please put images and bbox.txt under one folder (E.g. Sample/) and just run pthon demo.py.

Format of bbox.txt are showed below:

image_name,x_min y_min x_max y_max
mov_001_007585.jpeg,240 0 304 83

Video/Webcam demo

We used YOLO_v3 in the video demo to get the cropped head image. In order to customize some of the functions we have put the yolo implementation and the pre-trained model in the repo. Hollywood head and Crowdhuman are used to train the head detection YOLO model.

demo_video.py [--video INPUT_VIDEO_PATH] [--snapshot WHENET_MODEL] [--display DISPLAY_OPTION] 
              [--score YOLO_CONFIDENCE_THRESHOLD] [--iou IOU_THRESHOLD] [--gpu GPU#] [--output OUTPUT_VIDEO_PATH]

Please set --video '' for webcam input.

Dependncies

EfficientNet https://github.com/qubvel/efficientnet
Yolo_v3 https://github.com/qqwweee/keras-yolo3

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
Sample		Sample
yolo_v3		yolo_v3
LICENSE.txt		LICENSE.txt
README.md		README.md
THIRD PARTY OPEN SOURCE SOFTWARE NOTICE.txt		THIRD PARTY OPEN SOURCE SOFTWARE NOTICE.txt
WHENet.h5		WHENet.h5
demo.py		demo.py
demo_video.py		demo_video.py
detect_whenet.py		detect_whenet.py
prepare_images.py		prepare_images.py
requirements.txt		requirements.txt
utils.py		utils.py
whenet.py		whenet.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sample

Sample

yolo_v3

yolo_v3

LICENSE.txt

LICENSE.txt

README.md

README.md

THIRD PARTY OPEN SOURCE SOFTWARE NOTICE.txt

THIRD PARTY OPEN SOURCE SOFTWARE NOTICE.txt

WHENet.h5

WHENet.h5

demo.py

demo.py

demo_video.py

demo_video.py

detect_whenet.py

detect_whenet.py

prepare_images.py

prepare_images.py

requirements.txt

requirements.txt

utils.py

utils.py

whenet.py

whenet.py

Repository files navigation

WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

Demo

Image demo

Video/Webcam demo

Dependncies

About

Releases

Packages

Contributors 4

Languages

License

revygabor/WHENet

Folders and files

Latest commit

History

Repository files navigation

WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

Demo

Image demo

Video/Webcam demo

Dependncies

About

Resources

License

Stars

Watchers

Forks

Languages