Pose2Gaze: Eye-body Coordination during Daily Activities for Gaze Prediction from Full-body Poses

Project homepage: https://zhiminghu.net/hu24_pose2gaze.

Abstract

Human eye gaze plays a significant role in many virtual and augmented reality (VR/AR) applications, such as gaze-contingent rendering, gaze-based interaction, or eye-based activity recognition. 
However, prior works on gaze analysis and prediction have only explored eye-head coordination and were limited to human-object interactions. 
We first report a comprehensive analysis of eye-body coordination in various human-object and human-human interaction activities based on four public datasets collected in real-world (MoGaze), VR (ADT), as well as AR (GIMO and EgoBody) environments. 
We show that in human-object interactions, e.g. pick and place, eye gaze exhibits strong correlations with full-body motion while in human-human interactions, e.g. chat and teach, a person’s gaze direction is correlated with the body orientation towards the interaction partner. 
Informed by these analyses we then present Pose2Gaze – a novel eye-body coordination model that uses a convolutional neural network and a spatio-temporal graph convolutional neural network to extract features from head direction and full-body poses, respectively, and then uses a convolutional neural network to predict eye gaze. 
We compare our method with state-of-the-art methods that predict eye gaze only from head movements and show that Pose2Gaze outperforms these baselines with an average improvement of 24.0% on MoGaze, 10.1% on ADT, 21.3% on GIMO, and 28.6% on EgoBody in mean angular error, respectively. 
We also show that our method significantly outperforms prior methods in the sample downstream task of eye-based activity recognition. 
These results underline the significant information content available in eye-body coordination during daily activities and open up a new direction for gaze prediction.

Environments:

Ubuntu 22.04 python 3.8+ pytorch 1.8.1 cudatoolkit 11.1

Usage:

Step 1: Create the environment

conda env create -f ./environments/pose2gaze.yaml -n pose2gaze
conda activate pose2gaze

Step 2: Follow the instructions in './adt_processing/', './egobody_processing/', './gimo_processing/' and './mogaze_processing/' to process and analyse the datasets.

Step 3: Set 'data_dir' and 'cuda_idx' in 'train_mogaze_xx.sh' (xx for p1, p2, p4, p5, p6, or p7) to evaluate on different participants. By default, 'train_mogaze_xx.sh' first trains the model from scratch and then tests on different actions. If you only want to evaluate the pre-trained models, please comment the training commands (the commands without the 'is_eval' setting).

Step 4: Set 'data_dir' and 'cuda_idx' in 'train_gimo_xx.sh' (xx for past, present, or future) to evaluate on different generation settings. By default, 'train_gimo_xx.sh' first trains the model from scratch and then tests on different actions. If you only want to evaluate the pre-trained models, please comment the training commands (the commands without the 'is_eval' setting).

Step 5: Set 'data_dir' and 'cuda_idx' in 'train_egobody_xx.sh' (xx for past, present, or future) to evaluate on different generation settings. By default, 'train_egobody_xx.sh' first trains the model from scratch and then tests on different actions. If you only want to evaluate the pre-trained models, please comment the training commands (the commands without the 'is_eval' setting).

Step 6: Set 'data_dir' and 'cuda_idx' in 'train_adt_xx.sh' (xx for past, present, or future) to evaluate on different generation settings. By default, 'train_adt_xx.sh' first trains the model from scratch and then tests on different actions. If you only want to evaluate the pre-trained models, please comment the training commands (the commands without the 'is_eval' setting).

Citation

@article{hu24pose2gaze,
	author={Hu, Zhiming and Xu, Jiahui and Schmitt, Syn and Bulling, Andreas},
	journal={IEEE Transactions on Visualization and Computer Graphics}, 
	title={Pose2Gaze: Eye-body Coordination during Daily Activities for Gaze Prediction from Full-body Poses},
	year={2024}}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
adt_processing		adt_processing
checkpoints		checkpoints
egobody_processing		egobody_processing
environments		environments
gimo_processing		gimo_processing
model		model
mogaze_processing		mogaze_processing
utils		utils
README.md		README.md
pose2gaze_adt.py		pose2gaze_adt.py
pose2gaze_egobody.py		pose2gaze_egobody.py
pose2gaze_gimo.py		pose2gaze_gimo.py
pose2gaze_mogaze.py		pose2gaze_mogaze.py
train_adt_future.sh		train_adt_future.sh
train_adt_past.sh		train_adt_past.sh
train_adt_present.sh		train_adt_present.sh
train_egobody_future.sh		train_egobody_future.sh
train_egobody_past.sh		train_egobody_past.sh
train_egobody_present.sh		train_egobody_present.sh
train_gimo_future.sh		train_gimo_future.sh
train_gimo_past.sh		train_gimo_past.sh
train_gimo_present.sh		train_gimo_present.sh
train_mogaze_p1.sh		train_mogaze_p1.sh
train_mogaze_p2.sh		train_mogaze_p2.sh
train_mogaze_p4.sh		train_mogaze_p4.sh
train_mogaze_p5.sh		train_mogaze_p5.sh
train_mogaze_p6.sh		train_mogaze_p6.sh
train_mogaze_p7.sh		train_mogaze_p7.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pose2Gaze: Eye-body Coordination during Daily Activities for Gaze Prediction from Full-body Poses

Abstract

Environments:

Usage:

Citation

About

Releases

Packages

Languages

CraneHzm/Pose2Gaze

Folders and files

Latest commit

History

Repository files navigation

Pose2Gaze: Eye-body Coordination during Daily Activities for Gaze Prediction from Full-body Poses

Abstract

Environments:

Usage:

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages