Audio2Gestures

Official implementation for Audio2Gestures: Generating Diverse Gestures from Speech Audio with Conditional Variational Autoencoders， ICCV

Dependencies

Data Processing

Download fbx format data from Trinity speech gesture.
Retarget the data into SMPLX model using maya.
convert the fbx data to hdf5 format using script fbx2hdf.py

Training and testing

Run script bash start.sh

Visualizing

Run script python .\smplx2fbx.py --smplx .\for_smplx_retargeting.h5 --key LclRotation --fps 30 --synthesized .\input.h5 --fbx output.fbx

Reference

If you find our code useful for your research, please cite our paper.

@inproceedings{li2021audio2gestures,
title={Audio2Gestures: Generating Diverse Gestures from Speech Audio with Conditional Variational Autoencoders},
author={Li, Jing and Kang, Di and Pei, Wenjie and Zhe, Xuefei and Zhang, Ying and He, Zhenyu and Bao, Linchao},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
pages={11293--11302},
year={2021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
README.md		README.md
calculate_training_stat.py		calculate_training_stat.py
data.py		data.py
fbx2hdf.py		fbx2hdf.py
fbxtools.py		fbxtools.py
for_smplx_retargeting.h5		for_smplx_retargeting.h5
main.py		main.py
model.py		model.py
module.py		module.py
options.py		options.py
s2g_stat.npy		s2g_stat.npy
smpl.fbx		smpl.fbx
smplx2fbx.py		smplx2fbx.py
speech_stat.npy		speech_stat.npy
start.sh		start.sh
test.py		test.py
utils.py		utils.py

JingLi513/Audio2Gestures

Folders and files

Latest commit

History

Repository files navigation

Audio2Gestures

Dependencies

Data Processing

Training and testing

Visualizing

Reference

About

Resources

Stars

Watchers

Forks

Languages