-
Python 3.6 , Pytorch >= 1.6 and ffmpeg
-
Other requirements are listed in the 'requirements.txt'
Please download the pretrained checkpoint from google-drive and put it within the folder (/checkpoints
).
python inference.py --audio_path xxx.wav --img_path xxx.jpg
Note that the input images must keep the same height and width and the face should be appropriately cropped as in /demo/img
.
@InProceedings{wang2021audio2head,
author = Suzhen Wang, Lincheng Li, Yu Ding, Changjie Fan, Xin Yu
title = {Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion},
booktitle = {the 30th International Joint Conference on Artificial Intelligence (IJCAI-21)},
year = {2021},
}
This codebase is based on First Order Motion Model, thanks for their contribution.