This repository provides PyTorch implementations of audio-driven face mesh and blendshape models.
Currently, it supports the following models:
- Audio2Face
- VOCA
- FaceFormer
The following feature extractors are available (a short usage sketch follows the list):
- Wav2Vec
- MFCCExtractor
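As a rough illustration of what these two feature paths produce, the sketch below extracts both Wav2Vec 2.0 and MFCC features with torchaudio. Using torchaudio directly is an assumption made here for a self-contained example; it does not show this repository's actual Wav2Vec / MFCCExtractor API.

```python
# Sketch: the two audio feature paths, via torchaudio (an assumption;
# not this repository's Wav2Vec / MFCCExtractor classes).
import torch
import torchaudio

# Load speech and resample to 16 kHz mono.
waveform, sr = torchaudio.load("speech.wav")              # (channels, samples)
waveform = torchaudio.functional.resample(waveform, sr, 16_000)
waveform = waveform.mean(dim=0, keepdim=True)             # mono: (1, samples)

# Wav2Vec 2.0: contextual per-frame embeddings from a pretrained model.
bundle = torchaudio.pipelines.WAV2VEC2_BASE
wav2vec = bundle.get_model().eval()
with torch.inference_mode():
    features, _ = wav2vec.extract_features(waveform)
print(features[-1].shape)                                 # (1, frames, 768)

# MFCC: classic cepstral coefficients per frame.
mfcc = torchaudio.transforms.MFCC(sample_rate=16_000, n_mfcc=13)
coeffs = mfcc(waveform)                                   # (1, 13, frames)
print(coeffs.shape)
```

Either feature sequence then serves as the per-frame audio conditioning for the models listed above.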
This repository uses VOCASET, introduced in 'Capture, Learning, and Synthesis of 3D Speaking Styles' (CVPR 2019), as its template dataset.
Additionally, the FLAME_sample template has been extracted and converted to assets/FLAME_sample.obj, and the Renderer has been redesigned. As a result, the psbody library, which can cause installation issues for Apple Silicon users, is not required in this repository.
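For reference, the converted template can be inspected without psbody. The sketch below uses trimesh as a stand-in loader (an assumption; the redesigned Renderer may handle meshes differently):

```python
# Sketch: loading the converted FLAME template without psbody.
# trimesh is an assumption here, not necessarily what the Renderer uses.
import numpy as np
import trimesh

template = trimesh.load("assets/FLAME_sample.obj", process=False)
vertices = np.asarray(template.vertices)  # (5023, 3) for the FLAME topology
faces = np.asarray(template.faces)        # triangle indices into vertices

print(vertices.shape, faces.shape)
```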
References
- VOCASET
- Cudeiro, Daniel, et al. "Capture, Learning, and Synthesis of 3D Speaking Styles." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2019.
- TimoBolkart/voca
- Fan, Yingruo, et al. "FaceFormer: Speech-Driven 3D Facial Animation with Transformers." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2022.
- Karras, Tero, et al. "Audio-Driven Facial Animation by Joint End-to-End Learning of Pose and Emotion." ACM Transactions on Graphics (SIGGRAPH). 2017.