Audio-driven facial animation automatically synthesizes a talking-head video from a speech signal.
This project presents an end-to-end system that takes a single face image and an audio clip and generates a talking-head video. The system can simplify the film-animation pipeline by generating animation automatically from the voice acting, and it can also be applied in post-production to improve lip synchronization in movie dubbing.
This repository uses the model described in the paper *Hierarchical Cross-modal Talking Face Generation with Dynamic Pixel-wise Loss*.
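At a high level, the model works in two stages: ATnet first maps the input audio to a sequence of facial landmarks, and VGnet then renders video frames from those landmarks together with the example image. The sketch below illustrates that flow only; the function signature and the `at_net`/`vg_net` wrappers are hypothetical, not this repo's actual API.

```python
# Minimal sketch of the two-stage inference flow. The names at_net and
# vg_net are illustrative stand-ins, not this repository's real API.
import torch

def synthesize_video(at_net, vg_net, audio_features, reference_image):
    """audio_features: (T, D) per-frame audio features (e.g. MFCC windows);
    reference_image: (C, H, W) example face image."""
    with torch.no_grad():
        # Stage 1: ATnet maps the audio to one set of facial landmarks
        # per output video frame.
        landmark_seq = at_net(audio_features)
        # Stage 2: VGnet renders each frame from the reference image,
        # conditioned on the landmarks for that time step.
        frames = [vg_net(reference_image, lm) for lm in landmark_seq]
    return torch.stack(frames)  # (T, C, H, W) video tensor
```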
- PyTorch environment: PyTorch 0.4.1 (`conda install pytorch=0.4.1 torchvision cuda90 -c pytorch`).
- Install the Python dependencies (`pip install -r requirement.txt`).
- Download the pretrained ATnet and VGnet weights from Google Drive and put them under the `model` folder.
- Run the demo code: `python demo.py` (see the example invocation after the option list below)
  - `-device_ids`: GPU id
  - `-cuda`: use CUDA or not
  - `-vg_model`: pretrained VGnet weight
  - `-at_model`: pretrained ATnet weight
  - `-lstm`: use LSTM or not
  - `-p`: input example image
  - `-i`: input audio file
  - `-sample_dir`: folder to save the outputs
  - ...
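Putting the options together, a full invocation might look like the following; all file names and paths here are placeholders, so substitute the actual weight files and inputs you downloaded:

```
python demo.py -device_ids 0 -cuda True \
    -at_model model/atnet.pth -vg_model model/vgnet.pth \
    -lstm True -p examples/face.jpg -i examples/audio.wav \
    -sample_dir results/
```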
This repository is based on the ATVGnet repository.
This project is licensed under the MIT License.