This is a modified version of https://github.com/tcwang0509/TalkingHead-1KH that preserves audio in the downloaded videos. For more information about the dataset, see the original repository. The download and processing instructions remain the same.
First, unzip the metadata and put it under the root directory:
    unzip data_list.zip

Optionally, download a small subset of the dataset first to verify that the scripts work on your computer. You can skip this step if you want to download the entire dataset directly.
    bash videos_download_and_crop.sh small

The processed clips should appear in small/cropped_clips.
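One quick way to confirm that the small run produced output is to count the files in the output directory. The snippet below is a minimal sketch, not part of the repository: the helper name `count_clips` is hypothetical, and it assumes the processed clips are written as `.mp4` files, so adjust the pattern if your output uses a different extension.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): count processed clips
# in a split's output directory.
# Assumption: clips are saved as .mp4 files.
count_clips() {
    local dir="$1"
    if [ ! -d "$dir" ]; then
        # Directory missing: report zero clips and signal failure.
        echo 0
        return 1
    fi
    # Count regular files ending in .mp4; strip the padding some wc builds add.
    find "$dir" -type f -name '*.mp4' | wc -l | tr -d ' '
}

# Example: report the clip count for the small subset, if it exists.
if [ -d small/cropped_clips ]; then
    echo "small: $(count_clips small/cropped_clips) clips"
fi
```

A count of zero after the script finishes usually means the download or cropping step failed, so it is worth checking the script's output before starting the full download.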
To download the training set, run:

    bash videos_download_and_crop.sh train

The script automatically downloads the YouTube videos, splits them into short clips, and then crops and trims them to include only the face regions. The final processed clips should appear in train/cropped_clips.
To download the evaluation set, which consists of only 1080p videos, run:

    bash videos_download_and_crop.sh val

The processed clips should appear in val/cropped_clips.
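Because the downloaded videos are meant to keep their audio, it can be useful to confirm that the processed clips actually contain an audio stream. The following is a sketch under stated assumptions: the helper name `has_audio` is hypothetical, `ffprobe` (shipped with FFmpeg) is assumed to be installed and on your PATH, and clips are assumed to be `.mp4` files.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): succeed if the given
# file contains at least one audio stream.
# Assumption: ffprobe (part of FFmpeg) is installed and on PATH.
has_audio() {
    local file="$1"
    local streams
    # Print the codec_type of every audio stream; empty output means none.
    streams="$(ffprobe -v error -select_streams a \
        -show_entries stream=codec_type -of csv=p=0 "$file" 2>/dev/null)" || streams=""
    [ -n "$streams" ]
}

# Example: list evaluation clips that lack an audio stream.
# Assumption: clips are saved as .mp4 files.
if [ -d val/cropped_clips ]; then
    for clip in val/cropped_clips/*.mp4; do
        [ -e "$clip" ] || continue   # skip if the glob matched nothing
        has_audio "$clip" || echo "no audio: $clip"
    done
fi
```

The same loop can be pointed at small/cropped_clips or train/cropped_clips. Some source videos may have no audio track to begin with, so a few reported clips do not necessarily indicate a problem with the scripts.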