The idea is to generate new dance poses given an audio clip.
Step 1
Download dance videos from YouTube/other sources
Step 2
Extract frames from the video
e.g. ffmpeg -i video-path -r 24 folder-path/image%04d.png
This extracts frames at 24 fps and saves them as numbered PNG images (image0001.png, image0002.png, and so on) in the given folder.
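With several videos it can be convenient to script the extraction. The following is a minimal sketch, not part of the original pipeline: the function names `build_ffmpeg_command` and `extract_all`, the `.mp4` extension, and the one-subfolder-per-video layout are all assumptions made for illustration. It builds the same ffmpeg command as above for each video and requires ffmpeg to be installed and on the PATH.

```python
from pathlib import Path
import subprocess


def build_ffmpeg_command(video_path, out_dir, fps=24):
    """Return the ffmpeg argument list that extracts frames at `fps`.

    Mirrors: ffmpeg -i video-path -r 24 folder-path/image%04d.png
    """
    pattern = str(Path(out_dir) / "image%04d.png")
    return ["ffmpeg", "-i", str(video_path), "-r", str(fps), pattern]


def extract_all(video_dir, frames_root, fps=24):
    """Extract frames from every .mp4 in `video_dir` (assumed extension).

    Each video gets its own subfolder under `frames_root`, named after
    the video file (hypothetical layout, chosen for this sketch).
    """
    for video in sorted(Path(video_dir).glob("*.mp4")):
        out_dir = Path(frames_root) / video.stem
        out_dir.mkdir(parents=True, exist_ok=True)
        # check=True raises CalledProcessError if ffmpeg exits with an error
        subprocess.run(build_ffmpeg_command(video, out_dir, fps), check=True)
```

For example, `extract_all("videos", "frames")` would write the frames of `videos/clip.mp4` to `frames/clip/`.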
Step 3
Segment the dancer in each frame. I used WSHP, a CVPR'18 spotlight paper on human body part parsing, for this. The repository is forked from here
Original frame:
Segmented frame:
So far I have experimented with Progressive GAN to generate novel poses, training on 11 dance videos with a total of 11,931 frames.
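Before training, it can help to confirm how many frames each video contributed. The sketch below assumes a hypothetical layout with one subfolder per video, each containing files that match `image*.png`; the function name `count_frames` is illustrative and not part of this repository.

```python
from pathlib import Path


def count_frames(frames_root, pattern="image*.png"):
    """Return {video_folder_name: number_of_frames} for each subfolder.

    Assumes one subfolder per video under `frames_root` (hypothetical layout).
    """
    counts = {}
    for folder in sorted(Path(frames_root).iterdir()):
        if folder.is_dir():
            counts[folder.name] = sum(1 for _ in folder.glob(pattern))
    return counts
```

Summing the values, e.g. `sum(count_frames("frames").values())`, gives the total number of frames available for training.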
Output frames:
Audio guided pose generation
Sequential pose generation
Overall dance choreography