A pipeline for voice-cloning and lip-syncing videos using Open-source tools.
Tech stack:
- Ffmpeg: Audio extraction and minor edits
- Whisper: For audio transcription
- 🐸xTTS: A multi-lingual voice-cloning end-to-end model from Coqui-ai
- Wav2lip: For lip-syncing
Trump's original speech
Trump.says.he.can.end.Ukraine.war.in.a.day.mp4
Trump's speech in Hindi
trump-hindi.mp4
Trump's speech in French