Skip to content

LipWise is a powerful video dubbing tool that leverages optimized inference for Wav2Lip, this also utilizes models like GFPGAN and CodeFormer. These sophisticated models seamlessly integrate the new audio with the lip movements of the reference video, resulting in a stunningly natural and realistic final output.

License

Notifications You must be signed in to change notification settings

brahianrosswill/Lip_Wise

 
 

Repository files navigation

Open in Google Colab

Important

Please Help by starring the repo. 😁


Lip-Wise leverages Wav2Lip for audio-to-lip generation, seamlessly integrating with cutting-edge face restoration models (CodeFormer, GFPGAN, RestoreFormer) for added realism. MediaPipe ensures precise facial landmark detection, while RealESRGAN enhances background quality. Simply provide an audio clip and a reference video, and Lip-Wise orchestrates the process to deliver stunning results.

Here's what makes Lip-Wise stand out:

  • Effortless Workflow: Unleash your creativity with an intuitive and user-friendly interface.
  • Unleash Your Vision: No more limitations - use any video, even those without a face in every frame.
  • Precision Meets Efficiency: Combining enhanced face detection, landmark recognition, and streamlined processing delivers superior results with significantly faster performance.
  • Simplified Setup: Get started quickly with minimal technical hassle - a breeze even for beginners.

UI Screenshots:

👓 INSTALLATION

🥎 GETTING STARTED

Open in Google Colab

💡Tip: Make sure to use GPU runtime for faster processing.


💿 SETUP AND INFERENCE

WindowsnVIDIA

  • Clone this repository:
    • git clone https://github.com/pawansharmaaaa/Lip_Wise
  • Install Python > 3.10 from Official Site or From Microsoft store.
  • Install winget from Microsoft Store.
  • Download and install the CUDA Toolkit that is compatible with your system. The latest version generally supports most NVIDIA 10-series graphics cards and newer models.
  • Run setup.bat
  • Run launch.bat

DebianUbuntuPop!_OSnVIDIA

  • Clone this repository:
    • git clone https://github.com/pawansharmaaaa/Lip_Wise
  • Make sure python --version is >3.10
  • Download and install the CUDA Toolkit that is compatible with your system. The latest version generally supports most NVIDIA 10-series graphics cards and newer models.
  • Make setup.sh an executable
    • chmod +x ./setup.sh
  • Run setup.sh by double clicking on it.
  • Make launch.sh an executable
    • chmod +x ./launch.sh
  • Run launch.sh by double clicking on it.

FEATURES

LipWise empowers you to create stunningly realistic and natural results, combining the power of AI with user-friendly features:

Media Versatility:

  • Process both images and videos: Breathe life into your visuals, regardless of format.
  • Advanced image and video preprocessing: Ensure optimal quality for exceptional results.

Cutting-edge Restoration:

  • Harness the power of leading models: GFPGAN, RestoreFormer, and CodeFormer work in tandem to deliver exceptional detail and clarity.
  • RealESRGAN integration: Enhance the background quality of your visuals effortlessly.

Image Processing:

  • 3D alignment in process image: Achieve unparalleled realism with precise facial landmark detection.

Video Processing:

  • No need for face in every frame: LipWise intelligently interpolates missing frames, ensuring smooth transitions and realistic lip movements.
  • Fast inference: Enjoy a fluid experience with rapid video processing.
  • Video looping: Create seamless looping videos with consistent results.
  • RealESRGAN integration: Elevate the background quality of your videos effortlessly

🤗 ACKNOWLEDGEMENTS:

Thanks to the following open-source projects:

NumPyPyTorchTensorFlow

About

LipWise is a powerful video dubbing tool that leverages optimized inference for Wav2Lip, this also utilizes models like GFPGAN and CodeFormer. These sophisticated models seamlessly integrate the new audio with the lip movements of the reference video, resulting in a stunningly natural and realistic final output.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 65.6%
  • Jupyter Notebook 21.6%
  • CSS 11.7%
  • Other 1.1%