
Image-Audio Lip Sync

We have deployed this project here.

Introduction

This project takes in an image and an audio file and maps the audio to the image, creating a lip-synced video. It utilizes two existing open-source repositories: Wav2Lip for audio-to-video lip syncing and One-Shot Free-View Neural Talking-Head Synthesis for generating a video from a single image.

This tool can be useful for a variety of applications, such as creating realistic talking-head videos for presentations or video production, or generating lip-synced animations in real-time applications. The process runs the audio and image through each repository in turn and combines the results to produce the final output. The resulting video can be saved or further processed as needed.
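For illustration, here is a minimal sketch of how the two stages could be chained on the processing server. The script paths and most of the flags are placeholders (Wav2Lip does document an inference.py entry point with these arguments, but the talking-head repository's invocation depends on your checkout), so treat this as an outline rather than the exact commands this project uses.

```js
// Hypothetical pipeline sketch; paths and flag names are placeholders
// except for Wav2Lip's documented inference.py arguments.
const { execFileSync } = require('child_process');

function generateLipSyncedVideo(imagePath, audioPath, outPath) {
  // Stage 1: animate the still image into a talking-head video
  // (One-Shot Free-View Neural Talking-Head Synthesis).
  execFileSync('python', [
    'talking-head/demo.py',          // placeholder entry point
    '--source_image', imagePath,
    '--result_video', 'head.mp4',    // placeholder flags
  ]);

  // Stage 2: lip-sync the generated video to the audio with Wav2Lip.
  execFileSync('python', [
    'Wav2Lip/inference.py',
    '--checkpoint_path', 'Wav2Lip/checkpoints/wav2lip_gan.pth',
    '--face', 'head.mp4',
    '--audio', audioPath,
    '--outfile', outPath,
  ]);
}
```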

Here is a video showcasing it in action:

DEMO

Template for frontend: https://berrydashboard.io/

Usage

To use this tool, follow the steps below:

  1. Open the hamburger menu on the application interface.
  2. Click on the "Upload Media" button to select and upload your image and audio files.
  3. Once you have selected your files, click on the "Send" button to submit the data.
  4. The data will be sent to the Node.js backend, which forwards it to a separate server with a heavy-duty GPU for processing (see the sketch after this list).
  5. After the processing is complete, the final data will be sent back to the application interface.
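As a rough illustration of step 4, below is a minimal sketch of the backend route, assuming Express with multer for the uploads and a hypothetical GPU_SERVER_URL environment variable; the route path, field names, and response format are illustrative, not the project's actual API.

```js
// Hypothetical upload route: accept image + audio, forward to the GPU
// server, and stream the finished video back to the client.
const express = require('express');
const multer = require('multer');
const axios = require('axios');
const fs = require('fs');
const FormData = require('form-data');

const app = express();
const upload = multer({ dest: 'uploads/' });

app.post(
  '/api/process',
  upload.fields([{ name: 'image' }, { name: 'audio' }]),
  async (req, res) => {
    // Re-package both files and forward them to the GPU server.
    const form = new FormData();
    form.append('image', fs.createReadStream(req.files.image[0].path));
    form.append('audio', fs.createReadStream(req.files.audio[0].path));

    const gpuResponse = await axios.post(
      `${process.env.GPU_SERVER_URL}/lipsync`, // placeholder endpoint
      form,
      { headers: form.getHeaders(), responseType: 'arraybuffer' }
    );

    // The GPU server is assumed to return the finished video.
    res.type('video/mp4').send(Buffer.from(gpuResponse.data));
  }
);

app.listen(4000);
```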

Please note that the processing time may vary depending on the size and complexity of the files, as well as the availability of the GPU server. You can monitor the progress of your request in the application interface, and the final output will be displayed once the processing is complete.

For more detailed instructions on how to set up and configure this tool, please refer to the "Installation" section of this README.

Technology Stack

MERN stands for MongoDB, Express.js, React, and Node.js. It is a full-stack JavaScript technology stack that lets developers build web applications from front to back using a single programming language.

  • MongoDB is a NoSQL database that stores data in a JSON-like format, making it easy to work with data in JavaScript.

  • Express.js is a web application framework for Node.js that provides a set of features for building web applications, including middleware, routing, and templating.

  • React is a JavaScript library for building user interfaces that allows developers to create reusable UI components.

  • Node.js is a server-side JavaScript runtime environment that enables developers to build scalable, high-performance web applications.

MERN is a popular choice for building modern web applications because it provides a unified and consistent development experience across the entire stack. It also allows developers to take advantage of the latest trends in front-end development, such as component-based architecture and reactive programming.
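As a toy illustration of how these layers fit together in a project like this one (the model, route, and connection URI below are hypothetical, not this repository's actual schema):

```js
// Illustrative MERN wiring: an Express route backed by a MongoDB
// collection via mongoose. A React component would poll this route.
const mongoose = require('mongoose');
const express = require('express');

// Hypothetical "Job" model tracking a lip-sync request's progress.
const Job = mongoose.model('Job', new mongoose.Schema({
  status: String,     // e.g. "queued" | "processing" | "done"
  resultUrl: String,  // where the finished video can be fetched
}));

const app = express();

// The React frontend could fetch('/api/jobs/:id') to check status.
app.get('/api/jobs/:id', async (req, res) => {
  const job = await Job.findById(req.params.id);
  res.json(job);
});

async function main() {
  await mongoose.connect('mongodb://localhost/lipsync'); // placeholder URI
  app.listen(4000);
}
main();
```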

Installation

Frontend

To install the frontend of this tool, clone the repository and run the following command:

yarn ; ./deploy.sh

Backend

To install and run the backend, navigate to the backend directory and run the following command:

yarn ; node server.js

Please note that in order to use the tool, you will also need to set up a separate server with a heavy-duty GPU that runs the two repositories mentioned above. The address of this server must be configured in the backend.

Additionally, it is recommended to set up SSL for the backend and frontend server in order to avoid browser warnings and blockages. Please refer to the appropriate SSL documentation for your server and configuration.
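For example, a minimal sketch of serving the Express backend over HTTPS might look like the following; the certificate paths and the app export are placeholders for your own SSL setup.

```js
// Hypothetical HTTPS wrapper around the Express app; certificate
// paths are placeholders for wherever your SSL files live.
const https = require('https');
const fs = require('fs');
const app = require('./app'); // assumed export of the Express app

https.createServer(
  {
    key: fs.readFileSync('/etc/ssl/private/server.key'),  // placeholder
    cert: fs.readFileSync('/etc/ssl/certs/server.crt'),   // placeholder
  },
  app
).listen(443);
```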

About

This project aims to turn 2D photographs provided by the user into 3D versions that can be lip-synced with an audio file of the user's choice. The final product will be a lifelike video of the subject. The project also considers future additions such as integration with different backgrounds, text inclusion, and social media sharing.
