VoxSlide-AI

VoxSlide-AI is a versatile tool designed to empower users to create short audio-visual clips effortlessly, either based on text files or by recording audio. With its intuitive interface and powerful features, VoxSlide-AI opens up a world of possibilities for content creators, educators, and anyone looking to add a dynamic touch to their presentations or social media content.

Demo Video

demoVideo.mp4

Features

Text-to-Video Conversion: Convert text files into engaging audio-visual clips with ease.
Audio Recording: Record audio directly within the application and transform it into captivating visuals.
Flexible Export: Download video clips in popular formats for easy sharing on social media platforms or integration into presentations.
Subtitles & Audio Voiceover: Generated video will have English subtitles as well as voiceover for interactive content

Use Cases

Content Creation: VoxSlide-AI simplifies the process of creating engaging content for social media platforms, blogs, or websites.
Educational Videos: Teachers and educators can use VoxSlide-AI to create interactive and visually appealing educational videos to enhance learning experiences.
Presentation Enhancement: Add a dynamic element to presentations by converting text or audio content into visually compelling video clips.
Promotional Material: Businesses and marketers can leverage VoxSlide-AI to create simple and interactive content that stands out.

Tech Stack

FrontEnd: React, react-speech-recognition, Bulma(CSS library).
BackEnd: Flask, Moviepy (Video Editing and Manipulation), ImageMagick (Image Processing)
ML Model: Hugging Face's text-to-image model is utilized for generating images based on input text - huggingface.co/stabilityai/stable-diffusion-xl-base-1.0

How to Run VoxSlide-AI

To run VoxSlide-AI on your local machine, follow these steps:

Clone the repository from GitHub:

git clone https://github.com/your-username/VoxSlide-AI.git

Navigate into the server directory
```
 cd VoxSlide-AI/server
```
Install the server dependencies
```
pip install -r requirements.txt
```
Install ImageMagick on your local machine.
Get the Hugging Face Text to Image Model API link and token from Hugging Face and put them in the .env file.
Run the Flask server
```
  python3 app.py
```
Navigate out of the server folder
```
  cd ..
```
Install the frontend dependencies
```
 npm install
```
Start the client server
```
 npm start
```
Access VoxSlide-AI in your web browser at http://localhost:3000. The server runs on port 5000.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
public		public
server		server
src		src
.gitignore		.gitignore
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VoxSlide-AI

Demo Video

Features

Use Cases

Tech Stack

How to Run VoxSlide-AI

About

Releases

Packages

Languages

kshitijv09/VoxSlide-AI

Folders and files

Latest commit

History

Repository files navigation

VoxSlide-AI

Demo Video

Features

Use Cases

Tech Stack

How to Run VoxSlide-AI

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages