Release 0.1.4 · diStyApps/seait

SEAIT 0.1.4.5 Dev Build available for public download https://www.patreon.com/distyx

This version hasn't been released here because it contains some significant changes that could disrupt functionality. It's still under development and might not yet provide best user experience.

I've had a bit of a strange month due to some personal circumstances and
recently, updates have slowed down due to a temporary issue with hardware access.
However, this issue is on its way to being resolved, and I will get back on track and resume frequent updates. This change will also open the door to integrating more advanced features into the SEAIT.

If you like what I'm doing and wish to assist, you can do so at https://www.patreon.com/distyx
Your support will contribute to improving and accelerating the development, and enhance the Super Easy AI Installer Tool (SEAIT) to be better and more efficient. This will help you access to the latest and best open-source projects with the fewest possible clicks.

As always, SEAIT will be available for public download, with major versions continually being released here. However, I'll initially release minor versions featuring new or experimental functionalities on Patreon.
Don't worry if you can't contribute right now - after a while, these minor versions will also be available to the public on Patreon.
Rest assured, these minor versions will be incorporated into the major versions, which will always be available to the public.

For more info about SEAIT 0.1.4.5 https://www.patreon.com/distyx

UPCOMING:
A series of video tutorials on how to correctly set up and use SEAIT.
Improved Projects layout with categories
Improved Project layout with more custom settings
Additional tools in the toolbox
Projects will update remotely, eliminating the need to wait for the app to update.
Ability to add custom projects.

I'm also planning to release two of my older projects:

An interface for SimSwap that allows for face swapping images and videos, among other features.

An video editor that utilizes Google's MediaPipe. This project can segment videos based on person detection, face presence, or face angle. It helps extract scenes with humans, scenes without humans, scenes without a face, scenes with a face, and faces at specific angles. Additionally, it allows you to stitch these segments back together.
Please note, these older projects will require updates before release, which will take some time.
Expect video demos within the week.

There's a lot on the plate. Let's go!

Update [0.1.4]

To create a symbolic link using the symlink tool, you need to run the 'RunAsAdministratorSymlink_seait.bat' file as an administrator:

Right-click on the 'RunAsAdministratorSymlink_seait.bat' file.
Select 'Run as Administrator' from the context menu.
Click 'Yes' when prompted by User Account Control.

This will grant the necessary permissions for creating symbolic links using the symlink tool.

For regular usage of seait.exe, you don't need to perform these steps. Simply start seait.exe as you normally would.
@asashledombos thank you for the script.

Added

bark-gui and openai whisper-ui both tested on GTX 970 4GB and worked great.

bark-gui [text-to-speech and voice cloning]

https://github.com/suno-ai/bark

Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. The model can also produce nonverbal communications like laughing, sighing and crying. To support the research community, we are providing access to pretrained model checkpoints ready for inference.

https://github.com/C0untFloyd/bark-gui

bark-gui is This is a simple Web UI for an extended Bark Version using Gradio, meant to be run locally.
whisper-ui [speech-to-text]

https://github.com/openai/whisper

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

A bit old maybe there new GUIs for whisper but i used this one.

https://github.com/hayabhay/whisper-ui

whisper-ui is a simple Streamlit UI for OpenAI's Whisper speech-to-text model. It let's you download and transcribe media from YouTube videos, playlists, or local files. You can then browse, filter, and search through your saved audio files.

I have also have an old fork of this project with some differences that let chose gpu or cpu but its older then this one i might added later if requested.

Minor fixes and changes to the code.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

0.1.4

Update [0.1.4]

bark-gui and openai whisper-ui both tested on GTX 970 4GB and worked great.