Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Best real time cloning? #1165

Open
WYNNGATE opened this issue Feb 20, 2023 · 6 comments
Open

Best real time cloning? #1165

WYNNGATE opened this issue Feb 20, 2023 · 6 comments

Comments

@WYNNGATE
Copy link

Hi, I'm looking for the best voice changer, in other words, my speech to a cloned voice.

Use case is for youtube and other VO

@gabrielmontagne
Copy link

This commercial solution is quite good, I've tried it a lot and it's great. It's still not fully open, but you can request a test drive, https://www.resemble.ai/speech-to-speech/

@Lolagatorade
Copy link

Really sucks there is AI for everything you can run in local hardware. AI art stable diffusion, text GPT you got alpaca and llama. But nothing good for voice cloning.

@tdlio
Copy link

tdlio commented Mar 23, 2023

ElevenLabs in terms of quality has a really effective voice cloning in my opinion. Does anyone have a guess / know what their training protocol may have been? So the base model, and then what else they added to it to bring it to where it is today. Breaking it down is the first step to making an open source alternative which I’m very interested in doing!

@WYNNGATE
Copy link
Author

WYNNGATE commented Mar 23, 2023 via email

@Lolagatorade
Copy link

ElevenLabs in terms of quality has a really effective voice cloning in my opinion. Does anyone have a guess / know what their training protocol may have been? So the base model, and then what else they added to it to bring it to where it is today. Breaking it down is the first step to making an open source alternative which I’m very interested in doing!

Honestly, there's not much things that are open in terms of Voice cloning. You can go on GitHub and type in voice, cloning and search for whatever comes up I believe some of those results have research papers. I remember there was some Chinese repository that has it running locally but of course I don't know Chinese. I just find it very strange how open image generation face swap, and all the other things are, but there's so many companies that are private when it comes to Voice cloning

@WYNNGATE
Copy link
Author

WYNNGATE commented Mar 23, 2023 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants