Feature: Generate speech at runtime - multiple voice support #6

Open
earboxer opened this issue May 16, 2018 · 2 comments

Comments

@earboxer

Currently, the audio in this skill is hard-coded.

Mycroft Mark II claims:

"Custom voices: You can choose the voice of your Mycroft, currently we support American Female and British Male voices with more coming soon."

To support these voices, this skill will either need a separate set of audio files for every voice, or need the text coded in and the audio generated at runtime.

I suggest the latter.

@earboxer
Author

Based on performance issues with mimic (MycroftAI/mimic1#155, MycroftAI/mimic1#77), it seems like the best option is to create audio files of the voice, then combine them with the backing track using something like pydub.
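A rough sketch of what that combining step might look like with pydub. The function names, the cue list format (clip path plus start offset in milliseconds), and all file names are assumptions made up for illustration; nothing here exists in the skill. It only shows pydub's `AudioSegment.from_file`, `overlay`, and `export` calls being chained together.

```python
def overlay_clips(backing, clips):
    """Overlay each (clip, start_ms) pair onto the backing track.

    `backing` and each clip are expected to be pydub AudioSegment objects.
    """
    mixed = backing
    for clip, start_ms in clips:
        # overlay() returns a new segment; `position` is in milliseconds.
        mixed = mixed.overlay(clip, position=start_ms)
    return mixed


def render_song(backing_path, cues, out_path):
    """Hypothetical helper: load files, mix them, and write the result.

    `cues` is an assumed format: a list of (voice_clip_path, start_ms).
    """
    # Imported here so the module still loads where pydub is not installed.
    from pydub import AudioSegment

    backing = AudioSegment.from_file(backing_path)
    clips = [(AudioSegment.from_file(path), start_ms) for path, start_ms in cues]
    overlay_clips(backing, clips).export(out_path, format="wav")
```

It could be called once per voice, for example `render_song("backing.wav", [("line1_female.wav", 0), ("line2_female.wav", 4200)], "song_female.wav")`, where the paths and offsets are placeholders. Two caveats: the result of `overlay` is always as long as the backing track, so a clip that runs past the end gets cut off, and pydub needs ffmpeg (or libav) available for formats other than WAV. The start offsets would still have to be worked out by hand for each song.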

@penrods
Collaborator

penrods commented Aug 29, 2018

These songs were hand-created by various users in audio editing software. I'm happy to pull in more songs if others want to create some, but at this point I don't expect Mimic or Mimic2 to be able to generate properly timed music without someone spending a lot of time on it. I can imagine using some of the karaoke tracking software to pull something off, but this isn't something the core team is going to exert effort on.
