Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Does it able to learn certain voice style? #5

Open
lucasjinreal opened this issue Sep 28, 2022 · 3 comments
Open

Does it able to learn certain voice style? #5

lucasjinreal opened this issue Sep 28, 2022 · 3 comments

Comments

@lucasjinreal
Copy link

Does it able to learn certain voice style?

@ga642381
Copy link
Owner

ga642381 commented Oct 4, 2022

Hi, thanks for your question. This repo doesn't support learning voice style for now. We might need a style encoder if we want to learn the voice style. Recently, instead, we have been focusing on multilingual TTS. such as supporting Chinese, Taiwanese, and so on.

@ga642381 ga642381 closed this as completed Oct 4, 2022
@ga642381 ga642381 reopened this Oct 4, 2022
@lucasjinreal
Copy link
Author

@ga642381 hi, does multilane tts performant can compatible with single lan? isn't the phone space would be very large?

@ga642381
Copy link
Owner

ga642381 commented Oct 4, 2022

I agree with you. So the collaborator of this repo, Wei-Ping Huang, does have some research on how to use self-supervised features to learn shared phonetic information across different languages. (ref: Few-Shot Cross-Lingual TTS Using Transferable Phoneme Embedding https://arxiv.org/abs/2206.15427)

As for this repo, I think at least we can support different datasets for various languages to make it more friendly for the community to do multispeaker, multilingual TTS research.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants