Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

regarding english dateset #4

Closed
dutchsing009 opened this issue May 4, 2022 · 1 comment
Closed

regarding english dateset #4

dutchsing009 opened this issue May 4, 2022 · 1 comment

Comments

@dutchsing009
Copy link

hi great work done here !!! I wanted to know if this repo is going to work on English speaking dataset ?? and whether Are there English examples for reference to know the quality ?? and if yes it is going to work on English dataset what exactly should i do ,? like in the "Replace the phoneset and pitchset in text/symbols.py with your own set" what would be the case here if using English ? Also "Provide the path to the data in config.json" is clear but what what would be the format ??

Thanks in advance!

@WelkinYang
Copy link
Owner

Thank you for your interest in our work.

First of all, our method is not limited to particular languages, especially I think in the task of speech synthesis and singing voice synthesis, synthesizing Chinese is harder than synthesizing English (Chinses is a tonal language), so if it works well on the Chinese dataset, then it will definitely work normally on English dataset.

Second, we did not experiment on English data because we did not find a suitable English singing dataset as a singing teacher And although we used Chinese internal datasets as singing teachers, there are open source Chinese singing datasets of the same size that can be used to reproduce our work, such as opencpop (https://arxiv.org/abs/2201.07429).

Finally, experimenting with English dataset on this repo is simple, just prepare the phoneme set of English data to place into text/symbols.py, English phoneme set can be easily obtained from the set provided by cmudict (https://github.com/mozilla/TTS/blob/e9e07844b77a43fb0864354791fb4cf72ffded11/TTS/tts/utils/text/cmudict.py) or use just use characters,(https://github.com/mozilla/TTS/blob/e9e07844b77a43fb0864354791fb4cf72ffded11/TTS/tts/utils/text/symbols.py) and the data format is no different from the format of labels in testdata (only the phoneme set is different).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants