Dataset licence #31
Comments
The dataset consists of thousands of audiobooks and podcasts that were scraped from the web. Many are copyrighted, which is why I am not releasing the dataset. If you know or believe that the laws in your jurisdiction treat ML models as extensions of their datasets, then you should consider Tortoise license-encumbered and should not use it for commercial purposes.
Thank you for the clarification. Just one more thing: I am asking because I would like to use the train_ voices.
I just want to be sure that they are not an 'exact' 1:1 copy of the original voices. The generalization of the model might be fine according to the law, but I wouldn't be so sure about an exact voice match.
This is a good point. You should not use any of the pre-packaged voices for business purposes for the time being. I will re-open this and investigate which voices have copyrights attached to them and remove them.
FYI: LibriTTS and HiFiTTS datasets were used to train Tortoise. If you are looking for license-free voices that will work very well with this program, use one of those.
Excellent, that is very valuable information. There should be plenty of public domain options; it will just take a bit of trial and error.
Just as a (probably) dumb (related) question: is there any reason to favour those datasets over LibriSpeech or some other dataset based on LibriVox (maybe a public domain one, since LibriSpeech is not exactly public domain)?
Not a dumb question; this is something that took me some pain to figure out. ASR-focused datasets are often poor for TTS because they are missing punctuation and have bad splitting (e.g. not split on sentences). These are both important cues for a TTS system. Both of these apply to LibriSpeech. I believe LibriSpeech intersects with LibriTTS, so the model should work equally well with voices from either dataset.
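For anyone vetting their own transcripts along these lines, here is a rough sketch of a check for the two cues mentioned above (punctuation and sentence-level splitting). This is not how Tortoise's training data was filtered; the function names and the choice of punctuation marks are assumptions made up for illustration.

```python
# Hypothetical helper names; a heuristic sketch, not part of Tortoise.

# Marks that carry prosody cues. The apostrophe is deliberately excluded,
# since ASR-normalized text often keeps it (e.g. "DON'T").
PROSODY_MARKS = set(".,;:!?")
SENTENCE_END = (".", "!", "?")


def has_punctuation(text: str) -> bool:
    """Return True if the transcript contains at least one prosody mark."""
    return any(ch in PROSODY_MARKS for ch in text)


def ends_on_sentence_boundary(text: str) -> bool:
    """Return True if the clip ends with sentence-final punctuation,
    ignoring trailing whitespace, quotes, and closing parentheses."""
    stripped = text.strip().rstrip("\"'”’)")
    return stripped.endswith(SENTENCE_END)


def looks_tts_ready(text: str) -> bool:
    """Heuristic: the transcript is punctuated and split on a sentence end."""
    return has_punctuation(text) and ends_on_sentence_boundary(text)


if __name__ == "__main__":
    samples = [
        "HE HOPED THERE WOULD BE STEW FOR DINNER",   # ASR-style: no punctuation
        "He hoped there would be stew for dinner.",  # TTS-style: punctuated
        "and then, without a word",                  # punctuated, but cut mid-sentence
    ]
    for s in samples:
        print(looks_tts_ready(s), "|", s)
```

A check like this only flags obvious cases; it says nothing about the licensing status of a recording.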
Hello,
thank you for making this amazing TTS model public. It is by far the best-quality TTS model I have tried so far.
I would like to ask you about the licensing of the dataset you used for training. Am I guessing correctly that you used your own selection of LibriVox recordings?
I'm asking just to be sure that I can use the outputs in a commercial setting, since all LibriVox recordings are in the public domain.