Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about dataset and training processs #2

Closed
980202006 opened this issue Apr 15, 2023 · 4 comments
Closed

Questions about dataset and training processs #2

980202006 opened this issue Apr 15, 2023 · 4 comments

Comments

@980202006
Copy link

Hello, this project is amazing, I want to reproduce your research and improve on it, can you describe in detail the data set used etc.? Or can you provide the training code? Thanks

@980202006
Copy link
Author

Can you provide the method for constructing prompts during training, such as what types of prompts are included, such as speakers?

@gandolfxu
Copy link

Good job. Hope to share training script too!!!

@gkucsko
Copy link
Contributor

gkucsko commented Apr 17, 2023

Happy to hear it's useful. The dataset used is a proprietary dataset, so unfortunately we won't be able to share the dataset itself, however we hope that the pretrained models we provide encapsulate all the useful parts as it relates to TTS.
As for the training code, we might try to release it in the future, however for now it is too intertwined with other internal products. There are also fantastic other repos that can give you an idea such as https://github.com/lucidrains/audiolm-pytorch

@gkucsko gkucsko closed this as completed Apr 17, 2023
@v-yunbin
Copy link

It means that we can't finetune on pretrained models and we can't clone on pretrained models.
011B5553

This was referenced May 10, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants