Questions about dataset and training processs #2

980202006 · 2023-04-15T09:21:35Z

Hello, this project is amazing, I want to reproduce your research and improve on it, can you describe in detail the data set used etc.? Or can you provide the training code? Thanks

980202006 · 2023-04-15T10:10:26Z

Can you provide the method for constructing prompts during training, such as what types of prompts are included, such as speakers?

gandolfxu · 2023-04-17T04:40:06Z

Good job. Hope to share training script too!!!

gkucsko · 2023-04-17T17:40:05Z

Happy to hear it's useful. The dataset used is a proprietary dataset, so unfortunately we won't be able to share the dataset itself, however we hope that the pretrained models we provide encapsulate all the useful parts as it relates to TTS.
As for the training code, we might try to release it in the future, however for now it is too intertwined with other internal products. There are also fantastic other repos that can give you an idea such as https://github.com/lucidrains/audiolm-pytorch

v-yunbin · 2023-04-18T06:10:10Z

It means that we can't finetune on pretrained models and we can't clone on pretrained models.

gkucsko closed this as completed Apr 17, 2023

This was referenced May 10, 2023

about dataset and model #277

Closed

Datas accountability #293

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions about dataset and training processs #2

Questions about dataset and training processs #2

980202006 commented Apr 15, 2023

980202006 commented Apr 15, 2023

gandolfxu commented Apr 17, 2023

gkucsko commented Apr 17, 2023

v-yunbin commented Apr 18, 2023

Questions about dataset and training processs #2

Questions about dataset and training processs #2

Comments

980202006 commented Apr 15, 2023

980202006 commented Apr 15, 2023

gandolfxu commented Apr 17, 2023

gkucsko commented Apr 17, 2023

v-yunbin commented Apr 18, 2023