Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VQ-VAE network not released #33

Closed
mlajszczak opened this issue May 8, 2022 · 7 comments
Closed

VQ-VAE network not released #33

mlajszczak opened this issue May 8, 2022 · 7 comments

Comments

@mlajszczak
Copy link

mlajszczak commented May 8, 2022

Hello! After taking a closer look into the repo it seems that you have not released the VQ-VAE model used to generate discrete speech representations that the autoregressive model was trained on. Is this on purpose? You don't plan to do that in the future?

@neonbjb
Copy link
Owner

neonbjb commented May 9, 2022

Hey there,
Yes, this is on purpose. This is an important ingredient to being able to fine-tune this model, and I do not want to make that process easy/effective for abuse reasons. I am not sure if/when I will release it.

If you are interested in the VQVAE model itself, I would be glad to divulge training details for it so you can build your own. It will not be compatible with the Tortoise codes, though (unless you are really lucky. :) )

@neonbjb neonbjb closed this as completed May 9, 2022
@Nikuson123
Copy link

Nikuson123 commented May 9, 2022

I would be glad to know the details of the training. I am interested in exploring this model.

@neonbjb
Copy link
Owner

neonbjb commented May 9, 2022

I would be glad to know the details of the training.

To be clear - do you want to know the details of training a speech VQVAE (which I can release right now), or how all of the models that compose TorToiSe are trained? (which I haven't released yet, but will do so in the future)

@meltingrock
Copy link

meltingrock commented Feb 2, 2023

I would be glad to know the details of the training.

To be clear - do you want to know the details of training a speech VQVAE (which I can release right now), or how all of the models that compose TorToiSe are trained? (which I haven't released yet, but will do so in the future)

@neonbjb James, well done on building TorToise, reading up about you it seems like you have figured things out for yourself. Quite an achievement and you should tell your parents they have ample reason to brag about you (most likely they have no idea what it is you actually do, other than "work on computers" ;-) )

I sounds like you were going to release details on training TorToise end to end but with your employment at OpenAI (congrats!!!!!) are you reluctant to do so? If so I can totally understand, but it would be great if you could tell me if this is the case.

You have given the half the map to the treasure in your research paper, so it is not that you have not been kind enough.

@neonbjb
Copy link
Owner

neonbjb commented Feb 3, 2023

The doc I wrote up (and still havent finished.......... :( ) was what I meant when I said I would 'release training details'.

The code for training Tortoise is, and always has been available over in https://github.com/neonbjb/DL-Art-School

What is missing is a really good guide on how to do everything. Unfortunately I just don't have to the time to put that together, nor support the inevitable questions that will crop up if I do. The best recommendation I have is clone DL-Art-School, hook up a debugger and get to it!

@meltingrock
Copy link

@neonbjb Got it.

I have started working through DL-Art-School, so wish me luck.

All the best with your endeavours and once again well done.

@neonbjb
Copy link
Owner

neonbjb commented Feb 5, 2023

Good luck! Nice to see someone taking initiative! I'm sure you'll learn a lot along the way that'll be useful regardless of the outcome.

entn-at pushed a commit to entn-at/tortoise-tts that referenced this issue Mar 15, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants