Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About question of code and synthesis #6

Closed
Dyongh613 opened this issue May 9, 2022 · 8 comments
Closed

About question of code and synthesis #6

Dyongh613 opened this issue May 9, 2022 · 8 comments

Comments

@Dyongh613
Copy link

HI@keonlee9420, Thank you for your suggestions these days, I successfully integrated model PortaSpeech on the basis of this model. These are some questions to ask you! Thank you!

  1. In the DiffGAN-TTS, the return of get_mask from length is mask. And the return of get_mask from length in PortaSpeech is ~mask. I want to know the difference between them,
  2. In DiffGAN-TTS, about def diffuse_trace(self, x_start, mask). I want to know how do the ~ aims to do in def diffuse_trace. In my integrated model, I set the return of get_mask from length is ~mask.
    If I delete the ~ in diffuse_trace, the synthesis mel is error and the voice likes to the voice of water. While If I preserve the ~ in diffuse_trace, the mel is also error and the voice likes to electric voice.
    Thank you very much!
  • Deng Yan
  • 2022.5.9
  • GuangXi University
@keonlee9420
Copy link
Owner

Hi @qw1260497397 , thanks for your attention. It sounds interesting integrating PortaSpeech into DiffGAN-TTS, but don't have idea on it. Could you please elaborate it more? So you add GAN training for PortaSpeech? What about diffusion part?
Besides, here is my answer to your questions:

  1. ~ means 'not' in boolean so if you add it in front of mask, then the mask will be toggled. for example, if a mask value is [True, True, False], then the value of '~mask' is [False, False, True]
  2. so the meaning of ~ in diffuse_trace is the same. the error that you report may be raised from the mismatch of masking value usage between PortaSpeech and DiffGAN-TTS, where you have to sync to have the same masking scheme.

@Dyongh613
Copy link
Author

Dyongh613 commented May 11, 2022 via email

@Dyongh613
Copy link
Author

Dyongh613 commented May 12, 2022 via email

@Dyongh613
Copy link
Author

Dyongh613 commented May 13, 2022 via email

@Dyongh613
Copy link
Author

Dyongh613 commented May 16, 2022 via email

@keonlee9420
Copy link
Owner

Closed due to inactivity.

@Dyongh613
Copy link
Author

Dyongh613 commented Oct 11, 2022 via email

@Dyongh613
Copy link
Author

Dyongh613 commented Oct 11, 2022 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants