Don't see GE mentioned in the paper #11

Open
skysbird opened this issue Feb 20, 2024 · 3 comments

@skysbird

class MRTE(nn.Module):

image

paper: https://arxiv.org/abs/2307.07218

reference:
image

@LSimon95
Owner

GE was removed from the current version to test the core MRTE's performance, and I can't find the exact structure of GE. Maybe I will add it to the newer 24k version for comparison.

@skysbird
Author

> GE was removed from the current version to test the core MRTE's performance, and I can't find the exact structure of GE. Maybe I will add it to the newer 24k version for comparison.

I found a description of the timbre encoder in the MegaTTS paper: https://arxiv.org/pdf/2306.03509.pdf

I think this could serve as a reference for you:

image
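
To make the idea concrete, here is a minimal sketch of what such a global encoder could look like. It assumes the timbre encoder is a small stack of 1-D convolutions over the prompt mel-spectrogram whose output is averaged over time into a single vector. The class name `GlobalTimbreEncoder`, the layer count, kernel size, and channel sizes are all hypothetical choices for illustration; they are not taken from this repository or from the papers.

```python
import torch
import torch.nn as nn


class GlobalTimbreEncoder(nn.Module):
    """Hypothetical GE sketch: conv stack followed by a temporal average."""

    def __init__(self, n_mels: int = 80, hidden: int = 256, n_layers: int = 3):
        super().__init__()
        layers = []
        in_ch = n_mels
        for _ in range(n_layers):
            # padding=2 with kernel_size=5 keeps the number of frames unchanged
            layers += [nn.Conv1d(in_ch, hidden, kernel_size=5, padding=2), nn.ReLU()]
            in_ch = hidden
        self.convs = nn.Sequential(*layers)

    def forward(self, mel: torch.Tensor) -> torch.Tensor:
        # mel: (batch, n_mels, frames)
        h = self.convs(mel)    # (batch, hidden, frames)
        return h.mean(dim=-1)  # (batch, hidden): one global vector per prompt


# Usage: prompts of different lengths map to vectors of the same size.
ge = GlobalTimbreEncoder()
vec_short = ge(torch.randn(2, 80, 120))
vec_long = ge(torch.randn(2, 80, 300))
```

Averaging over the frame axis is what makes the output independent of prompt length, so the resulting vector could be broadcast and added to (or concatenated with) frame-level features inside `MRTE`. How it should actually be wired in is an open question here.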

@fighting-zeng

fighting-zeng commented Jun 18, 2024

Does anyone know what the difference is between v1 and v4 of the MEGATTS2 paper on arXiv? I am really confused, because the model structure differs between the two versions. In v4, the prompt conditions of the PLLM use only Zc, whereas v1 uses Hct. Does this mean that timbre information is no longer needed? Additionally, v4 does not mention GE. Does this mean that GE is not important?
v1:
image
image
v4:
image
image
