Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Facing dimension mismatch #176

Closed
nayanjha16 opened this issue Nov 16, 2022 · 4 comments
Closed

Facing dimension mismatch #176

nayanjha16 opened this issue Nov 16, 2022 · 4 comments

Comments

@nayanjha16
Copy link

I am facing a dimension mismatch issue in pitch embedding addition to the encoder output . When I am trying to train the Fastspeech2 model on Hindi data. The Screenshot of the issue is attached as under.

Note :- I have made the necessary changes in the script to adapt it for Hindi dataset
Error_screen

The same code runs for LJSpeech dataset but fails for Hindi Dataset.
Kindly help me resolve the issue !

@hadarishav
Copy link

@nayanjha16 were you able to get it running? I am using the Indic TTS hindi dataset and facing the same issue.

@nayanjha16
Copy link
Author

@hadarishav I was able to fix this issue by fixing the alignments and running the preprocessing step yet again . It works for me when the group size is set to 1.

@hadarishav
Copy link

@nayanjha16 how did you fix the alignments? by group size you mean batch size for training?

@nayanjha16
Copy link
Author

nayanjha16 commented Feb 23, 2023

@nayanjha16 how did you fix the alignments? by group size you mean batch size for training?

@hadarishav My apologies for replying late, there are two parameters one is the batch size and the other being the group size . I had set the group_size to 1.

Attaching the code snippet for your reference from main.py

batch_size = train_config["optimizer"]["batch_size"]
**group_size = 1  # Set this larger than 1 to enable sorting in Dataset**
assert batch_size * group_size < len(dataset)
loader = DataLoader(
    dataset,
    batch_size=batch_size * group_size,
    shuffle=True,
    collate_fn=dataset.collate_fn,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants