Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Shape mismatch error on new dataset #6

Open
tekinek opened this issue Oct 3, 2021 · 1 comment
Open

Shape mismatch error on new dataset #6

tekinek opened this issue Oct 3, 2021 · 1 comment

Comments

@tekinek
Copy link

tekinek commented Oct 3, 2021

Hi, thanks for your work!

The frame rate of my dataset is 22050, and hop size of text2mel model is 256. I have changed hparams.py accordingly, but training results in an expcetion: (preprocessing was fine, anyway)

  File "/home/user/speechlab/FastVocoder-main/model/loss/loss.py", line 23, in forward
    assert est_source_sub_band.size(1) == wav_sub_band.size(1)

I figured out that model inference still uses hop-size of 240. So how to make your code fully compatible with other datasets? it seems that the codes are somehow hardcoded for Biaobei dataset.

@v-nhandt21
Copy link

v-nhandt21 commented Jan 11, 2022

@tekinek Have you solved this, is the preprocessing step affect the training process which use difference sample rate:

image

And the weight which is generated from TasNet work in this case?

Can you share your hparam config for 22050Hz?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants