Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SpeechT5 Pretrain ERROR #28

Closed
benyang0506 opened this issue Dec 28, 2022 · 1 comment
Closed

SpeechT5 Pretrain ERROR #28

benyang0506 opened this issue Dec 28, 2022 · 1 comment

Comments

@benyang0506
Copy link

when pretrained 95400 num_updates,

File "/SpeechT5/SpeechT5/SpeechT5/speecht5/data/multitask_dataset.py", line 58, in getitem
sample = self.datasets[dataset_idx][sample_idx]
File "/SpeechT5/SpeechT5/SpeechT5/speecht5/data/text_dataset.py", line 218, in getitem
assert (source[1:-1] >= 1).all()
IndexError: slice() cannot be applied to a 0-dim tensor

the reason comes from text data preparation?

@Ajyy
Copy link
Collaborator

Ajyy commented Dec 28, 2022

Hi,

This error may be related to text data preparation. It means the length of the text sentence is 0. I suggest checking the text data. If there are some empty lines, you should delete them.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants