Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

librispeech data validation issue #2566

Closed
shadowoom opened this issue Jul 23, 2018 · 8 comments
Closed

librispeech data validation issue #2566

shadowoom opened this issue Jul 23, 2018 · 8 comments

Comments

@shadowoom
Copy link

capture

I was running the librispeech s5 example and followed the steps in run.sh.
Has anyone encountered similar issue? Can tell me how to resolve this?

@danpovey
Copy link
Contributor

I can't see a reference to make_fbank.sh in any of the scripts in the librispeech s5 directory. Are you using an up-to-date version of Kaldi (if not, how recent?) and are you using Kaldi from the official source? Have you changed the scripts, perhaps?

@shadowoom
Copy link
Author

Thank you for the reply.

I directly cloned from this repo, https://github.com/tramphero/kaldi. The Alibaba_MIT_Speech_DFSMN patch which was built on the Kaldi speech recognition toolkit with commit "04b1f7d6658bc035df93d53cb424edc127fab819", was applied.

The run_fsmn_ivector script in librispeech/s5/local/nnet/ folder can be found here:
https://github.com/tramphero/kaldi/blob/master/egs/librispeech/s5/local/nnet/run_fsmn_ivector.sh

I checked the wav.scp file and reco2dur file in train_960_cleaned folder, the length were already differed before they were copied into the data_fbank folder.

@danpovey
Copy link
Contributor

danpovey commented Jul 23, 2018 via email

@shadowoom
Copy link
Author

Okay, thanks, I will try to merge with master and see how.

@yangxueruivs
Copy link
Contributor

I also met this problem before. I just ignore the data validation part of wav.scp and reco2dur(last one) in utils/validation_data_dir.sh and then it works correctly.

@shadowoom
Copy link
Author

shadowoom commented Jul 23, 2018 via email

@AlexPeng19
Copy link

@shadowoom hi, i trained the dfsmn model too and encountered the same issue, did you resolved it now by skipping the validation line? @yangxueruivs could you provide some detail for skip, such as which file, line numbers. on the other hand, did you exported the trained model? i mean how did you used it.

looking forward your response. thanks in advance.

@danpovey
Copy link
Contributor

danpovey commented Sep 3, 2018 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants