Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix utils/split_scp.pl breakage #3308

Merged
merged 3 commits into from
May 10, 2019
Merged

Conversation

kkm000
Copy link
Contributor

@kkm000 kkm000 commented May 9, 2019

Close #3306

@kkm000 kkm000 changed the title Fix utils/split_scp.pl breakage [WIP] Fix utils/split_scp.pl breakage May 9, 2019
@kkm000
Copy link
Contributor Author

kkm000 commented May 9, 2019

Hold on, did not cleanly merge

@kkm000 kkm000 changed the title [WIP] Fix utils/split_scp.pl breakage Fix utils/split_scp.pl breakage May 9, 2019
@kkm000
Copy link
Contributor Author

kkm000 commented May 9, 2019

This is ready to go now. I also cleaned up error messages, and fixed the -j M N argument check with and without one-based.

@danpovey
Copy link
Contributor

danpovey commented May 9, 2019

Thanks! Note: this is a fixed version of #3285 which was reverted in #3307.

@kkm000
Copy link
Contributor Author

kkm000 commented May 9, 2019

Yes, I noticed, I clashed with the revert :)

@danpovey
Copy link
Contributor

danpovey commented May 9, 2019

You sure, right?

@kkm000
Copy link
Contributor Author

kkm000 commented May 9, 2019

I just used this in a practical script, and updated my scripts to --one-based between two runs, both went out ok; I have thorough error checks in my scripts to make sure data counts all match after processing, and they do. I do not have a practical utt2spk-based case to test at the moment, though (although the next step, merging alignments and lattices, that we once talked on the list, likely will use it).

@kkm000
Copy link
Contributor Author

kkm000 commented May 9, 2019

You sure, right?

So pretty much yes, and I am likely to use it more a few times today. I'll be merging a complex dataset, with lattices coming from separate alignments, so if anything comes up, I'll notice.

@danpovey
Copy link
Contributor

danpovey commented May 9, 2019 via email

@kkm000
Copy link
Contributor Author

kkm000 commented May 9, 2019

@danpovey:

Remind me to-morrow, I'll merge then.

Yup. You did the right thing. :) There is more weirdness, fixing now. I'll push when all modes work, and then let you know.

@kkm000 kkm000 changed the title Fix utils/split_scp.pl breakage [WIP] Fix utils/split_scp.pl breakage May 9, 2019
@kkm000 kkm000 changed the title [WIP] Fix utils/split_scp.pl breakage Fix utils/split_scp.pl breakage May 10, 2019
@kkm000
Copy link
Contributor Author

kkm000 commented May 10, 2019

@danpovey, this flies! (and with the updated combine_ali_dirs, which comes tomorrow).

@danpovey
Copy link
Contributor

OK. @kkm000, FYI I usually prefer to use filter_scp.pl using the appropriate utt2spk file, instead of split_scp.pl. It probably doesn't make much difference though.

@kkm000
Copy link
Contributor Author

kkm000 commented May 10, 2019

@dpovey, so can we merge it?

@danpovey danpovey merged commit ba165c8 into kaldi-asr:master May 10, 2019
@kkm000 kkm000 deleted the utils_split_scp.pl branch May 11, 2019 14:50
danpovey pushed a commit to danpovey/kaldi that referenced this pull request Jun 19, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

utils/split_scp.pl broken
2 participants