Use BucketingSampler for dev and test data #73
Merged: 2 commits into k2-fsa:master on Oct 9, 2021

Conversation

@pzelasko (Collaborator) commented on Oct 9, 2021:

As mentioned in #71 -- I simply hardcoded BucketingSampler in place of SingleCutSampler as I don't see a reason not to use it.

Edit: I only checked that it works on yesno.
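
For context, a minimal sketch of a test dataloader with BucketingSampler substituted for SingleCutSampler, based on the lhotse API as of this PR. The helper name, num_buckets value, and num_workers are illustrative assumptions; cuts_test and max_duration mirror the diff reviewed below.

```python
# Minimal sketch (not the exact icefall code): build a test DataLoader that
# batches cuts with BucketingSampler instead of SingleCutSampler.
import torch
from lhotse.dataset import BucketingSampler, K2SpeechRecognitionDataset


def make_test_dataloader(cuts_test, max_duration: float = 200.0):
    test_dataset = K2SpeechRecognitionDataset()  # yields padded feature batches
    sampler = BucketingSampler(
        cuts_test,
        max_duration=max_duration,  # cap on total speech seconds per batch
        shuffle=False,              # deterministic order for dev/test
        num_buckets=30,             # illustrative; groups cuts of similar duration
    )
    # lhotse samplers emit whole batches, so batch_size must be None here.
    return torch.utils.data.DataLoader(
        test_dataset, sampler=sampler, batch_size=None, num_workers=2
    )
```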

@csukuangfj (Collaborator):

+2

@pzelasko (Collaborator, Author) commented on Oct 9, 2021:

Do I need to run other tools besides black to make the CI happy?

@csukuangfj (Collaborator):

> Do I need to run other tools besides black to make the CI happy?

Yes, you have to run flake8.

@csukuangfj (Collaborator):

Please see icefall/.github/workflows/style_check.yml.

@pzelasko merged commit d54828e into k2-fsa:master on Oct 9, 2021.
The review comments below refer to this change:

```diff
-sampler = SingleCutSampler(
-    cuts_test, max_duration=self.args.max_duration
+sampler = BucketingSampler(
+    cuts_test, max_duration=self.args.max_duration, shuffle=False
```
Collaborator:

Guys, we need to give this "shuffle" argument some care, for valid and test.
I observed, after merging master code, very bad valid probs for the attention part of the model (like, 0.5 instead of 0.1).
After a lot of experimentation, I found this 'shuffle' arg to be responsible for at least the majority of this effect.
It seems that what happens is, with shuffle=False, within each bucket the durations vary by much less than with shuffle=True, hence there is less padding. At least that is what seems to happen in my setup. It's possible that the attention model is learning to rely on the very low energy at the end of the utterance, for termination; or something like that. We need to do some experiments with this; we should test whether the shuffle={True,False} arg makes a difference for testing as well, especially for the attention decoder.
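
For illustration only, one rough way to check the padding effect described above is to measure the padded fraction per batch under both settings. This sketch assumes the lhotse BucketingSampler of that era, where iterating the sampler yields CutSet batches; cuts and the default thresholds are placeholders.

```python
# Rough sketch: estimate what fraction of each padded batch is padding, for a
# given shuffle setting. Assumes the sampler yields CutSet batches (lhotse ~2021).
from lhotse.dataset import BucketingSampler


def padding_fraction(cuts, max_duration: float = 200.0, shuffle: bool = False) -> float:
    sampler = BucketingSampler(cuts, max_duration=max_duration, shuffle=shuffle)
    padded, total = 0.0, 0.0
    for batch in sampler:
        durations = [cut.duration for cut in batch]
        longest = max(durations)
        padded += sum(longest - d for d in durations)  # seconds of padding
        total += longest * len(durations)              # seconds in the padded tensor
    return padded / total


# Compare e.g. padding_fraction(cuts_valid, shuffle=True) vs. shuffle=False.
```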

@pzelasko (Collaborator, Author):

If that makes sense we can randomly choose to pad from the left or from the right during the training to break the pattern.
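
A minimal sketch of that idea, assuming padded features of shape (num_frames, num_features); the function name and the use of torch.nn.functional.pad are illustrative assumptions, not existing icefall code.

```python
# Illustrative sketch: split the required padding randomly between the start and
# end of each utterance, so the model cannot rely on trailing low-energy frames.
import random

import torch
import torch.nn.functional as F


def pad_randomly(feats: torch.Tensor, target_len: int, pad_value: float = 0.0) -> torch.Tensor:
    """feats: (num_frames, num_features) -> (target_len, num_features)."""
    num_pad = target_len - feats.size(0)
    if num_pad <= 0:
        return feats
    left = random.randint(0, num_pad)
    right = num_pad - left
    # F.pad pads from the last dim backwards: (feat_left, feat_right, time_left, time_right).
    return F.pad(feats, (0, 0, left, right), value=pad_value)
```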

Collaborator:

Sure, makes sense I think.

Collaborator:

See #97 and #98.
It is a bug, not related to padding.
