[src] batch renormalization finished #65

Open

wants to merge 30 commits into base: svd_draft
Conversation

@GaofengCheng commented Jan 5, 2019

I have finished the draft of batch-renorm.
I tested it on Switchboard with a 2-layer TDNN-F (without dropout), 3 epochs, and minibatch size 64.
A: batch-renorm r:1.0 d:0.0
B: batch-renorm r:1.0 d:0.0 -> r:1.2 d:0.4 (at iter 8) -> r:1.6 d:0.8 (at iter 45)

#                                  A         B
# WER on train_dev(tg)         18.37     18.47
# WER on train_dev(fg)         16.69     16.79
# WER on eval2000(tg)           20.7      20.7
# WER on eval2000(fg)           18.8      18.6
# WER on rt03(tg)               25.7      25.6
# WER on rt03(fg)               22.2      22.2
# Final train prob            -0.121    -0.113
# Final valid prob            -0.130    -0.128
# Final train prob (xent)     -2.545    -2.525
# Final valid prob (xent)    -2.4666   -2.4552
# Num-parameters            20992380  20992380

Some notes:

  1. I have verified that batch-renorm with r:1.0 d:0.0 performs almost the same as the previous batch-norm component.
  2. For batch-renorm, I picked the moving averages from one of the parallel jobs instead of averaging them across jobs.
  3. For debugging, I kept the sum_mean_/uvar_ of batch-renorm, but they are not needed for batch-renorm; we can remove them later.
  4. I have checked my backprop derivatives and believe they are correct. Please double-check them against the original equations from the authors (I have checked them once, but since this is a fundamental component, I think it is worth doing twice); a sketch of the forward pass is given after this list.
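
To make the r and d parameters above concrete, here is a minimal sketch of the per-dimension batch-renorm forward pass as described in the original Batch Renormalization paper (Ioffe, 2017). It is illustrative only; the function and variable names are not from this PR. With r:1.0 and d:0.0 the correction terms are clipped to r = 1 and d = 0, so the computation reduces to plain batch-norm, which is consistent with note 1.

// Illustrative sketch (not code from this PR): the per-dimension batch-renorm
// forward pass following Ioffe (2017).  r and d are clipped to
// [1/r_max, r_max] and [-d_max, d_max] and are treated as constants in backprop.
#include <algorithm>
#include <cmath>
#include <vector>

void BatchRenormForward(std::vector<float> *x,   // one dimension of a minibatch
                        float moving_mean,       // running mean
                        float moving_stddev,     // running standard deviation
                        float r_max, float d_max,
                        float epsilon = 1.0e-03f) {
  const int m = x->size();
  float mean = 0.0f, var = 0.0f;
  for (float v : *x) mean += v;
  mean /= m;
  for (float v : *x) var += (v - mean) * (v - mean);
  float stddev = std::sqrt(var / m + epsilon);
  // Correction terms; with r_max = 1.0 and d_max = 0.0 they are fixed to
  // r = 1, d = 0 and the result is ordinary batch-norm.
  float r = std::min(r_max, std::max(1.0f / r_max, stddev / moving_stddev));
  float d = std::min(d_max, std::max(-d_max, (mean - moving_mean) / moving_stddev));
  for (float &v : *x)
    v = (v - mean) / stddev * r + d;
}

Config B above simply relaxes these clamps during training: r_max goes 1.0 -> 1.2 -> 1.6 and d_max goes 0.0 -> 0.4 -> 0.8 at iterations 8 and 45.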

keli78 and others added 26 commits December 5, 2018 23:17
…r#2907)

[scripts] Fix bug related to multi-task in train_raw_rnn.py. Thx:tessfu2001@gmail.com
…aldi-asr#2947)

note: if this breaks someone's build we'll have to debug it then.
struct Memo {
  // number of frames (after any reshaping).
  int32 num_frames;
  // 'sum_sumsq_scale' is of dimension 5 by block_dim_:
Owner commented:
You need to keep the documentation up to date!
But I may rewrite parts of this, so it may not end up mattering.
My concern is that the original formulation of BatchNorm does not make sense when minibatch sizes differ and where the stats may differ substantially (e.g. because the language differs).

WriteToken(os, binary, "</BatchRenormComponent>");
}

void BatchRenormComponent::Scale(BaseFloat scale) {
void BatchRenormComponent::Scale_Training(BaseFloat scale) {
Owner commented:
This is against the Google style guide; should be ScaleTraining.
However, I think it would be better to just use the regular Scale() function; you can see how I've done it in my version.
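
For comparison, a minimal sketch of what reusing the regular Scale() could look like, in the style of the existing BatchNormComponent::Scale(). The member names count_, stats_sum_ and stats_sumsq_ are assumptions borrowed from that component, not taken from this PR:

// Hypothetical sketch, assuming the renorm stats live in count_, stats_sum_
// and stats_sumsq_ as in BatchNormComponent: scale them in the regular
// Scale() rather than adding a separate Scale_Training().
void BatchRenormComponent::Scale(BaseFloat scale) {
  count_ *= scale;            // accumulated frame count
  stats_sum_.Scale(scale);    // accumulated per-dimension sums
  stats_sumsq_.Scale(scale);  // accumulated per-dimension sums of squares
}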
