Minor updates to sequence training and adjusting priors. #1345
Conversation
@@ -307,24 +303,25 @@ while [ $x -lt $num_iters ]; do
  nnet3-am-copy --set-raw-nnet=- $dir/$x.mdl $dir/$[$x+1].mdl || exit 1;

  rm $nnets_list
  [ ! -f $dir/$[$x+1].mdl ] && echo "$0: Did not create $dir/$[$x+1].mdl" && exit 1;
  if [ -f $dir/$[$x-1].mdl ] && $cleanup && \
     [ $[($x-1)%$keep_model_iters] -ne 0 ] && \
I don't think this has very good defaults... $keep_model_iters defaults to 1, which means all models are kept initially.
And I'm concerned that the code is too complicated... users will assume that these keep_model_iters models are kept permanently, not deleted at the end.
I think it would be better to delete the [x-5]'th model (so we have 5 models around for debugging purposes), and have the cleanup stage at the end delete the last 5 models... and just hardcode this 5 in the code; I don't see that anyone would ever want to configure it. Right now it's configurable to a confusing extent.
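The hardcoded scheme suggested here can be sketched as follows. This is a minimal illustration, not the actual script: `$dir` and the `.mdl` names mimic the training directory layout, and the loop stands in for the training iterations.

```shell
#!/bin/sh
# Sketch of the "keep only the last 5 models" cleanup: inside the
# training loop, remove the model from 5 iterations back, so at most
# the 5 most recent models exist at any time.
dir=$(mktemp -d)
for x in $(seq 0 9); do
  touch $dir/$x.mdl                      # stand-in for producing model x
  old=$((x - 5))
  if [ $old -ge 0 ]; then
    rm -f $dir/$old.mdl                  # delete the [x-5]'th model
  fi
done
ls $dir                                  # only 5.mdl .. 9.mdl remain
```

A final cleanup stage would then delete these last 5 models as well once training is known to be good.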
... I'd be OK with having a 'keep_model_iters' flag, defaulting to, say, 100, that would make it save more models [but permanently, not temporarily].
I made the default 100. keep_model_iters now keeps models permanently, and the remove-models stage at the end uses the same shift. This is the same as the code used in train_tdnn.sh.
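The keep_model_iters behavior being discussed can be sketched like this (a simplified illustration, not the script itself): during cleanup, the model from iteration i survives permanently iff i is a multiple of keep_model_iters.

```shell
#!/bin/sh
# Sketch of the keep_model_iters cleanup rule with the default of 100:
# a model is kept permanently iff its iteration number is a multiple
# of keep_model_iters.
keep_model_iters=100
dir=$(mktemp -d)
for x in 98 99 100 101 200; do touch $dir/$x.mdl; done
for x in 98 99 100 101 200; do
  if [ $((x % keep_model_iters)) -ne 0 ]; then
    rm -f $dir/$x.mdl    # not a multiple: eligible for deletion
  fi
done
ls $dir                  # 100.mdl and 200.mdl survive
```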
if [ $stage -le $num_iters ]; then
  steps/nnet3/adjust_priors.sh --egs-type degs \
    --cmd "$cmd $prior_queue_opt" --use-gpu false \
    --num-jobs-compute-prior $num_archives_priors \
Are you sure this doesn't automatically happen on the last iteration anyway, i.e. inside the loop?
Perhaps by being careful with the loop indexes we could make sure it happens in the loop, to avoid code duplication.
Also, I think it would be helpful if the script waited at the very end for any straggling adjust-priors jobs;
otherwise the decoding might start before they are done. The adjust-priors jobs should also touch .error on error, and the script should detect this at the end.
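The wait-and-detect pattern described here can be sketched as follows. The two subshells are stand-ins for background adjust-priors jobs (one deliberately failing); `$dir/.error` follows the naming convention mentioned above, but the rest is illustrative.

```shell
#!/bin/sh
# Sketch of the "touch .error on failure, wait, then check" pattern:
# each background job touches $dir/.error if its command fails; the
# script waits for all of them before deciding whether to proceed.
dir=$(mktemp -d)
( false || touch $dir/.error ) &   # stand-in for a job that fails
( true  || touch $dir/.error ) &   # stand-in for a job that succeeds
wait                               # block until all background jobs finish
if [ -f $dir/.error ]; then
  echo "a background adjust-priors job failed"
fi
```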
I moved the code around and added a wait.