New output in ComputeChainObjfAndDeriv #2924

danpovey · 2018-12-18T23:38:07Z

@hhadian, when you have a chance can you please do the implementation work for the
'numerator_post' thing below? Again, this can go to the 'svd_draft' branch in my personal
repo for now.
I'll need this for both the regular and e2e egs.

diff --git a/src/chain/chain-training.h b/src/chain/chain-training.h
index 6ea70b5ca..63e03c7e3 100644
--- a/src/chain/chain-training.h
+++ b/src/chain/chain-training.h
@@ -99,7 +99,7 @@ struct ChainTrainingOptions {
                            example; you'll want to divide it by 'tot_weight' before
                            displaying it.
    @param [out] l2_term  The l2 regularization term in the objective function, if
-                           the --l2-regularize option is used.  To be added to 'o
+                         the --l2-regularize option is used (else will be set to 0.0).
    @param [out] weight     The weight to normalize the objective function by;
                            equals supervision.weight * supervision.num_sequences *
                            supervision.frames_per_sequence.
@@ -115,6 +115,10 @@ struct ChainTrainingOptions {
                            peak memory use).  xent_output_deriv will be used in
                            the cross-entropy regularization code; it is also
                            used in computing the cross-entropy objective value.
+   @param [out] numerator_post  If non-NULL, then the posterior from the numerator
+                           forward-backward will be written here (note: it won't be
+                           scaled by the supervision weight).  This is intended for
+                           use in the adaptation framework used in "chaina" training.
 */
 void ComputeChainObjfAndDeriv(const ChainTrainingOptions &opts,
                               const DenominatorGraph &den_graph,
@@ -124,7 +128,8 @@ void ComputeChainObjfAndDeriv(const ChainTrainingOptions &opts,
                               BaseFloat *l2_term,
                               BaseFloat *weight,
                               CuMatrixBase<BaseFloat> *nnet_output_deriv,
-                              CuMatrix<BaseFloat> *xent_output_deriv = NULL);
+                              CuMatrix<BaseFloat> *xent_output_deriv = NULL,
+                              Posterior *numerator_post = NULL);

The text was updated successfully, but these errors were encountered:

danpovey · 2018-12-18T23:38:55Z

... and the order of 'numerator_post' should be the same as the order of the rows of 'input'. That should probably be clarified in the documentation.

hhadian · 2018-12-19T22:37:52Z

Will do.

stale · 2020-06-19T07:37:20Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

kkm000 added the in progress Issue has been taken and is being worked on label Mar 31, 2019

stale bot added the stale Stale bot on the loose label Jun 19, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New output in ComputeChainObjfAndDeriv #2924

New output in ComputeChainObjfAndDeriv #2924

danpovey commented Dec 18, 2018

danpovey commented Dec 18, 2018

hhadian commented Dec 19, 2018

stale bot commented Jun 19, 2020

New output in ComputeChainObjfAndDeriv #2924

New output in ComputeChainObjfAndDeriv #2924

Comments

danpovey commented Dec 18, 2018

danpovey commented Dec 18, 2018

hhadian commented Dec 19, 2018

stale bot commented Jun 19, 2020