
Meaning behind --update-mean-var --train-beta-gamma #40

Closed

Tamme opened this issue Feb 9, 2018 · 3 comments

Comments

@Tamme

Tamme commented Feb 9, 2018

Hi.

I haven't encountered this kind of variable updating in other projects. Does it originate from PSPNet, is this really how momentum is supposed to be used, or is it something else entirely?

Thanks,
Tamme

@hellochick
Owner

hellochick commented Feb 10, 2018

Hey @Tamme, let me explain the batch normalization layer first. There are four variables in a batch normalization layer: moving_mean, moving_variance, gamma, and beta. moving_mean and moving_variance are not trainable variables, so we need to update them using the update ops collected in tf.GraphKeys.UPDATE_OPS; you can take a look at the TensorFlow docs. So I use the flag --update-mean-var to decide whether to update the mean and variance (since updating them works better with a large batch size, we can freeze these two variables for better results when training with a small batch).
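
For reference, here is a minimal TF 1.x-style sketch of how two such flags could be wired into a training graph. The flag names mirror the CLI options discussed in this issue, but the toy model and the variable filtering are illustrative assumptions, not code copied from this repo's train.py:

```python
import tensorflow as tf  # TF 1.x API, matching the era of this repo

# Hypothetical stand-ins for the --update-mean-var / --train-beta-gamma flags.
update_mean_var = True
train_beta_gamma = True

# Toy conv + BN graph; training=True makes BN normalize with batch
# statistics and register the moving_mean / moving_variance update ops
# under tf.GraphKeys.UPDATE_OPS.
images = tf.placeholder(tf.float32, [None, 32, 32, 3])
labels = tf.placeholder(tf.float32, [None, 32, 32, 3])
net = tf.layers.conv2d(images, 3, 3, padding='same')
net = tf.layers.batch_normalization(net, training=True)
loss = tf.reduce_mean(tf.square(net - labels))

# When --train-beta-gamma is off, keep BN's beta/gamma out of the
# optimizer's var_list so only the remaining weights are trained.
all_trainable = tf.trainable_variables()
if train_beta_gamma:
    trainable = all_trainable
else:
    trainable = [v for v in all_trainable
                 if 'beta' not in v.name and 'gamma' not in v.name]

opt = tf.train.MomentumOptimizer(learning_rate=1e-3, momentum=0.9)
train_op = opt.minimize(loss, var_list=trainable)

# Only run the moving_mean / moving_variance update ops when
# --update-mean-var is set; otherwise those statistics stay frozen.
if update_mean_var:
    update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS)
    train_op = tf.group(train_op, *update_ops)
```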

@manuel-88

Hey, when I run the training without --update-mean-var the evaluation results are almost zero. Do you know why, @hellochick?

@hellochick
Owner

@manuel-88, if you never update the mean and variance, the batch normalization layers effectively do nothing, since the moving statistics stay at their initial values. Maybe that is the problem.
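
To illustrate why evaluation collapses (a hypothetical numpy sketch, not code from this repo): if the moving statistics are never updated during training, inference-mode BN normalizes with their initial values (mean 0, variance 1), so activations pass through essentially unchanged and the rest of the network sees statistics it was never trained with:

```python
import numpy as np

# Inference-mode BN: (x - moving_mean) / sqrt(moving_var + eps) * gamma + beta
def bn_inference(x, moving_mean, moving_var, gamma=1.0, beta=0.0, eps=1e-5):
    return (x - moving_mean) / np.sqrt(moving_var + eps) * gamma + beta

x = np.random.normal(loc=5.0, scale=3.0, size=1000)  # actual activation stats

# Moving stats left at their initial values (never updated during training):
frozen = bn_inference(x, moving_mean=0.0, moving_var=1.0)
# Moving stats tracked during training:
updated = bn_inference(x, moving_mean=x.mean(), moving_var=x.var())

print(frozen.mean(), frozen.std())    # ~5, ~3 -- not normalized at all
print(updated.mean(), updated.std())  # ~0, ~1 -- what the network expects
```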
