is_training=False #6

wh0 · 2017-02-27T19:52:37Z

Thanks for releasing your pretrained weights checkpoint. I'm trying to use them to run the decompiler. When I set is_training=False in the call to tflib.network.im2latex_cnn, it outputs the wrong answer. Any advice on how to run it not in training mode?

For example, for 3e679e114e.png, I get

#START \begin{array} { l } & { \scriptscriptstyle } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } & { \bf } [truncated]

(originally was #START f _ { ( x , x _ { 0 } ) } ^ { c , L } \equiv f ^ { c } ( x , x _ { 0 } ) #END)

The text was updated successfully, but these errors were encountered:

ritheshkumar95 · 2017-03-06T10:55:52Z

Hey,

This issue seems peculiar to me.

Are you saying is_training=True returns the correct answer but is_training=False returns the wrong one?

wh0 · 2017-03-06T22:39:35Z

Are you saying is_training=True returns the correct answer but is_training=False returns the wrong one?

Yes

wh0 · 2017-03-07T01:55:24Z

I've found that the batch_norm wasn't working right in the training that went into the released weight files. The moving_mean and moving_variance tensors have initial values. We're going to try to update the code to carry out the batch_norm update operations during training. cc @mitar

ritheshkumar95 · 2017-03-07T02:04:15Z

Right,

Sorry. I just noticed in tflib.contrib that batch_norm by default doesn't update moving means / averages. I thought it does by default.

Check this https://www.tensorflow.org/api_docs/python/tf/contrib/layers/batch_norm

mingchen62 · 2017-10-20T18:35:24Z

I encountered the same issue with is_training=False. Not sure if @wh0 get a chance to update the code.

wh0 · 2017-10-20T19:41:15Z

@mingchen62 I ended up not altering the training process at all.

mingchen62 · 2017-10-20T20:52:36Z

All right. According to document, it looks like we will need to update "attention.py"
adding something like this:

update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS)
with tf.control_dependencies(update_ops):
train_step =....

I will give it a try

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

is_training=False #6

is_training=False #6

wh0 commented Feb 27, 2017

ritheshkumar95 commented Mar 6, 2017

wh0 commented Mar 6, 2017

wh0 commented Mar 7, 2017

ritheshkumar95 commented Mar 7, 2017

mingchen62 commented Oct 20, 2017

wh0 commented Oct 20, 2017

mingchen62 commented Oct 20, 2017

is_training=False #6

is_training=False #6

Comments

wh0 commented Feb 27, 2017

ritheshkumar95 commented Mar 6, 2017

wh0 commented Mar 6, 2017

wh0 commented Mar 7, 2017

ritheshkumar95 commented Mar 7, 2017

mingchen62 commented Oct 20, 2017

wh0 commented Oct 20, 2017

mingchen62 commented Oct 20, 2017