FIX: lm pre-training #58

madisonmay · 2018-07-20T22:44:28Z

Resolves #56

benleetownsend · 2018-07-23T09:28:51Z

finetune/base.py

@@ -489,6 +493,8 @@ def _build_model(self, n_updates_total, target_dim, train=True):
        elif not self.is_trained:
            self._load_base_model()

+        guarantee_initialized_variables(self.sess)


Can you explain the thought process behind this? Can't we explicitly initialize the variables under model/clf instead? Or add some logs for the variables that this call initializes. I can foresee a future where we accidentally mess up the base model and this could hide problems?

Specify model/clf as scope

benleetownsend · 2018-07-23T09:34:17Z

finetune/base.py

        self.do_dropout = tf.placeholder(tf.float32)  # 1 for do dropout and 0 to not do dropout
        if self.target_type == SEQUENCE_LABELING:
            self.Y = tf.placeholder(tf.int32, [None, self.config.max_length])  # classification targets
        else:
-            self.Y = tf.placeholder(tf.float32, [None, self.target_dim])  # classification targets
+            self.Y = tf.stop_gradient(tf.placeholder(tf.float32, [None, self.target_dim or 1]))  # classification targets


Gradients already do not flow into placeholders The tf.stop_gradient should be on the loss function to protect against changes to self.Y

benleetownsend · 2018-07-23T09:35:02Z

finetune/utils.py

@@ -95,6 +95,14 @@ def np_init(w):
    return partial(_np_init, w=w)


+def guarantee_initialized_variables(sess):


I really like this.

madisonmay · 2018-07-23T12:06:57Z

finetune/utils.py

@@ -95,6 +95,14 @@ def np_init(w):
    return partial(_np_init, w=w)


+def guarantee_initialized_variables(sess):
+    global_vars = tf.global_variables()


Credit stackoverflow

benleetownsend

👍

FIX: lm pre-training

bb99f5c

benleetownsend reviewed Jul 23, 2018

View reviewed changes

madisonmay commented Jul 23, 2018

View reviewed changes

madisonmay added 2 commits July 23, 2018 08:49

FIX: stop_gradient location

6ad72eb

REFACTOR: guarantee_initialized_variables

ae3823d

madisonmay force-pushed the madison/lm-pretraining branch from 0a7f620 to ae3823d Compare July 23, 2018 16:14

benleetownsend approved these changes Jul 23, 2018

View reviewed changes

FIX: uninitialized adam vars

0988082

madisonmay force-pushed the madison/lm-pretraining branch from 0ab0fa0 to 0988082 Compare July 23, 2018 17:05

madisonmay merged commit 21385a8 into master Jul 23, 2018

madisonmay deleted the madison/lm-pretraining branch July 24, 2018 17:17

madisonmay mentioned this pull request Aug 6, 2018

Support for pre-training the language model #56

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIX: lm pre-training #58

FIX: lm pre-training #58

madisonmay commented Jul 20, 2018 •

edited

Loading

benleetownsend Jul 23, 2018

madisonmay Jul 23, 2018

benleetownsend Jul 23, 2018

benleetownsend Jul 23, 2018

madisonmay Jul 23, 2018

benleetownsend left a comment

		@@ -95,6 +95,14 @@ def np_init(w):
		return partial(_np_init, w=w)


		def guarantee_initialized_variables(sess):

FIX: lm pre-training #58

FIX: lm pre-training #58

Conversation

madisonmay commented Jul 20, 2018 • edited Loading

benleetownsend Jul 23, 2018

Choose a reason for hiding this comment

madisonmay Jul 23, 2018

Choose a reason for hiding this comment

benleetownsend Jul 23, 2018

Choose a reason for hiding this comment

benleetownsend Jul 23, 2018

Choose a reason for hiding this comment

madisonmay Jul 23, 2018

Choose a reason for hiding this comment

benleetownsend left a comment

Choose a reason for hiding this comment

madisonmay commented Jul 20, 2018 •

edited

Loading