Features/evaluation step #22
Conversation
…ted to a 3d torch tensor; did this test ever pass?
Related to pull/12#discussion_r129337961
Generator does not use validation data
This corresponds with the original code
Sampling is only necessary in text generation
…is collated to a 3d torch tensor; did this test ever pass?" This reverts commit fdcc5bf. This is not necessary anymore (as per discussion). We expect that `forward` receives a list of torch tensors when the input is a list of sequences (e.g. numpy arrays).
This reverts commit 2b7a005. Belongs in its own branch.
This reverts commit b54b1ef. Belongs in its own branch.
Since `to_var` is already needed in every `*_step()` function, we don't call it in `infer()` as well. This also makes it easier to override individual step functions without touching `infer`.
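The division of labor described here can be sketched as follows. `Net`, `to_var`, and `train_step` below are simplified, hypothetical stand-ins for illustration, not the project's real API:

```python
import torch

def to_var(X, use_cuda=False):
    # hypothetical stand-in for the to_var helper mentioned above:
    # convert raw input data to a torch tensor
    t = torch.as_tensor(X, dtype=torch.float32)
    return t.cuda() if use_cuda else t

class Net:
    def __init__(self, module):
        self.module_ = module

    def infer(self, x):
        # infer does no conversion; it assumes x is already a tensor
        return self.module_(x)

    def train_step(self, X):
        # each *_step function converts its own input, so overriding
        # a single step never requires touching infer
        return self.infer(to_var(X))

net = Net(torch.nn.Linear(2, 1))
out = net.train_step([[1.0, 2.0]])
```

The point of the split is that `infer` stays a thin wrapper around the module, while conversion concerns live entirely in the step functions.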
This makes it possible to use CUDA models, as the call to `to_tensor` on dummy values no longer fails.
The rationale is that one sequence can be evaluated in one step; processes that rely on statefulness should resort to custom methods.
Also implement an example-sentence printer callback for the training process. Simplify the generation process accordingly.
```python
def on_train_begin(self, net, *args, **kwargs):
    self.hidden = self.module_.init_hidden(self.batch_size)

def on_train_begin(self, *args, **kwargs):
    super().on_train_begin(*args, **kwargs)
    if self.use_cuda:
        self.module_.cuda()
```
Do we still need the cuda call? We now have it in `initialize_module`.
```python
self.hidden = self.module_.init_hidden(self.batch_size)
```
```python
def sample(self, input, temperature=1., hidden=None):
    hidden = self.module_.init_hidden(1) if hidden is not None else hidden
```
Is the logic correct here? As written it reads: don't use the provided hidden state if it is not None.
```python
params = [
    {
        'lr': [10, 20, 30],
    },
]

pl = GridSearchCV(learner, params)
pl.fit(corpus.train[:1000], corpus.train[:1000])
pl.fit(corpus.train[:1000])
```
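For reference, the list-of-dicts grid shape used above is standard scikit-learn convention, since `GridSearchCV` comes from scikit-learn itself. This self-contained sketch substitutes `LogisticRegression` and synthetic data for the project's learner and corpus, with `C` values chosen only for the example:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

X, y = make_classification(n_samples=60, random_state=0)

# same list-of-dicts grid shape as in the snippet above
params = [
    {'C': [0.1, 1.0, 10.0]},
]

gs = GridSearchCV(LogisticRegression(max_iter=500), params, cv=3)
gs.fit(X, y)  # scikit-learn's GridSearchCV scores candidates against y
best = gs.best_params_
```

Note that plain `GridSearchCV.fit` expects `y` for scoring, which is why the original code passed the same data twice for this self-supervised setup.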
Leave this restriction in?
@benjamin-work review again pls
See also #12 (comment)