Fix hidden_layer size for one-directional decoder #99

dieuwkehupkes · 2017-11-17T17:56:23Z

The hidden layer size of the decoder was given by

`hidden_size * 2 if bidirectional else 1`

which resulted in a dimensionality error when `bidirectional` was set to `False`. This PR corrects it to

`hidden_size * 2 if bidirectional else hidden_size`
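To make the difference concrete, here is a minimal sketch of the two expressions as plain Python. The function names are hypothetical and are not part of the library; only the two conditional expressions come from this PR. The factor of 2 is presumably there because a bidirectional encoder concatenates its forward and backward states, but that is an assumption rather than something stated in this thread.

```python
def buggy_decoder_hidden_size(hidden_size: int, bidirectional: bool) -> int:
    """Original expression: falls back to 1 when bidirectional is False."""
    # Parsed as (hidden_size * 2) if bidirectional else 1
    return hidden_size * 2 if bidirectional else 1


def fixed_decoder_hidden_size(hidden_size: int, bidirectional: bool) -> int:
    """Corrected expression: falls back to hidden_size when bidirectional is False."""
    return hidden_size * 2 if bidirectional else hidden_size


if __name__ == "__main__":
    hidden_size = 128  # illustrative value, not taken from the PR
    for bidirectional in (True, False):
        print(
            f"bidirectional={bidirectional}: "
            f"buggy={buggy_decoder_hidden_size(hidden_size, bidirectional)}, "
            f"fixed={fixed_decoder_hidden_size(hidden_size, bidirectional)}"
        )
```

Both versions agree in the bidirectional case. They differ only when `bidirectional` is `False`, where the original expression yields a decoder hidden size of 1 instead of `hidden_size`.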

* Modified parameter order of DecoderRNN.forward (#85)
* Updated TopKDecoder (#86)
* Fixed topk decoder.
* Use torchtext from pipy (#87)
* Use torchtext from pipe.
* Fixed torch text sorting order.
* attention is not required when only using teacher forcing in decoder (#90)
* attention is not required when only using teacher forcing in decoder
* Updated docs and version.
* Fixed code style.

kylegao91 · 2017-11-20T14:01:58Z

Good catch! Is train_model.py a copy of sample.py? I think the name makes sense, but let's keep sample.py for now. Can you remove train_model.py from the commit?

dieuwkehupkes · 2017-11-20T14:07:50Z

Hey, oh sorry! It is based on it, but I added some arguments so that I can run it more easily myself. I thought I had removed it from the commit in the branch, but apparently I didn't, sorry. I'll remove it!

Hidden layer size of the decoder was given as `hidden_size * 2 if bidirectional else 1`, resulting in a dimensionality error for non-bidirectional decoders. Changed `1` to `hidden_size`.

kylegao91

Thanks!

* Modified parameter order of DecoderRNN.forward (#85)
* Updated TopKDecoder (#86)
* Fixed topk decoder.
* Use torchtext from pipy (#87)
* Use torchtext from pipe.
* Fixed torch text sorting order.
* attention is not required when only using teacher forcing in decoder (#90)
* attention is not required when only using teacher forcing in decoder
* Updated docs and version.
* Fixed code style.
* bugfix (#92) Fixed field arguments validation.
* Removed `initial_lr` when resuming optimizer with scheduler. (#95)
* shuffle the training data (#97)
* 0.1.5 (#91)
* Modified parameter order of DecoderRNN.forward (#85)
* Updated TopKDecoder (#86)
* Fixed topk decoder.
* Use torchtext from pipy (#87)
* Use torchtext from pipe.
* Fixed torch text sorting order.
* attention is not required when only using teacher forcing in decoder (#90)
* attention is not required when only using teacher forcing in decoder
* Updated docs and version.
* Fixed code style.
* shuffle the training data
* fix example of inflate function in TopKDecoer.py (#98)
* fix example of inflate function in TopKDecoer.py
* Fix hidden_layer size for one-directional decoder (#99)
* Fix hidden_layer size for one-directional decoder Hidden layer size of the decoder was given `hidden_size * 2 if bidirectional else 1`, resulting in a dimensionality error for non-bidirectional decoders. Changed `1` to `hidden_size`.
* Adapt load to allow CPU loading of GPU models (#100)
* Adapt load to allow CPU loading of GPU models Add storage parameter to torch.load to allow loading models on a CPU that are trained on the GPU, depending on availability of cuda.
* Fix wrong parameter use on DecoderRNN (#103)
* Fix wrong parameter use on DecoderRNN
* Upgrade to pytorch-0.3.0 (#111)
* Upgrade to pytorch-0.3.0
* Use pytorch 3.0 in travis env.
* Make sure tensor contiguous when attention's not used. (#112)
* Implementing the predict_n method. Using the beam search outputs it returns several seqs for a given seq (#116)
* Adding a predictor method to return n predicted seqs for a src_seq input (intended to be used along to Beam Search using TopKDecoder)
* Checkpoint after batches not epochs (#119)
* Pytorch 0.4 (#134)
* add contiguous call to tensor (#127) when attention is turned off, pytorch (well, 0.4 at least) gets angry about calling view on a non-contiguous tensor
* Fixed shape documentation (#131)
* Update to pytorch-0.4
* Remove pytorch manual install in travis.
* Allow using pre-trained embedding (#135)
* updated docs

kylegao91 changed the base branch from master to develop November 20, 2017 14:02

dieuwkehupkes and others added 4 commits November 20, 2017 15:08

Fix hidden_layer size for one-directional decoder

328e9ef

Hidden layer size of the decoder was given as `hidden_size * 2 if bidirectional else 1`, resulting in a dimensionality error for non-bidirectional decoders. Changed `1` to `hidden_size`.

Merge branch 'develop' into fix_sample

ea426b0

Merge branch 'develop' into fix_sample

6c2ff16

Merge branch 'develop' into fix_sample

4a413ce

kylegao91 approved these changes Nov 20, 2017

kylegao91 merged commit 626842c into IBM:develop Nov 20, 2017