This repository was archived by the owner on Jul 18, 2024. It is now read-only.

Conversation

@aasseman
Contributor

@aasseman aasseman commented Apr 25, 2019

(Depends on PR #18 #17)

Adding an RNN decoder with attention, as per this PyTorch tutorial, itself based on the paper Neural Machine Translation by Jointly Learning to Align and Translate.
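For reference, the additive ("Bahdanau") attention step from that paper can be sketched as below. This is a minimal NumPy illustration with made-up names and shapes, not the PR's actual code:

```python
import numpy as np

def bahdanau_attention(decoder_hidden, encoder_outputs, W_dec, W_enc, v):
    """Additive (Bahdanau) attention over encoder outputs.

    decoder_hidden:  (hidden,)         previous decoder hidden state
    encoder_outputs: (seq_len, hidden) encoder hidden states
    W_dec, W_enc:    (attn, hidden)    learned projection matrices
    v:               (attn,)           learned scoring vector
    """
    # Score each encoder position against the current decoder state.
    scores = np.tanh(encoder_outputs @ W_enc.T + decoder_hidden @ W_dec.T) @ v
    # Softmax-normalize the scores into attention weights.
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    # Context vector: attention-weighted sum of encoder outputs.
    context = weights @ encoder_outputs
    return context, weights
```

The context vector is then concatenated with the decoder input at each step, which is what lets the decoder "look back" at relevant source positions.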

@aasseman aasseman added enhancement New feature or request WIP Work in progress, not ready for merge yet. labels Apr 25, 2019
@aasseman aasseman requested a review from tkornuta-ibm April 25, 2019 01:59
@aasseman
Contributor Author

aasseman commented Apr 25, 2019

Seems to work well enough for now to run wikitext_language_modeling_encoder_attndecoder.yml, which is a basic copy task.
Supports only a single-layer GRU for now. Will probably not try extending that. Are multiple layers useful?

Will try to add the translation task from the PyTorch tutorial, so as to compare apples to apples and check whether my implementation is doing what it should.

@aasseman aasseman changed the title (depends on PR #13) (WIP) RNN Decoder with attention (WIP) RNN Decoder with attention Apr 26, 2019
@tkornuta-ibm
Contributor

This pull request introduces 1 alert when merging df60fb4 into aed980d - view on LGTM.com

new alerts:

  • 1 for Unused local variable

Comment posted by LGTM.com

@aasseman aasseman changed the title (WIP) RNN Decoder with attention (Depends on PR #18 #17)(WIP) RNN Decoder with attention Apr 26, 2019
@tkornuta-ibm
Contributor

This pull request introduces 1 alert when merging 0138ef2 into aed980d - view on LGTM.com

new alerts:

  • 1 for Unused local variable

Comment posted by LGTM.com

@aasseman aasseman changed the title (Depends on PR #18 #17)(WIP) RNN Decoder with attention (Depends on PR #18 #17) RNN Decoder with attention Apr 26, 2019
@aasseman aasseman marked this pull request as ready for review April 26, 2019 17:56
@aasseman aasseman removed the WIP Work in progress, not ready for merge yet. label Apr 26, 2019
@aasseman
Contributor Author

Ready for review.
Got good results on the translation task.
One thing that could be checked (visually), if time permits, is the attention matrix of the GRU decoder, to make really sure that we get results similar to the paper's.
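Such a visual check can be sketched with a matplotlib heatmap; the helper below is illustrative (names and paths are made up), not part of the PR:

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # headless-safe backend for saving to file
import matplotlib.pyplot as plt

def plot_attention(weights, input_tokens, output_tokens, path="attention.png"):
    """Save a heatmap of decoder attention weights.

    weights: (len(output_tokens), len(input_tokens)) array, one row of
             attention weights per decoding step.
    """
    fig, ax = plt.subplots()
    im = ax.imshow(weights, cmap="viridis")
    ax.set_xticks(range(len(input_tokens)))
    ax.set_xticklabels(input_tokens, rotation=90)
    ax.set_yticks(range(len(output_tokens)))
    ax.set_yticklabels(output_tokens)
    fig.colorbar(im)
    fig.savefig(path, bbox_inches="tight")
    plt.close(fig)
```

For a faithful translation, the heatmap should show a roughly diagonal band (with local reorderings), as in the figures of the Bahdanau et al. paper.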

Contributor

@tkornuta-ibm tkornuta-ibm left a comment


Please clean up that wikitext_language_modeling_seq2seq.yml file.

Aside from that, some of the changes were already merged... why is GitHub not seeing that? Are you squashing the commits or something?

hidden_size: 50
num_layers: 1
use_logsoftmax: False
<<<<<<< Updated upstream
Contributor


Hmmmm, I guess the "Updated upstream" version is the right one... but I'm not sure.

Contributor Author


I guess I got mixed up with my stashes. I checked out the develop version of the two files in question, and that solved it.

@aasseman aasseman requested a review from tkornuta-ibm April 27, 2019 01:25
@tkornuta-ibm tkornuta-ibm merged commit 02a9179 into IBM:develop Apr 27, 2019