
Input-feeding Approach #33

Closed
TinaB19 opened this issue Jun 12, 2017 · 4 comments

TinaB19 commented Jun 12, 2017

Thank you very much for the awesome work. I need a clarification about the decoder part of seq2seq-translation.
# Combine embedded input word and last context, run through RNN
rnn_input = torch.cat((word_embedded, last_context.unsqueeze(0)), 2)

Is the above code an implementation of the input-feeding approach from the paper "Effective Approaches to Attention-based Neural Machine Translation"?

spro (Owner) commented Jun 12, 2017

Yes it is, though looking back at it, I'm missing one layer between the context vector c_t and the softmax layer that creates the "attentional hidden state" ~h_t, which is what they use for input feeding.
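
For concreteness, here is a minimal runnable sketch of that missing layer, assuming rnn_output is the decoder GRU output h_t and context is the attention context c_t; the layer names and sizes are only illustrative, not the tutorial's actual code:

import torch
import torch.nn as nn
import torch.nn.functional as F

hidden_size, output_size, batch_size = 256, 10000, 4
rnn_output = torch.randn(batch_size, hidden_size)  # decoder GRU output h_t
context = torch.randn(batch_size, hidden_size)     # attention context c_t

concat = nn.Linear(hidden_size * 2, hidden_size)   # W_c
out = nn.Linear(hidden_size, output_size)          # W_s

# attentional hidden state ~h_t = tanh(W_c [c_t ; h_t])  (Luong et al., 2015)
attentional_hidden = torch.tanh(concat(torch.cat((rnn_output, context), 1)))
# the prediction is made from ~h_t, and ~h_t is also the vector that
# input feeding concatenates with the next input word embedding
output = F.log_softmax(out(attentional_hidden), dim=1)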

TinaB19 (Author) commented Jun 12, 2017

It would be great if you could add it to the tutorial later. Thank you very much.

TinaB19 (Author) commented Jun 13, 2017

I just saw seq2seq-translation-batched, which has:

concat_input = torch.cat((rnn_output, context), 1)
concat_output = F.tanh(self.concat(concat_input))

So I guess in this case we should concatenate concat_output with embedded at the next time step and then feed that into the gru. Is this correct?

spro (Owner) commented Jun 14, 2017

Correct.
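
For anyone reading along, here is a minimal sketch of one decoder time step with input feeding. It reuses the variable names from the batched tutorial (embedded, context, concat_output), but the module and argument names are only illustrative, and attn is an assumed attention module that returns a [batch, hidden] context vector:

import torch
import torch.nn as nn
import torch.nn.functional as F

class InputFeedingDecoderStep(nn.Module):
    def __init__(self, hidden_size, output_size):
        super().__init__()
        self.embedding = nn.Embedding(output_size, hidden_size)
        # the GRU input is the word embedding concatenated with ~h_{t-1}
        self.gru = nn.GRU(hidden_size * 2, hidden_size)
        self.concat = nn.Linear(hidden_size * 2, hidden_size)
        self.out = nn.Linear(hidden_size, output_size)

    def forward(self, input_token, last_concat_output, last_hidden, encoder_outputs, attn):
        embedded = self.embedding(input_token).unsqueeze(0)            # [1, batch, hidden]
        # input feeding: concatenate the previous attentional hidden state ~h_{t-1}
        rnn_input = torch.cat((embedded, last_concat_output.unsqueeze(0)), 2)
        rnn_output, hidden = self.gru(rnn_input, last_hidden)
        rnn_output = rnn_output.squeeze(0)                             # [batch, hidden]
        context = attn(rnn_output, encoder_outputs)                    # [batch, hidden]
        # attentional hidden state ~h_t = tanh(W_c [c_t ; h_t])
        concat_output = torch.tanh(self.concat(torch.cat((rnn_output, context), 1)))
        output = F.log_softmax(self.out(concat_output), dim=1)
        # concat_output (~h_t) is returned so it can be fed back in at the next step
        return output, concat_output, hidden

The classes in the tutorial differ in their details, so treat this only as an illustration of how concat_output flows back into the GRU input at the next time step.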

spro closed this as completed Jun 19, 2017