How to train reversed model for RL model #10

bobbercheng · 2018-03-07T17:51:30Z

You mentioned "
When training with policy gradient (pg)

you may need a reversed model

the reversed model is also trained by cornell movie-dialogs dataset, but with source and target reversed.
"
Except downloading pre-trained reversed model, could you please tell how to rain it?

Thank you a lot.

pochih · 2018-03-09T08:00:38Z

For instance, if a sample in training set is ('How is the weather?', 'It's sunny.').
To train a reversed model, you need to reverse the data into ('It's sunny.', 'How is the weather?').
That means you need to predict the former sentence using the later sentence.

pochih added the question label Mar 9, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to train reversed model for RL model #10

How to train reversed model for RL model #10

bobbercheng commented Mar 7, 2018

pochih commented Mar 9, 2018

How to train reversed model for RL model #10

How to train reversed model for RL model #10

Comments

bobbercheng commented Mar 7, 2018

pochih commented Mar 9, 2018