New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Sequence-to-Sequence Learning as Beam-Search Optimization, Wiseman+, EMNLP'16 #80

Open

AkihikoWatanabe opened this issue Dec 30, 2017 · 1 comment

Labels

BeamSearch Neural NLP

Owner

AkihikoWatanabe commented Dec 30, 2017

https://arxiv.org/pdf/1606.02960.pdf

AkihikoWatanabe added BeamSearch Neural NLP labels

Owner Author

AkihikoWatanabe commented Dec 30, 2017 •

edited

seq2seqを学習する際には、gold-history（これまで生成した単語がgoldなものと一緒）を使用し、次に続く単語の尤度を最大化するように学習するが、これには、

Explosure Bias: test時ではtraining時と違いgold historyを使えないし、training時には過去に生成した単語に誤りがあるみたいな状況がない
Loss-Evaluation Mismatch: training時は単語レベルのlossを使うが、だいたいはsentence-levelのmetrics (BLEUなど)を改善したい
Label Bias: 各タイムステップでの単語の生起確率が局所的に正規化され、誤ったhistoryに続く単語がgoldな履歴に続く単語と同じ量（の確率？）を受け取ってしまう

これらを解決するために、targetの"sequence"に対してスコア（確率ではない）を与えるようなseq2seqモデルを提案し、訓練方法として、beam search optimization（training時のlossとしてbeam searchの結果得られるerrorを用いる）を提案。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment