Memoize beam hypos on the way #1267

uralik · 2018-11-02T23:55:11Z

The previous way of getting the current hypos was terrible (I wrote it, so yeah, terrible). With n-gram blocking set to 7, the speed on Twitter looked like this:

33s elapsed: {'exs': 32, '%done': '0.31%', 'time_left': '10717s', 'accuracy': 0, 'f1': 0.08652, 'bleu': 5.141e-06, 'token_acc': 0.3022, 'loss': 4.142, 'ppl': 62.91}
60s elapsed: {'exs': 64, '%done': '0.62%', 'time_left': '9728s', 'accuracy': 0, 'f1': 0.07378, 'bleu': 5.256e-06, 'token_acc': 0.3022, 'loss': 4.177, 'ppl': 65.2}
109s elapsed: {'exs': 96, '%done': '0.92%', 'time_left': '11792s', 'accuracy': 0, 'f1': 0.08872, 'bleu': 0.004175, 'token_acc': 0.3049, 'loss': 4.102, 'ppl': 60.47}
159s elapsed: {'exs': 128, '%done': '1.23%', 'time_left': '12795s', 'accuracy': 0, 'f1': 0.08498, 'bleu': 0.003132, 'token_acc': 0.3106, 'loss': 4.014, 'ppl': 55.35}

With the new hyps cache it looks like this (128 examples in 9s instead of 159s):

2s elapsed: {'exs': 32, '%done': '0.31%', 'time_left': '704s', 'accuracy': 0, 'f1': 0.08652, 'bleu': 5.141e-06, 'token_acc': 0.3022, 'loss': 4.142, 'ppl': 62.91}
6s elapsed: {'exs': 96, '%done': '0.92%', 'time_left': '733s', 'accuracy': 0, 'f1': 0.08872, 'bleu': 0.004175, 'token_acc': 0.3049, 'loss': 4.102, 'ppl': 60.47}
9s elapsed: {'exs': 128, '%done': '1.23%', 'time_left': '767s', 'accuracy': 0, 'f1': 0.08498, 'bleu': 0.003132, 'token_acc': 0.3106, 'loss': 4.014, 'ppl': 55.35}
11s elapsed: {'exs': 160, '%done': '1.54%', 'time_left': '758s', 'accuracy': 0, 'f1': 0.07828, 'bleu': 0.002505, 'token_acc': 0.2996, 'loss': 4.15, 'ppl': 63.41}

my bad
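For readers who want the idea without opening the diff, here is a minimal sketch of memoizing beam hypotheses, under stated assumptions. It is not the code from this PR: `backtrack_hypotheses`, `HypothesisCache`, and `completes_repeated_ngram` are hypothetical names, and hypotheses are plain Python lists of token ids rather than tensors. The slow path rebuilds every live hypothesis by walking backpointers at each step; the cached path keeps the partial hypotheses around and extends the parent's entry by one token, so the n-gram blocking check can read them directly.

```python
from typing import List


def backtrack_hypotheses(
    tokens: List[List[int]], backpointers: List[List[int]]
) -> List[List[int]]:
    """Slow path: rebuild each live hypothesis by walking backpointers.

    tokens[t][i] is the token in beam slot i at step t, and
    backpointers[t][i] is the slot at step t - 1 it was extended from.
    """
    last = len(tokens) - 1
    hyps = []
    for beam_id in range(len(tokens[last])):
        seq = []
        idx = beam_id
        for t in range(last, -1, -1):
            seq.append(tokens[t][idx])
            idx = backpointers[t][idx]
        hyps.append(seq[::-1])
    return hyps


class HypothesisCache:
    """Cached path: keep the partial hypotheses and extend them each step."""

    def __init__(self, beam_size: int, start_token: int) -> None:
        self.partial_hyps = [[start_token] for _ in range(beam_size)]

    def advance(
        self, parent_ids: List[int], new_tokens: List[int]
    ) -> List[List[int]]:
        # Each new slot copies its parent's cached hypothesis and appends
        # one token; no walk over the backpointer history is needed.
        self.partial_hyps = [
            self.partial_hyps[parent] + [tok]
            for parent, tok in zip(parent_ids, new_tokens)
        ]
        return self.partial_hyps


def completes_repeated_ngram(hyp: List[int], candidate: int, n: int) -> bool:
    """Return True if appending candidate would repeat an n-gram in hyp."""
    if len(hyp) < n:
        return False  # hyp contains no complete n-gram yet
    new_ngram = tuple(hyp[len(hyp) - (n - 1):] + [candidate])
    seen = {tuple(hyp[i:i + n]) for i in range(len(hyp) - n + 1)}
    return new_ngram in seen
```

In a decoding loop, `advance` would be called once per time step with the selected parent slots and tokens, and `completes_repeated_ngram` would be applied to each cached hypothesis before scoring candidates. Copying the parent list is still linear in its length, so the saving in this sketch comes from not re-walking the whole backpointer chain for every hypothesis at every step.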

* Dab.
* Simple code movement. Nothing generalizes.
* More code cut-paste.
* Docstring formatting.
* Moderate reorganization; minor docstring formatting.
* A little bit closer.
* Simplify slightly.
* Opinionated formatting.
* Beam search seems to work for transformer, broken for seq2seq.
* Finally move the time iteration into decode for seq2seq.
* Fix some bugs around loading saved models.
* Support skipping generation for faster ppl-only validation.
* Move --input-dropout to seq2seq
* Support incremental decoding.
* Name be changin'
* Support multigpu. Integrate in Abi's fixes.
* Fix ranked candidates
* Dead code society.
* warn_once
* Lint.
* Forgot to reorder and select incremental states.
* More multigpu workarounds.
* lint.
* Be consistent in initialization.
* Make sure outputlayer has backwards compatibility.
* Bring #1267 into this branch.
* Add fast termination in greedy_decode. Bring back _init_cuda_buffer
* Fix bugs in hogwild
* Address review comments.
* Whoops forgot a cuda call.
* Simplify output layer since binary compatibility is no longer needed.
* Seq2seq version bump. (#1282)
* Seq2seq version bump.

uralik added 2 commits November 2, 2018 19:45

memoize hypos on the way

3a0a269

hey travis

009b8c8

uralik requested review from stephenroller and alexholdenmiller November 2, 2018 23:55

facebook-github-bot added the CLA Signed label Nov 2, 2018

uralik added 2 commits November 2, 2018 20:15

please travis

9d75072

remove as

48a9189

alexholdenmiller approved these changes Nov 3, 2018

uralik merged commit 6251f6b into master Nov 3, 2018

uralik deleted the current-hypos branch November 3, 2018 14:54

stephenroller added a commit that referenced this pull request Nov 5, 2018

Bring #1267 into this branch.

17e6759

stephenroller added a commit that referenced this pull request Nov 13, 2018

Bring #1267 into this branch.

94fcdf9

stephenroller added a commit that referenced this pull request Nov 15, 2018

Bring #1267 into this branch.

4492e2b
