Guide to fixed-length model perplexity evaluation #5449
Conversation
Codecov Report
@@ Coverage Diff @@
## master #5449 +/- ##
==========================================
+ Coverage 77.82% 78.68% +0.86%
==========================================
Files 141 141
Lines 24608 24608
==========================================
+ Hits 19150 19364 +214
+ Misses 5458 5244 -214
Continue to review full report at Codecov.
Left some comments, but I really like this, and the Research section is exactly the right place for this kind of doc!
This post/guide is inspired by a recent Twitter discussion and a gist on the different ways perplexity can be evaluated, and on the optimal strategy of a strided "sliding window".
I'd welcome feedback both on the guide/writing component and on the theoretical discussion of PPL. As I currently understand it, our language modeling script evaluates over non-overlapping segments rather than with a sliding window.
Relevant to #4415, #4219.
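To make the distinction concrete, here is a minimal sketch (not the actual script; the helper name is hypothetical) of how the two strategies carve up a token sequence. Each window of up to `max_length` tokens is fed to the model for context, but only the tokens not already scored by a previous window contribute to the loss, so every token is counted exactly once:

```python
def strided_windows(n_tokens: int, max_length: int, stride: int):
    """Yield (begin, end, n_scored) triples for strided evaluation.

    Each window covers tokens [begin:end]; the first (end - begin - n_scored)
    tokens serve only as left context, and only the last n_scored tokens
    are scored, so no token's loss is counted twice.
    """
    prev_end = 0
    for begin in range(0, n_tokens, stride):
        end = min(begin + max_length, n_tokens)
        yield begin, end, end - prev_end
        prev_end = end
        if end == n_tokens:
            break

# Non-overlapping segments are the special case stride == max_length:
nonoverlap = list(strided_windows(10, max_length=4, stride=4))
# Sliding window: stride < max_length, so windows overlap:
sliding = list(strided_windows(10, max_length=4, stride=2))

# With the sliding window, later windows see more left context when
# predicting their scored tokens, which generally lowers (improves) the
# measured perplexity:  ppl = exp(total_nll / n_tokens), summing NLL
# over scored tokens only.
```

Smaller strides give each scored token more context at the cost of more forward passes; stride == max_length recovers the non-overlapping behavior described above.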