Add missing LM SOTA result + # params + prev SOTA #195

cwenner · 2018-12-31T13:39:24Z

Add missing LM ensemble which is SOTA for PTB.
Add second-in-line LM SOTA for strict interpretation.
Add number of params for LM results.

(unsure why it lists commits that have already been merged)

For better comparison of results wrt params. Also normalizes layer qualification.

Fix cited paper for Graves et al and include their primary result; previously only baseline.

Add results for the paper, Adaptive Input Representations for Neural Language Modeling

* Add results for LM paper, Transformer-XL
* Remove superfluous vbar and normalize header
* Add missing authors to item
* Make under-review mark more discrete

… into sebastianruder-master

Extend LM results (sebastianruder#194)

Add SOTA "ensemble" model for PTB. Add number of params for PTB and WT2

Add second-in-line LM SOTA for a strict interpretation of competing results.

cwenner · 2018-12-31T14:24:21Z

@sebastianruder - Not promising anything but would you object to any of these additions?

* Split LM dynamic evaluation results into separate tables. IMO they are not as relevant for downstream tasks, and it may be more useful to compare results against those with the same evaluation.
* Add examples of generated texts for some of the LMs.
* Possibly add some less popular LM datasets like PANAMA.
* If there are datasets which still seem relevant, I was thinking of listing embedding comparisons, e.g. the Google analogy test set. Would these work under the Text Similarity page or should they get a new page?

At some point, these tables were generated from yaml data. These data files do not appear to exist anymore.

sebastianruder · 2019-01-01T10:56:21Z

These all sound great. Thanks for your efforts, @cwenner! For the word similarity and analogy datasets, let's add them under the text similarity page for now. 👍

cwenner · 2019-01-02T11:58:35Z

@sebastianruder - Okay, thanks. What's the goal for completeness vs. relevance of the lists? E.g. the more the better; prune all but the top-1 or top-2 non-dominated results (e.g. for LM, params vs. ppl); or keep seminal results for comparison?

Btw this is an unmerged PR.

sebastianruder · 2019-01-02T12:01:17Z

Yep, ~ top-2 state-of-the-art results with seminal methods for comparison.
Happy to merge this one. Other changes can go on a new PR.
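
For reference, the "non-dominated" pruning raised earlier in this thread could be sketched roughly as below. This is only one possible reading of the criterion, assuming two axes (parameter count and perplexity, lower is better for both). The `Result` type, the helper names, and all numbers are hypothetical and are not taken from the tables in this PR.

```python
from typing import NamedTuple


class Result(NamedTuple):
    name: str        # hypothetical model label
    params_m: float  # parameter count in millions (lower is better)
    ppl: float       # test perplexity (lower is better)


def dominates(a: Result, b: Result) -> bool:
    """True if a is no worse than b on both axes and strictly better on one."""
    no_worse = a.params_m <= b.params_m and a.ppl <= b.ppl
    strictly_better = a.params_m < b.params_m or a.ppl < b.ppl
    return no_worse and strictly_better


def non_dominated(results: list[Result]) -> list[Result]:
    """Keep the results that no other result dominates, sorted by perplexity."""
    front = [r for r in results if not any(dominates(o, r) for o in results)]
    return sorted(front, key=lambda r: r.ppl)


# Illustrative numbers only; not real results from any paper.
example = [
    Result("model-a", 24.0, 58.0),
    Result("model-b", 24.0, 56.0),   # same size as model-a, lower perplexity
    Result("model-c", 100.0, 52.0),  # larger, but best perplexity
    Result("model-d", 120.0, 55.0),  # larger and worse than model-c
]

front = non_dominated(example)
print([r.name for r in front])
```

Seminal methods kept for comparison would need to be exempted from this filter by hand, since they are usually dominated by newer results.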

sebastianruder · 2019-01-03T11:07:07Z

Nice work! I'll merge this now. Feel free to submit any other changes in a new PR.

cwenner · 2019-01-04T13:11:05Z

I think that's easier - thanks!

cwenner added 24 commits December 29, 2018 03:49

Include explanation for dynamic evaluation

7f2cbf2

Add results for included smaller sota models

6673ba3

For better comparison of results wrt params. Also normalizes layer qualification.

Fix order of new item

262d71c

Add code repositories for LM results

cfcbc54

Fix listed results for Graves et al LM

30b7dde

Fix cited paper for Graves et al and include their primary result; previously only baseline.

Correct author name

1f504be

Add results for LM paper, Transformer-XL

47d6cee

Remove superfluous vbar and normalize header

b6ff008

Add missing authors to item

72b8861

Make under-review mark more discrete

0453828

Add another SOTA LM result

246187f

Add results for the paper, Adaptive Input Representations for Neural Language Modeling

Add results for LM paper, Transformer-XL (sebastianruder#193)

fe8effc

* Add results for LM paper, Transformer-XL
* Remove superfluous vbar and normalize header
* Add missing authors to item
* Make under-review mark more discrete

Include explanation for dynamic evaluation (sebastianruder#190)

1602685

Resolve merge conflicts

103abb4

Add explanation of dynamic evaluation

83d547c

Add code repositories for LM results

c5f986f

Fix listed results for Grave et al. LM

1ca7f03

Add another SOTA LM result

8bd5303

Add results for the paper, Adaptive Input Representations for Neural Language Modeling

Merge branch 'master' of https://github.com/sebastianruder/NLP-progress…

f37a2a5

… into sebastianruder-master

Fix alignment in table with added column

51e575d

Add LM dataset: 1 billion words

4261771

Merge pull request #4 from sebastianruder/master

27e7a85

Extend LM results (sebastianruder#194)

Add missing SOTA LM result as well as param counts

780b53b

Add SOTA "ensemble" model for PTB. Add number of params for PTB and WT2

Add previous LM SOTA

bcac5b0

Add second-in-line LM SOTA for a strict interpretation of competing results.

Remove left-over import of removed file

07bfc24

At some point, these tables were generated from yaml data. These data files do not appear to exist anymore.

cwenner added 2 commits January 2, 2019 12:14

Add additional relevant size for LM Transformer-XL

11a6d60

Add near-SOTA result for one-billion word LM

89c0a6c

Reorder enwiki8 results

b19d3bb

sebastianruder merged commit e3e7939 into sebastianruder:master Jan 3, 2019
