
Experiment with XLnet? #2

Open
LifeIsStrange opened this issue Aug 14, 2019 · 4 comments

Comments


LifeIsStrange commented Aug 14, 2019

Firstly, I would like to say that reading your paper was fascinating.
Secondly, I would like to thank you for advancing the state of the art in both constituency parsing and dependency parsing (first place on NLP-progress).

I haven't yet read your whole paper, but it seems you used BERT; BERT was state of the art, but it no longer is.
It has been surpassed by significant margins by [XLNet](https://github.com/zihangdai/xlnet).
I think it would be really interesting to train your neural net with XLNet instead of BERT, to see whether you can push the state of the art even further!
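
To make the suggestion concrete, here is a minimal sketch of pulling per-token contextual vectors out of XLNet with the Hugging Face transformers library (the checkpoint name is the published `xlnet-large-cased`; how the vectors would feed into your span scorer is my assumption, not your code):

```python
import torch
from transformers import XLNetModel, XLNetTokenizer

tokenizer = XLNetTokenizer.from_pretrained("xlnet-large-cased")
model = XLNetModel.from_pretrained("xlnet-large-cased")
model.eval()

sentence = "The parser reads contextual embeddings ."
inputs = tokenizer(sentence, return_tensors="pt")
with torch.no_grad():
    # One 1024-dim vector per subword token, analogous to BERT's output.
    hidden = model(**inputs).last_hidden_state  # shape: (1, seq_len, 1024)
# These vectors would stand in for the BERT token representations the parser consumes.
```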

@LifeIsStrange
Author

@DoodleJZ

@DoodleJZ
Owner

YES, thank you for your interest! We are also intrigued by the strong performance of XLNet and will consider trying it later.

@LifeIsStrange
Author

Really nice to hear that!
Could you please update the NLP-progress [1] results if this experiment improves state-of-the-art performance, or let me know so I can update them for you?

[1]
https://github.com/sebastianruder/NLP-progress/blob/master/english/dependency_parsing.md
Thanks in advance.

@LifeIsStrange
Author

Hi @DoodleJZ,
I saw that you ran the experiment with XLNet, got very successful results, and merged them into NLP-progress!
( sebastianruder/NLP-progress@18b8b85 )

Please, let's not stop here!
The world needs high-accuracy dependency/constituency parsing, and you are the one who can improve the state of the art.
You have already beaten the SOTA twice!
Let's do it AGAIN :)

I propose experimenting with two simple, high-return additions on top of XLNet:
Firstly, the state-of-the-art activation function Mish can give real accuracy gains!
https://github.com/digantamisra98/Mish
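
For reference, Mish is just x · tanh(softplus(x)) and is trivial to drop in wherever the network currently applies ReLU; a minimal PyTorch sketch (the module wrapper and usage are mine, only the formula comes from the Mish repo):

```python
import torch
import torch.nn.functional as F

class Mish(torch.nn.Module):
    """Mish activation: x * tanh(softplus(x)) (Misra, 2019)."""
    def forward(self, x):
        return x * torch.tanh(F.softplus(x))

# Hypothetical usage: swap an existing ReLU for Mish in a feed-forward layer.
ffn = torch.nn.Sequential(torch.nn.Linear(1024, 1024), Mish())
```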

Secondly, there are two new state-of-the-art optimizers in town:
RAdam (Rectified Adam) and Lookahead.
And the beauty is that they can work together synergistically.
You should try Ranger, the SOTA optimizer that combines them (see the sketch after the related link below):
https://github.com/lessw2020/Ranger-Deep-Learning-Optimizer
(the Medium blog post linked there is insightful)

Related:
https://github.com/mgrankin/over9000
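
If pulling in the Ranger repo is inconvenient, the Lookahead half is only a few lines to sketch yourself. Below, Adam stands in for RAdam to keep the sketch self-contained, and the wrapper class is mine: it illustrates the Lookahead idea from the paper, not the Ranger repo's actual API.

```python
import torch

class Lookahead:
    """Keep slow weights; every k steps, pull them toward the fast weights."""
    def __init__(self, base_optimizer, k=6, alpha=0.5):
        self.base = base_optimizer
        self.k, self.alpha, self.step_count = k, alpha, 0
        # Snapshot the slow weights from the current parameters.
        self.slow = [
            [p.detach().clone() for p in group["params"]]
            for group in base_optimizer.param_groups
        ]

    def step(self):
        self.base.step()  # inner "fast" update (RAdam/Adam step)
        self.step_count += 1
        if self.step_count % self.k == 0:
            # slow += alpha * (fast - slow); then reset fast to slow.
            for group, slow_group in zip(self.base.param_groups, self.slow):
                for p, slow_p in zip(group["params"], slow_group):
                    slow_p.add_(p.detach() - slow_p, alpha=self.alpha)
                    p.data.copy_(slow_p)

    def zero_grad(self):
        self.base.zero_grad()

# Hypothetical usage with an existing model:
# opt = Lookahead(torch.optim.Adam(model.parameters(), lr=1e-3))
# loss.backward(); opt.step(); opt.zero_grad()
```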
