@dirkneuhaeuser Thanks for making the world a better place; your classifier is extremely helpful for natural language understanding.
Unfortunately, 91% accuracy is still not great enough for widespread use. I actively follow the evolution of transformers.
Your use of BERT was a great choice at the time, since it is a strong baseline (I'll assume you already use BERT-large).
However, there are now significantly better transformers than BERT, which generally bring a few percentage points of accuracy gain, and that difference can be decisive for real-world use.
As such, I would love it if you could replace your BERT implementation with an XLNet one (the best transformer out there) or with https://github.com/microsoft/MPNet (MPNet is an evolution of XLNet, although it might be significantly slower to train; XLNet, on the other hand, is relatively comparable to BERT in training time).
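A minimal sketch of what the swap could look like, assuming the classifier is built on Hugging Face `transformers` (which BERT fine-tuning setups usually are). The tiny config, vocabulary size, and label count below are illustrative only, so the sketch runs without downloading weights; a real swap would instead load the pretrained `xlnet-large-cased` (or `microsoft/mpnet-base`) checkpoint:

```python
import torch
from transformers import XLNetConfig, XLNetForSequenceClassification

# Illustrative tiny config so the sketch runs offline. A real replacement would be:
#   XLNetForSequenceClassification.from_pretrained("xlnet-large-cased", num_labels=3)
config = XLNetConfig(
    vocab_size=100,  # hypothetical toy vocabulary
    d_model=32,
    n_layer=2,
    n_head=2,
    d_inner=64,
    num_labels=3,    # hypothetical number of classes
)
model = XLNetForSequenceClassification(config)

# Dummy batch: one sequence of 8 token ids
input_ids = torch.randint(0, config.vocab_size, (1, 8))
logits = model(input_ids).logits
print(logits.shape)  # torch.Size([1, 3]) — one score per class
```

Because `XLNetForSequenceClassification` exposes the same `from_pretrained`/`forward` interface as the BERT classes, the rest of a typical fine-tuning loop should carry over with only the model and tokenizer names changed.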
Another, lesser-known and complementary way to reach higher accuracy would be to use a better activation function (Mish), a better optimizer (RAdam), possibly combined with optimizer wrappers such as Lookahead, and methods such as gradient centralization. Each of these generally brings a ~1-2% accuracy gain.
cf. https://github.com/lessw2020/Best-Deep-Learning-Optimizers
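As a rough illustration of the Mish + RAdam part of that suggestion, using only what ships in recent PyTorch (`nn.Mish` and `torch.optim.RAdam`); the classification head, dimensions, and data below are made up, and Lookahead / gradient centralization would need a third-party implementation such as the repo linked above:

```python
import torch
from torch import nn

# Hypothetical classification head on top of pooled transformer features
head = nn.Sequential(
    nn.Linear(64, 32),
    nn.Mish(),         # Mish activation instead of ReLU/GELU
    nn.Linear(32, 3),  # 3 illustrative classes
)

# RAdam (rectified Adam) as the optimizer, as suggested above
optimizer = torch.optim.RAdam(head.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

# One dummy training step on random "pooled features"
features = torch.randn(16, 64)
labels = torch.randint(0, 3, (16,))

optimizer.zero_grad()
loss = loss_fn(head(features), labels)
loss.backward()
optimizer.step()
print(float(loss))  # finite cross-entropy, roughly ln(3) before any training
```

Both pieces are drop-in changes, so they can be tested independently of the BERT-vs-XLNet question.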
As for XLNet, it can in many cases bring a +5% accuracy gain over BERT-large.