
Load BERT-esque checkpoints in pytorch formats #20

Open
jonathanbratt opened this issue Sep 13, 2019 · 3 comments
Labels: enhancement (New feature or request), help wanted (Extra attention is needed)

Comments


jonathanbratt (Owner) commented Sep 13, 2019

The original BERT checkpoints released by Google are in a TensorFlow format.
It seems that most of the related work done by other teams uses the PyTorch implementation.
In particular, pre-trained models such as RoBERTa and DistilBERT have been released for PyTorch.

Many of these models are compatible with the BERT architecture, though possibly with different parameters or vocabularies. It would be great to be able to easily load these into RBERT.

jonathanbratt added the help wanted and enhancement labels on Sep 13, 2019
jonathanbratt (Owner, Author) commented:

One possible approach is to write some code that converts PyTorch models into TensorFlow checkpoints; at that point the existing RBERT code should be able to load and use them. I don't know how to do this, though. Anybody with more PyTorch experience want to give this a shot?
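As a starting point, much of the work in such a converter is renaming parameters: the PyTorch BERT implementations name weights differently than Google's original TensorFlow checkpoints. Below is a rough sketch of that name mapping, assuming Hugging Face-style PyTorch parameter names (e.g. `bert.encoder.layer.0.attention.self.query.weight`); the exact names vary by implementation, so treat this as illustrative rather than complete. Note that the tensors themselves also need handling (dense-layer weight matrices are transposed between the two formats), which is not shown here.

```python
import re

def pt_to_tf_name(pt_name: str) -> str:
    """Map a Hugging Face-style PyTorch BERT parameter name to the
    variable naming used in Google's original TensorFlow checkpoints.
    Sketch only: assumes HF-style names; real converters need more cases."""
    name = pt_name
    # layer indices: "layer.3" -> "layer_3"
    name = re.sub(r"layer\.(\d+)", r"layer_\1", name)
    # LayerNorm parameters are called gamma/beta in the TF checkpoints
    name = name.replace("LayerNorm.weight", "LayerNorm.gamma")
    name = name.replace("LayerNorm.bias", "LayerNorm.beta")
    # dense-layer weight matrices are called "kernel" in TF
    # (embedding tables keep the plain "weight"/name form)
    if name.endswith(".weight") and "embeddings" not in name:
        name = name[: -len(".weight")] + ".kernel"
    # PyTorch scopes with dots, TF checkpoints with slashes
    return name.replace(".", "/")

# Example:
# pt_to_tf_name("bert.encoder.layer.0.attention.self.query.weight")
#   -> "bert/encoder/layer_0/attention/self/query/kernel"
```

With a mapping like this, a converter could walk the PyTorch `state_dict()`, rename each tensor, transpose the dense kernels, and write the result out as TF variables with a checkpoint saver.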

jonathanbratt (Owner, Author) commented:

Related: the SciBERT checkpoints have been released in TensorFlow format, so those are already usable in RBERT.
https://github.com/allenai/scibert
